Human population genetic structure and diversity inferred from polymorphic L1 (LINE-1) and Alu insertions

David J. Witherspoon, University of Utah Health Sciences
E. E. Marchani, University of Utah Health Sciences
W. S. Watkins, University of Utah Health Sciences
C. T. Ostler, University of Utah Health Sciences
S. P. Wooding, University of Utah Health Sciences
B. A. Anders, Louisiana State University
J. D. Fowlkes, Louisiana State University
S. Boissinot, Queens College, City University of New York
A. V. Furano, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
D. A. Ray, Louisiana State University
A. R. Rogers, The University of Utah
M. A. Batzer, Louisiana State University
L. B. Jorde, University of Utah Health Sciences

Abstract

Background/Aims: The L1 retrotransposable element family is the most successful self-replicating genomic parasite of the human genome. L1 elements drive replication of Alu elements, and both have had far-reaching impacts on the human genome. We use L1 and Alu insertion polymorphisms to analyze human population structure. Methods: We genotyped 75 recent, polymorphic L1 insertions in 317 individuals from 21 populations in sub-Saharan Africa, East Asia, Europe and the Indian subcontinent. This is the first sample of L1 loci large enough to support detailed population genetic inference. We analyzed these data in parallel with a set of 100 polymorphic Alu insertion loci previously genotyped in the same individuals. Results and Conclusion: The data sets yield congruent results that support the recent African origin model of human ancestry. A genetic clustering algorithm detects clusters of individuals corresponding to continental regions. The number of loci sampled is critical: with fewer than 50 typical loci, structure cannot be reliably discerned in these populations. The inclusion of geographically intermediate populations (from India) reduces the distinctness of clustering. Our results indicate that human genetic variation is neither perfectly correlated with geographic distance (purely clinal) nor independent of distance (purely clustered), but a combination of both: stepped clinal. Copyright © 2006 S. Karger AG.