Large-scale analysis of the Alu Ya5 and Yb8 subfamilies and their contribution to human genomic diversity

Marion L. Carroll, LSUHSC School of Medicine
Astrid M. Roy-Engel, Tulane University School of Public Health and Tropical Medicine
Son V. Nguyen, LSUHSC School of Medicine
Abdel Halim Salem, LSUHSC School of Medicine
Erika Vogel, Tulane University School of Public Health and Tropical Medicine
Bethaney Vincent, LSUHSC School of Medicine
Jeremy Myers, LSUHSC School of Medicine
Zahid Ahmad, LSUHSC School of Medicine
Lan Nguyen, LSUHSC School of Medicine
Mimi Sammarco, LSUHSC School of Medicine
W. Scott Watkins, University of Utah Health Sciences
Jurgen Henke, Institut für Blutgruppenforschung
Wojciech Makalowski, National Library of Medicine (NLM)
Lynn B. Jorde, University of Utah Health Sciences
Prescott L. Deininger, Tulane University School of Public Health and Tropical Medicine
Mark A. Batzer, LSUHSC School of Medicine

Abstract

We have utilized computational biology to screen GenBank for the presence of recently integrated Ya5 and Yb8 Alu family members. Our analysis identified 2640 Ya5 Alu family members and 1852 Yb8 Alu family members from the draff sequence of the human genome. We selected a set of 475 of these elements for detailed analyses. Analysis of the DNA sequences from the individual Alu elements revealed a low level of random mutations within both subfamilies consistent with the recent origin of these elements within the human genome. Polymerase chain reaction assays were used to determine the phylogenetic distribution and human genomic variation associated with each Alu repeat. Over 99% of the Ya5 and Yb8 Alu family members were restricted to the human genome and absent from orthologous positions within the genomes of several non-human primates, confirming the recent origin of these Alu subfamilies in the human genome. Approximately 1% of the analyzed Ya5 and Yb8 Alu family members had integrated into previously undefined repeated regions of the human genome. Analysis of mosaic Yb8 elements suggests gene conversion played an important role in generating sequence diversity among these elements. Of the 475 evaluated elements, a total of 106 of the Ya5 and Yb8 Alu family members were polymorphic for insertion presence/absence within the genomes of a diverse array of human populations. The newly identified Alu insertion polymorphisms will be useful tools for the study of human genomic diversity. © 2001 Academic Press.