| Literature DB >> 28770012 |
Julie Feusier1, David J Witherspoon1, W Scott Watkins1, Clément Goubert1, Thomas A Sasani1, Lynn B Jorde1.
Abstract
BACKGROUND: Polymorphic human Alu elements are excellent tools for assessing population structure, and new retrotransposition events can contribute to disease. Next-generation sequencing has greatly increased the potential to discover Alu elements in human populations, and various sequencing and bioinformatics methods have been designed to tackle the problem of detecting these highly repetitive elements. However, current techniques for Alu discovery may miss rare, polymorphic Alu elements. Combining multiple discovery approaches may provide a better profile of the polymorphic Alu mobilome. AluYb8/9 elements have been a focus of our recent studies as they are young subfamilies (~2.3 million years old) that contribute ~30% of recent polymorphic Alu retrotransposition events. Here, we update our ME-Scan methods for detecting Alu elements and apply these methods to discover new insertions in a large set of individuals with diverse ancestral backgrounds.Entities:
Keywords: Ancestry informative markers; Human ancestry; Mobilome; Polymorphism; Population genetics; Retrotransposon
Year: 2017 PMID: 28770012 PMCID: PMC5531096 DOI: 10.1186/s13100-017-0093-0
Source DB: PubMed Journal: Mob DNA
Fig. 1Venn diagram of Alu elements among 4 regional populations. Each individual was placed into one of four regional groups. Every putative locus per individual (5288 total loci) was added into the particular regional group
Fig. 2Diagram of an identified heterozygous Alu insertion in METTL20. a: Diagram of AluYb8 insertion in METTL20. Open boxes indicate untranslated regions, closed boxes indicate coding regions, and lines indicate intronic regions. b: Diagram of the WT and the AluYb8 insertion sequences in exon 3 of METTL20. The light blue indicates intronic TSD region, green indicates exonic TSD region, and purple is the Alu sequence. The insertion of the Alu element duplicated the AG splice acceptor site, indicated in bold font
Fig. 3Comparison of Alu elements between ME-Scan and Phase3 datasets. a: On left, PCA of 41 1KG individuals (with less than 10% false negative rate) using 1266 polymorphic loci. Good Alu loci (Additional file 1: Table S4) and loci with presence/absence allele frequency of less than 5% or greater than 95% (all samples) were removed. On right, PCA of the same 41 IKG individuals with 2710 polymorphic Alu loci detected through the Phase3 WGS approach. Loci with presence/absence allele frequency for all individuals of less than 5% or greater than 95% were removed. b: PCA of 41 1KG individuals from both methods using the 191 shared loci from (a) . c: Venn Diagram of non-reference elements from ME-Scan and Phase3 in 1KG individuals. Phase3 dataset contains only polymorphic Alu elements, so ME-Scan loci were filtered to 1530 loci that were found in the 41 IKG individuals and absent from Repbase and build hg19. The Phase3 dataset was also filtered to 4670 Alu elements that were present in the 41 IKG individuals and absent from Repbase and build hg19. These two sets were then compared
Fig. 4Subfamily distribution of 68 Sanger-sequenced AluYb elements