| Literature DB >> 29632618 |
Vallmer E Jordan1, Jerilyn A Walker1, Thomas O Beckstrom1, Cody J Steely1, Cullen L McDaniel1, Corey P St Romain1, Kim C Worley2,3, Jane Phillips-Conroy4, Clifford J Jolly5, Jeffrey Rogers2,3, Miriam K Konkel1,6, Mark A Batzer1.
Abstract
BACKGROUND: Since the completion of the human genome project, the diversity of genome sequencing data produced for non-human primates has increased exponentially. Papio baboons are well-established biological models for studying human biology and evolution. Despite substantial interest in the evolution of Papio, the systematics of these species has been widely debated, and the evolutionary history of Papio diversity is not fully understood. Alu elements are primate-specific transposable elements with a well-documented mutation/insertion mechanism and the capacity for resolving controversial phylogenetic relationships. In this study, we conducted a whole genome analysis of Alu insertion polymorphisms unique to the Papio lineage. To complete these analyses, we created a computational algorithm to identify novel Alu insertions in next-generation sequencing data.Entities:
Keywords: Alu; Evolutionary genetics; Hybridization; Papio; Phylogeny; Primates; Retrotransposon; Taxonomy
Year: 2018 PMID: 29632618 PMCID: PMC5885306 DOI: 10.1186/s13100-018-0118-3
Source DB: PubMed Journal: Mob DNA
Fig. 1Computational detection of Alu insertion polymorphisms using split-reads. Alu insertions were identified using sequencing reads spanning the Alu integration locus whether these split-reads spanned the 5′ (a and c) or the 3′ (b and d) end of the insertion. The four split-reads represented in this figure are labeled a, b, c, and d. Green boxes represent Alu sequence; gray boxes denote flanking sequence. If the split-read is paired and its read-pair mapped to the flanking sequence (b and c), these mapping coordinates were used to provide additional support for the location predicted by the split-read. If the split-read’s read-pair mapped to the Alu (a and d), this was used to provide additional support for the presence of the predicted Alu insertion
Fig. 2Alu-based phylogeny of extant Papio baboon species. Phylogenetic relationships of Papio baboons constructed using 123,120 Alu insertion polymorphisms. Genotypes computationally determined in 12 Papio baboons were used to construct a Dollo parsimony tree using M. mulatta as an outgroup. The percentage of bootstrap replicates (out of 10,000 iterations) is listed below each branch; the number of Alu insertions supporting each node is listed above each branch. Homoplasy index (H.I.) and consistency index (C.I.) are included below the cladogram
Fig. 3Common patterns of shared Alu insertion polymorphisms. a The number of Alu insertions shared exclusively between the species highlighted in each row. Markers were clustered based on precise presence/absence genotype data determined for six Papio baboons: one representing each Papio species. This figure displays the 15 largest clusters identified in this analysis. The colors correspond to the (b) Geographical distributions of the six Papio species. Map extrapolated from [30]. White/empty boxes indicate an empty site in that species
Fig. 4Analysis of phylogenetically informative Alu insertions. a Species indicative Alu insertion polymorphisms. For each species, the total number of Alu insertion polymorphisms shared exclusively between individuals belonging to that species. All species indicative markers were identified in multiple representative individuals. Also displayed is the number of Alu insertion polymorphisms supporting alternative northern (b) and southern (c) clade phylogenies. These markers were shared between multiple individuals belonging to each of the sister taxa displayed, yet absent from the third divergent species. Each phylogeny corresponds to the data point above it
Fig. 5Low allele frequency Alu insertions polymorphic among Papio species. A diversity panel of 12 Papio baboons was used in this analysis: two representing each extant species. The only elements used in this analysis were those shared uniquely between two species. Each pie chart represents the average values determined from the two individuals representing that Papio species (the species name is listed above each pie chart). The size of every pie chart slice represents the number of Alu insertions shared between the species listed above that particular chart and the species represented by the color of the slice (indicated by the legend on the right). The numbers outside each pie chart correspond to the total number of Alu insertions represented by each slice