| Literature DB >> 28854642 |
Jerilyn A Walker1, Vallmer E Jordan1, Cody J Steely1, Thomas O Beckstrom1, Cullen L McDaniel1, Corey P St Romain1, Emily C Bennett1, Arianna Robichaux1,2, Brooke N Clement1,3, Miriam K Konkel1, Mark A Batzer1.
Abstract
The genus of Papio (baboon) has six recognized species separated into Northern and Southern clades, each comprised of three species distributed across the African continent. Geographic origin and phenotypic variants such as coat color and body size have commonly been used to identify different species. The existence of multiple hybrid zones, both ancient and current, have complicated efforts to characterize the phylogeny of Papio baboons. More recently, mitochondrial DNA (mtDNA) and Y-chromosome genetic markers have been utilized for species identification with particular focus on the hybrid zones. Alu elements accumulate in a random manner and are a novel source of identical by descent variation with known ancestral states for inferring population genetic and phylogenetic relationships. As part of the Baboon Genome Analysis Consortium, we assembled an Alu insertion polymorphism database of nearly 500 Papio-lineage specific insertions representing all six species and performed population structure and phylogenetic analyses. In this study, we have selected a subset of 48 species indicative Alu insertions and demonstrate their utility as genetic systems for the identification of baboon species within Papio. Individual elements from the panel are easy to genotype and can be used in a hierarchical fashion based on the original level of uncertainty. This Alu-48 panel should serve as a valuable tool during the maintenance of pedigree records in captive populations and assist in the forensic identification of fossils and potential hybrids in the wild.Entities:
Keywords: evolutionary biology; population genomics; retrotransposon
Mesh:
Substances:
Year: 2017 PMID: 28854642 PMCID: PMC5569700 DOI: 10.1093/gbe/evx130
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Structure analysis of 67 Papio baboon individuals using the Alu-48 panel. The y axis shows the probability of assignment of each individual to six population clusters. The x axis shows baboons numbered 1–67 listed in the same order as supplementary file S3, Supplementary Material online. Individuals 1–15 are olive baboons (P. anubis), 16–17 are hamadryas baboons (P. hamadryas), 18–19 are Guinea baboons (P. papio), 23–37 are kinda baboons (P. kindae), and 38–67 are yellow baboons (P. cynocephalus) from the Mikumi National Park, Tanzania. K = 6 population clusters matches the recognized number of Papio species and captures the majority of structure in the data.
Allele Frequency Distribution Data for Each Alu-48 Locus, Sorted by Papio Species
| Loci | Olive | Hamadryas | Guinea | Chacma | Kinda | Yellow | ||
|---|---|---|---|---|---|---|---|---|
| Bab_LPL | 0.000 | 0.000 | 0.000 | 0.000 | ||||
| TB_3063 | 0.000 | 0.000 | 0.000 | |||||
| TB_3084 | 0.000 | 0.000 | 0.000 | 0.000 | ||||
| 69388 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| 46912 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| 27402 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| 27523 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| 11507 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| TB_3040 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| TB_3023 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| TB_76 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Ham-09 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| Ham-16 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| Ham-27 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| Ham-28 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| Ham-41 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| Ham-43 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| Ham-44 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G47-0 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G47-13 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G47-17 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G47-28 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G88-9 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G88-19 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| G88-20 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-16 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-38 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-44 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-05 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-36 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-42 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| C-49 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| K-20 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| K-29 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| K-30 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| K-74-85 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| K-17 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | ||
| K2-10 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| K-33 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| T2-103 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Y-90 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Y-65 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Y-71 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Y-141 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Y-108 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| Y-119 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| T2-25 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||
| T2-29 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
Note.–Colored fields indicate the species from which the locus was ascertained. Bold font indicates an allele frequency >0.000.
Inferred Population Structure: Probability of Assignment to Each Population Cluster
| Given Pop | Proportional Membership to Population Clusters | Number of Individuals | ||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | |||
| Unknown (SFBR-Y) | 0 | 0.436 | 0.030 | 0.022 | 0.033 | 0.061 | 0.419 | 12 |
| 1 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 15 | |
| 2 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 2 | |
| 3 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 2 | |
| 4 | 0.000 | 0.000 | 0.000 | 0.999 | 0.000 | 0.000 | 3 | |
| 5 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 15 | |
| 6 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | 0.998 | 30 | |
Note.—Colored fields highlight species assignment to population clusters. Cluster 1 (olive), cluster 2 (gray), cluster 3 (lavender), cluster 4 (red), cluster 5 (blue), and cluster 6 (yellow).
Structure Analysis of 12 SFBR-Y Individuals as Unknowns
| Proportional Membership to Population Clusters | ||||||
|---|---|---|---|---|---|---|
| Pop 0 | 1 | 2 | 3 | 4 | 5 | 6 |
| ID | Olive | Hamadryas | Guinea | Chacma | Kinda | Yellow |
| 6968 | 0.607 | 0.032 | 0.021 | 0.030 | 0.038 | 0.273 |
| 1x1763 | 0.352 | 0.063 | 0.032 | 0.039 | 0.047 | 0.467 |
| 1x2092 | 0.725 | 0.016 | 0.014 | 0.020 | 0.025 | 0.200 |
| 9166 | 0.350 | 0.024 | 0.025 | 0.027 | 0.033 | 0.542 |
| 1x1786 | 0.447 | 0.024 | 0.021 | 0.047 | 0.060 | 0.401 |
| 1x3027 | 0.545 | 0.018 | 0.017 | 0.036 | 0.046 | 0.338 |
| 9481 | 0.286 | 0.030 | 0.036 | 0.041 | 0.045 | 0.562 |
| 8919 | 0.289 | 0.073 | 0.031 | 0.063 | 0.072 | 0.472 |
| 9656 | 0.547 | 0.017 | 0.018 | 0.024 | 0.023 | 0.371 |
| 1x2117 | 0.115 | 0.016 | 0.017 | 0.027 | 0.167 | 0.658 |
| 1x2798 | 0.529 | 0.028 | 0.019 | 0.024 | 0.156 | 0.243 |
| 8820 | 0.440 | 0.015 | 0.012 | 0.015 | 0.018 | 0.502 |
Note.—Probability of assignment to each population cluster.