| Literature DB >> 29449901 |
Jasmine N Baker1, Jerilyn A Walker1, Michael W Denham1, Charles D Loupe1, Mark A Batzer1.
Abstract
BACKGROUND: The evolution of Alu elements has been ongoing in primate lineages and Alu insertion polymorphisms are widely used in phylogenetic and population genetics studies. Alu subfamilies in the squirrel monkey (Saimiri), a New World Monkey (NWM), were recently reported. Squirrel monkeys are commonly used in biomedical research and often require species identification. The purpose of this study was two-fold: 1) Perform locus-specific PCR analyses on recently integrated Alu insertions in Saimiri to determine their amplification dynamics, and 2) Identify a subset of Alu insertion polymorphisms with species informative allele frequency distributions between the Saimiri sciureus and Saimiri boliviensis groups.Entities:
Keywords: Alu polymorphism; Population structure; Retroposon; Saimiri
Year: 2018 PMID: 29449901 PMCID: PMC5808450 DOI: 10.1186/s13100-018-0114-7
Source DB: PubMed Journal: Mob DNA
Number of recently integrated Alu elements analyzed for each percent divergence bin
| Percent Divergence | Number of | Number of Loci PCR Validated | Number of Polymorphic Loci | Number of Fixed Loci |
|---|---|---|---|---|
| 0.0 | 7 | 6 | 2 | 4 |
| 0.5 | 49 | 34 | 16 | 18 |
| 1.0 | 395 | 106 | 37 | 69 |
| 1.5 | 1493 | 168 | 44 | 124 |
| 2.0 | 2240 | 68 | 11 | 57 |
Total number of Alu insertions in the [saiBol1] genome from a range of 0% to 2% sequence divergence from their respective consensus sequence. The number of Alu insertions in each divergence category from the PCR validation experiments in this study is shown in the center column and separated by the number of polymorphic versus fixed loci in adjacent columns
PCR validation results for each Alu subfamily
| Subfamily | N | Fixed | Polymorphic | Subfamily | N | Fixed | Polymorphic | ||
|---|---|---|---|---|---|---|---|---|---|
| 1 | sf36 | 14 | 10 | 4 | 25 | subfam15 | 5 | 4 | 1 |
| 2 | sf37 | 12 | 10 | 2 | 26 | subfam17 | 1 | 1 | 0 |
| 3 | sf38 | 16 | 11 | 5 | 27 | subfam18 | 1 | 1 | 0 |
| 4 | sf42 | 24 | 15 | 9 | 28 | subfam2 | 3 | 3 | 0 |
| 5 | sf44 | 17 | 14 | 3 | 29 | subfam21 | 1 | 1 | 0 |
| 6 | sf46 | 15 | 9 | 6 | 30 | subfam26 | 11 | 9 | 2 |
| 7 | sf47 | 11 | 7 | 4 | 31 | subfam27 | 1 | 1 | 0 |
| 8 | sf51 | 16 | 8 | 8 | 32 | subfam29 | 6 | 4 | 2 |
| 9 | sf52 | 17 | 14 | 3 | 33 | subfam30 | 1 | 1 | 0 |
| 10 | sf53 | 3 | 2 | 1 | 34 | subfam32 | 15 | 11 | 4* |
| 11 | sf62 | 12 | 7 | 5 | 35 | subfam36 | 12 | 7 | 5 |
| 12 | sf63 | 16 | 11 | 5 | 36 | subfam37 | 4 | 4 | 0 |
| 13 | sf65 | 1 | 1 | 0 | 37 | subfam39 | 5 | 4 | 1 |
| 14 | sf66 | 13 | 10 | 3 | 38 | subfam4 | 7 | 5 | 2 |
| 15 | sf71 | 14 | 12 | 2 | 39 | subfam43 | 8 | 7 | 1 |
| 16 | sf73 | 11 | 5 | 6 | 40 | subfam45 | 3 | 3 | 0 |
| 17 | sf82 | 15 | 10 | 5 | 41 | subfam47 | 1 | 1 | 0 |
| 18 | sf85 | 3 | 1 | 2 | 42 | subfam5 | 9 | 5 | 4 |
| 19 | sf86 | 11 | 9 | 2 | 43 | subfam7 | 1 | 1 | 0 |
| 20 | subfam0 | 9 | 6 | 3 | 44 | subfam9 | 4 | 3 | 1 |
| 21 | subfam11 | 3 | 1 | 2* | 45 | Ta10 | 5 | 5 | 0 |
| 22 | subfam12 | 12 | 7 | 5 | 46 | Ta15 | 5 | 4 | 1* |
| 23 | subfam13 | 2 | 2 | 0 | |||||
| 24 | subfam14 | 6 | 5 | 1 | Total | 382 | 272 | 110 |
*Three loci in the polymorphic column, L-21071-subfam11, L-38701-subfam32 and L-19471-Ta15, were homozygous absent for the Alu in all 32 squirrel monkey samples on the DNA panel
Fig. 1Gel Image of Polymorphic Locus 35154 (JH378108:33053451–33054957). This image displays a polymorphic locus in the Saimiri genome [saiBol1]. Lanes: 1- 100 bp ladder, 2- TLE (Negative control), 3- Human (HeLa), 4-Callithrix jacchus (Common marmoset), 5–16 Saimiri sciureus (Common squirrel monkey), 17–32 Saimiri boliviensis (Bolivian squirrel monkey), 33–35 Saimiri boliviensis peruviensis (Peruvian squirrel monkey), 36- Saimiri oerstedii oerstedii (Panamanian red back squirrel monkey), 37- Saimiri sciureus macrodon, 38-Saimiri sp. The presence of the Alu element is indicated by the ~ 655 bp band and the absence by the ~ 346 bp band. Species with multiple individuals are grouped together by colored brackets (Orange- Common squirrel monkey, Blue- Bolivian squirrel monkey, Green-Peruvian squirrel monkey). Lanes 7(UWBM# 75531) and 12(MVZ Mamm 193661) share an insertion with the Bolivian squirrel monkeys whom are either homozygous present or heterozygous for the insertion (lanes 17–32). Lane 38 (species unknown) is heterozygous for the insertion
Fig. 2Population Structure analysis based on 110 Alu insertion polymorphisms and 32 squirrel monkey individuals for K = 2. The percent assignment of each individual to K = 2 clusters is shown on the Y-axis. The ID numbers and species names are shown on the X-axis. K = 2 captures the population structure of the two Saimiri groups, S. sciureus and S. boliviensis, and is consistent with the geographic origins of these samples
Fig. 3Population Structure analysis based on 110 Alu insertion polymorphisms and 32 squirrel monkey individuals for K = 3. The percent assignment of each individual to K = 3 clusters is shown on the Y-axis. The ID numbers and species names are shown on the X-axis. K = 3 captures the population structure of the two Saimiri groups, S. sciureus and S. boliviensis while also detecting the genetic isolation of members of a captive breeding colony within the S. boliviensis samples
Average Fst Values for K = 2 and K = 3
| K Value | Cluster Number | Average Fst |
|---|---|---|
| K = 2 | 1 | .7747 |
| K = 2 | 2 | .6950 |
| K = 3 | 1 | .8014 |
| K = 3 | 2 | .7639 |
| K = 3 | 3 | .3391 |
Average Fst values for K (estimated population clusters) equals 2 and K equals 3. If K = 2, Fst values are similar which implies genetic similarity between populations. If K = 3, Fst values are similar for two population clusters and one cluster has an extremely low value of 0.3391. That extremely low value implies Cluster 3 is sharing genetic material through inbreeding and appears to be isolated
Allele frequency data for Alu insertions with species informative distributions
| a. | b. | c. | ||
|---|---|---|---|---|
|
|
|
| ||
| 1 | L-20858-sf38 | 0.000 | 0.893 | 0.000 |
| 2 | L-40335-subfam32 | 0.000 | 0.893 | 0.000 |
| 3 | L-21370-subfam26 | 0.083 | 1.000 | 0.000 |
| 4 | L-26673-subfam29 | 0.167 | 0.857 | 0.000 |
| 5 | L-16089-Subfam26 | 0.167 | 1.000 | 0.050 |
| 6 | L-27488-subfam4 | 0.167 | 1.000 | 0.000 |
| 7 | L-27102-subfam5 | 0.083 | 0.929 | 0.000 |
| 8 | L-29927-Subfam4 | 0.150 | 0.964 | 0.056 |
| 9 | L-22568-sf37 | 0.167 | 0.929 | 0.000 |
| 10 | L-18103-subfam11 | 0.125 | 0.964 | 0.050 |
| 11 | L-11426-sf51 | 0.182 | 1.000 | 0.000 |
| 12 | L-14471-sf63 | 0.083 | 0.964 | 0.000 |
| 13 | L-19033-sf66 | 0.000 | 0.833 | 0.000 |
| 14 | L-12684-sf63 | 0.000 | 0.786 | 0.000 |
| 15 | L-1748-subfam0 | 0.167 | 1.000 | 0.000 |
| 16 | L-13945-sf46 | 0.042 | 1.000 | 0.000 |
| 17 | L-20802-sf62 | 0.167 | 1.000 | 0.000 |
| 18 | L-17843-sf62 | 0.167 | 0.913 | 0.000 |
| 19 | L-6918-subfam43 | 0.208 | 0.964 | 0.050 |
| 20 | L-31469-subfam29 | 0.042 | 0.929 | 0.000 |
| 21 | L-24998-subfam36 | 0.000 | 1.000 | 0.000 |
| 22 | L-40504-sf42 | 0.167 | 1.000 | 0.000 |
| 23 | L-26020-sf85 | 0.167 | 1.000 | 0.000 |
| 24 | L-33213-sf86 | 0.167 | 1.000 | 0.000 |
| 25 | L-2485-sf82 | 0.042 | 1.000 | 0.000 |
| 26 | L-35028-sf63 | 0.125 | 1.000 | 0.000 |
| 27 | L-18718-sf62 | 0.167 | 1.000 | 0.000 |
| 28 | L-6892-sf71 | 0.167 | 1.000 | 0.000 |
| 29 | L-7578-sf82 | 0.167 | 1.000 | 0.000 |
| 30 | L-19942-sf73 | 0.167 | 1.000 | 0.000 |
| 31 | L-20830-sf73 | 0.200 | 1.000 | 0.000 |
| 32 | L-25034-subfam36 | 0.167 | 0.923 | 0.000 |
| 33 | L-38119-subfam12 | 0.167 | 0.964 | 0.000 |
| 34 | L-30099-sf52 | 0.167 | 1.000 | 0.000 |
| 35 | L-36916-subfam12 | 0.208 | 1.000 | 0.050 |
| 36 | L-8051-sf42 | 0.167 | 1.000 | 0.000 |
| 37 | L-24655-s42 | 0.167 | 0.964 | 0.000 |
| 38 | L-39021-sf51 | 0.167 | 1.000 | 0.000 |
| 39 | L-16832-sf82 | 0.083 | 0.929 | 0.000 |
| 40 | L-20778-sf73 | 0.167 | 1.000 | 0.000 |
| 41 | L-37765-sf82 | 0.111 | 1.000 | 0.000 |
| 42 | L-30633-sf86 | 0.125 | 1.000 | 0.000 |
| 43 | L-431-sf66 | 0.125 | 1.000 | 0.000 |
| 44 | L-20383-sf36 | 0.167 | 1.000 | 0.000 |
| 45 | L-30828-subfam5 | 0.167 | 0.893 | 0.000 |
| 46 | L-22291-sf46 | 0.125 | 0.893 | 0.000 |
| 47 | L-25257-sf42 | 0.167 | 1.000 | 0.000 |
| 48 | L-26813-sf42 | 0.125 | 1.000 | 0.000 |
| 49 | L-28766-sf38 | 0.167 | 1.000 | 0.000 |
| 50 | L-38773-sf44 | 0.167 | 1.000 | 0.000 |
| 51 | L-10445-sf46 | 0.000 | 0.857 | 0.000 |
Allele frequency data for 51 polymorphic Alu insertions with species informative distribution between S. sciureus and S. boliviensis squirrel monkey species. Column C. with only ten S. sciureus samples has #75531 and #193661 omitted from the calculation because they clustered more closely with the Bolivian cluster (See Fig. 2). The 14 S. boliviensis group have an allele frequency of 80–100% whereas the 12 samples labeled S. sciureus have a group allele frequency of 0–20%. With #75531 and # 193661 omitted in column C, the group allele frequency in the S. sciureus group drops to near zero (0.5% on average). These 51 Alu insertion polymorphisms represent 26 different subfamilies: 10 Saimiri lineage specific Alu subfamilies reported in Baker et al. 2017 [48] and 16 NWM Alu subfamilies discovered in marmoset [49]