| Literature DB >> 22606979 |
W Scott Watkins1, Jinchuan Xing, Chad Huff, David J Witherspoon, Yuhua Zhang, Ugo A Perego, Scott R Woodward, Lynn B Jorde.
Abstract
BACKGROUND: Populations of the Americas were founded by early migrants from Asia, and some have experienced recent genetic admixture. To better characterize the native and non-native ancestry components in populations from the Americas, we analyzed 815,377 autosomal SNPs, mitochondrial hypervariable segments I and II, and 36 Y-chromosome STRs from 24 Mesoamerican Totonacs and 23 South American Bolivians. RESULTS ANDEntities:
Mesh:
Substances:
Year: 2012 PMID: 22606979 PMCID: PMC3432609 DOI: 10.1186/1471-2156-13-39
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1Sampling locations and the distribution of major mtDNA and Y-chromosome haplogroups for Mesoamerican Totonacs and South American Bolivians.
Figure 2a) Principal components plot of individual pairwise genetic distance estimates. Panel 1 – most New World Totonac and Bolivian individuals are clustered and have smaller estimated distances to the HapMap CHB/JPT than to the CEU or YRI (~815 K SNPs). Panel 2 – data merged with five Native American HGDP populations typed on the Affymetrix 6.0 platform (~470 K SNPs). Each individual (+) is color coded by population. The percent variance accounted for by each principal component is indicated on the axes. b) Population structure analysis of Totonac and Bolivian individuals at K inferred ancestral populations using a genome-wide panel of 120,958 SNPs (r2 ≤ 0.2). Each individual is shown as a vertical bar with proportionate ancestry indicated by color. The top two panels show European attributable ancestry in ten Bolivians at K = 2, 3. The bottom two panels demonstrate greater similarity between the Totonacs and Bolivians than between other World populations (K = 4, 5), including CHB and JPT samples.
European admixture estimates for admixed Bolivian samples
| mtDNA haplogroup | Y-chromosome haplogroup | CEU - Fraction of genomea | |||
|---|---|---|---|---|---|
| Hapmix | Admixture (3 populations, K = 3) | Admixture (5 populations, K = 5) | |||
| | | | | | |
| Bolivian 105 | B2 | R1b | 0.48 | 0.47 ±0.02 | 0.41 ±0.02 |
| Bolivian 869 | A2 | R1b | 0.37 | 0.35 ±0.02 | 0.32 ±0.02 |
| Bolivian 054 | B2 | J2 | 0.33 | 0.33 ±0.02 | 0.30 ±0.02 |
| Bolivian 455 | D1 | Q1a3a1 | 0.30 | 0.28 ±0.02 | 0.24 ±0.02 |
| Bolivian 853 | B2 | R1b | 0.27 | 0.22 ±0.02 | 0.21 ±0.01 |
| Bolivian 101 | B2 | R1b | 0.25 | 0.23 ±0.02 | 0.21 ±0.01 |
| Bolivian 817 | B2 | R1b | 0.28 | 0.24 ±0.02 | 0.20 ±0.02 |
| Bolivian 081 | C1 | Q1a3a1 | 0.26 | 0.23 ±0.02 | 0.23 ±0.02 |
| Bolivian 458 | B2 | R1b | 0.22 | 0.22 ±0.02 | 0.19 ±0.02 |
| Bolivian 184 | B2 | Q1a3a1 | 0.13 | 0.05 ±0.02 | 0.09 ±0.02 |
| Totonac 867 | A2 | Q1a3a1 | 10−5 | 10−5 ±10−16 | 10−5 ±10−16 |
| CEU NA11993 | H | − | >0.99 | >0.99 ±10−16 | >0.99 ±0.10−3 |
aEstimated proportion of each genome attributable to European ancestry based on the CEU reference population and calculated from 815,377 SNPs (Hapmix) or 120,958 unlinked (r2 ≤ 0.2) SNPs (Admixture).
Figure 3Estimate for the age of admixture in Bolivians. The Hapmix log likelihoods summed over all individuals and chromosomes is plotted for generations 2 through 35.
Figure 4Structure analysis of Bolivians and Totonacs using a panel of 324 AIMs.a) New World ancestry is predicted for all Bolivians and Totonacs. A non-New World ancestry component is correctly distinguished in the ten Bolivians with European admixture. b) A subset of 173 AIMs present in the merged genome-wide data set (this study, [9] and [20]) identifies New World ancestry in other unrelated Native American populations and demonstrates transferability to other New World populations that were not used to ascertain the AIMs. c) 47 AIMs from Kosoy et al. present in the merged data.
Figure 5Accuracy and performance of the 324 AIMs. The root mean square error (RMSE) between Native American ancestry estimates using AIMs and the ancestry estimate using 120,598 genome-wide markers. (Note: the full AIMs panel (324 markers) produces ancestry estimates slightly lower than the genome-wide marker set, and thus the RMSE cannot achieve zero error with respect to the high-density genome-wide marker set.) AIMs are ordered from more informative (left) to less informative.
Location and regional genomic features of highly-differentiated New World SNPs
| SNP rs number | Chr | Position | Derived allele | XP-CLR Ranka (percent) | XP-CLR scoreb | XP-EHH region | XP-EHH Ranka (percent) | Gene | Gene function |
|---|---|---|---|---|---|---|---|---|---|
| rs2320170 | 2 | 95,603,500 | A | 10 | 7 / 7 | − | − | 5′ of TRIM43 | Zn-finger protein |
| rs3774089 | 3 | 10,931,071 | T | 0.1 | 13 / 109 | 10,807,575 – 11,007,575 | 0.02 | SLC6A11 (intronic) | GABA transporter |
| rs1344869 | 3 | 21,282,605 | G | 0.1 | 93/ 152 | 21,152,068 – 21,352,068 | 1 | − | − |
| rs9847307 | 3 | 64,500,753 | A | 10 | 2 / 23 | − | − | ADAMTS9 (intronic) | Protease |
| rs17617120 | 5 | 155,231,791 | T | 0.1 | 22 / 127 | 155,086,458 – 155,286,458 | 2 | SGCD (intronic) | Cardiac, skeletal |
| rs17617422 | 5 | 155,249,830 | G | 0.1 | 80 / 130 | 155,086,458 – 155,286,458 | 2 | SGCD (intronic) | Cardiac, skeletal |
| rs11960137 | 5 | 155,270,659 | G | 0.1 | 130 / 150 | 155,086,458 – 155,286,458 | 2 | SGCD (intronic) | Cardiac, skeletal |
| rs2642515 | 7 | 145,998,474 | T | 1 | 103 / 103 | 145,884,552 – 146,084,552 | 2 | CNTNAP2 (intronic) | Neurexin, regulated by FOXP2 |
| rs174547 | 11 | 61,327,359 | T | 10 | 21 / 23 | − | − | FADS1 (intronic) | Fatty acid desaturase |
| rs174548 | 11 | 61,327,924 | C | 10 | 20 / 25 | − | − | FADS1 (intronic) | Fatty acid desaturase |
| rs174549 | 11 | 61,327,958 | G | 10 | 20 / 25 | − | − | FADS1 (intronic) | Fatty acid desaturase |
| rs11610143 | 12 | 50,635,338 | G | 1 | 26 / 82 | 50,502,870 – 50,702,870 | 0.04 | ACVR1B (intronic) | Signaling, growth factor receptor |
| rs7955663 | 12 | 127,800,083 | A | 10 | 1 / 11 | − | − | − | − |
| rs1538142 | 13 | 37,344,432 | C | 0.1 | 123 / 177 | 37,244,768 – 37,444,768 | 2.5 | 5′ of TRPC4 | Ca2+ channel |
| rs693092 | 13 | 87,858,156 | G | 1 | 24 / 68 | 87,851,770 – 88,051,770 | 2 | − | − |
| rs9515075 | 13 | 88,033,482 | C | 0.1 | 105 / 189 | 87,851,770 – 88,051,770 | 2 | − | − |
| rs566514 | 13 | 32,551,339 | T | 10 | 19 / 32 | − | − | 5′ of STARD13 | GTP-binding, Lipid transfer |
| rs7170342 | 15 | 32,755,246 | C | 10 | 6 / 21 | 32,602,304 – 32,802,304 | 1 | 5′of AA496137 | Expressed in testes |
| rs4924116 | 15 | 35,086,443 | C | 1 | 10 / 56 | 34,986,366 – 35,186,366 | 1 | MEIS2 (intronic) | Homeobox, development |
| rs12439270 | 15 | 58,029,372 | C | 1 | 57 / 87 | 57,850,426 – 58,050,426 | 1 | 5′of FOXB1 | Transcription factor |
| rs1452501 | 16 | 79,180,763 | T | 10 | 8 / 21 | − | − | − | |
| rs470113 | 22 | 39,059,560 | A | 1 | 38 / 101 | − | − | TNRC6B (3′UTR) | Nucleotide binding |
Chr: chromosome; aRanking for a region based on the best score for the SNP ± 25 kb (XP-CLR) or ± 100 kb (XP-EHH), ranked empirically by percent of the distribution (e.g. the top 10%, 1%, 0.1%, …), comparisons to CHB/JPT; bSNP score / Best score for region (SNP location ± 25 kb); positions based on hg18.