| Literature DB >> 26887503 |
Lian Deng1, Andrés Ruiz-Linares2, Shuhua Xu1,3,4, Sijia Wang1,4.
Abstract
Latin American populations stem from the admixture of Europeans, Africans and Native Americans, which started over 400 years ago and had lasted for several centuries. Extreme deviation over the genome-wide average in ancestry estimations at certain genomic locations could reflect recent natural selection. We evaluated the distribution of ancestry estimations using 678 genome-wide microsatellite markers in 249 individuals from 13 admixed populations across Latin America. We found significant deviations in ancestry estimations including three locations with more than 3.5 times standard deviations from the genome-wide average: an excess of European ancestry at 1p36 and 14q32, and an excess of African ancestry at 6p22. Using simulations, we could show that at least the deviation at 6p22 was unlikely to result from genetic drift alone. By applying different linguistic groups as well as the most likely ancestral Native American populations as the ancestry, we showed that the choice of Native American ancestry could affect the local ancestry estimation. However, the signal at 6p22 consistently appeared in most of the analyses using various ancestral groups. This study provided important insights for recent natural selection in the context of the unique history of the New World and implications for disease mapping.Entities:
Mesh:
Year: 2016 PMID: 26887503 PMCID: PMC4757894 DOI: 10.1038/srep21766
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Mean local ancestry admixture estimations for 678 autosomal microsatellite markers of 249 Latin American individuals from 13 populations, by using all 444 Native Americans as ancestry.
The solid lines in black indicate the mean ancestry proportion of European, Native American and African, and the dashed lines represent the proportions showing 3.5 standard deviations from the means. The arrows point to three signals: an excess of European ancestry at 1p36 and 14q32, and an excess of African ancestry at 6p22.
Comparisons of the top 3 signals with the most significant loci identified under selective neutrality.
| Marker | Position | Excess ofAncestral | Top | ||
|---|---|---|---|---|---|
| AAT238 | 1p36 | European | 3.00 | 3.37 ± 0.50 | 0.770 |
| GATA51F04 | 14q32 | European | 3.35 | 3.37 ± 0.50 | 0.516 |
| ATA12D05 | 6p22 | African | 4.71 | 3.23 ± 0.52 | 0.002 |
The averaged Z scores and their standard deviations are described in the fifth column. The P value of each signal is calculated based on Z test, by assuming normal distribution of the highest Z score over 1000 simulations using the most likely ancestral Native Americans.
Excess of ancestry at AAT238 at 1p36, GATA51F04 at 14q32 and ATA12D05 at 6p22 using each of the 5 linguistic groups as Native American ancestry.
| Population | AAT238 | GATA51F04 | ATA12D05 | |||
|---|---|---|---|---|---|---|
| AncestryProportion | AncestryProportion | AncestryProportion | ||||
| Andean | 0.5092 | 2.3489 | 0.5225 | 3.3934 | 0.1096 | 4.6784 |
| Central Amerind | 0.5651 | 1.5179 | 0.5501 | 0.6188 | 0.1208 | 4.0313 |
| Chibchean Paezan | 0.5558 | 2.2653 | 0.5612 | 2.5946 | 0.1231 | 4.5593 |
| Equatorial-Tucanoan | 0.5847 | 3.2489 | 0.5296 | 2.5913 | 0.1187 | 3.5116 |
| Northern Amerind | 0.5441 | 2.5966 | 0.5211 | 1.5919 | 0.1065 | 3.6235 |
FDR adjusted P values significant at the 0.05 level are marked with *, and those significant at the 0.01 level are marked with **.
Figure 2Estimate of African ancestry in 13 Latin American populations.
The curve shows the probability density of the African ancestral proportion across the genome. The estimate of African ancestry at ATA12D05 at 6p22 is larger than the genome-wide average in all the 13 populations, among which 7 populations show significant deviations at this locus. Significance at 5% and 1% are indicated by * and **, respectively.