| Literature DB >> 20360948 |
Hongbin Lin1, Hao Fan, Feng Zhang, Xiaoqin Huang, Keqin Lin, Lei Shi, Songnian Hu, Jiayou Chu, Duen-Mei Wang.
Abstract
Population migrations in Southwest and South China have played an important role in the formation of East Asian populations and led to a high degree of cultural diversity among ethnic minorities living in these areas. To explore the genetic relationships of these ethnic minorities, we systematically surveyed the variation of 10 autosomal STR markers of 1,538 individuals from 30 populations of 25 ethnic minorities, of which the majority were chosen from Southwest China, especially Yunnan Province. With genotyped data of the markers, we constructed phylogenies of these populations with both D(A) and D(C) measures and performed a principal component analysis, as well as a clustering analysis by structure. Results showed that we successfully recovered the genetic structure of analyzed populations formed by historical migrations. Aggregation patterns of these populations accord well with their linguistic affiliations, suggesting that deciphering of genetic relationships does in fact offer clues for study of ethnic differentiation.Entities:
Mesh:
Year: 2010 PMID: 20360948 PMCID: PMC2847899 DOI: 10.1371/journal.pone.0009895
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Geographical location of the 30 sampled populations.
Averaged heterozygosities (H) for the 10 analyzed markers.
| D3S1297 | D3S1304 | D3S1263 | D3S1266 | D3S1285 | D3S1278 | D3S1292 | D3S1279 | D3S1614 | D3S1580 | |
| Map Position (cM) | 8.3 | 22.3 | 36.1 | 52.6 | 91.2 | 129.7 | 146.6 | 169.6 | 177.8 | 207.7 |
| mean | 0.720 | 0.798 | 0.883 | 0.702 | 0.709 | 0.759 | 0.871 | 0.767 | 0.750 | 0.825 |
| s.d. | 0.055 | 0.025 | 0.021 | 0.040 | 0.048 | 0.062 | 0.022 | 0.053 | 0.041 | 0.046 |
| CEPH | 0.820 | 0.800 | 0.860 | 0.730 | 0.730 | 0.870 | 0.850 | 0.850 | 0.830 | 0.840 |
CEPH data were from the panel guide of ABI Prism Linkage Mapping Set v2.5.
Mantel test results with pair of D distance matrices of different loci.
| D3S1297 | D3S1304 | D3S1263 | D3S1266 | D3S1285 | D3S1278 | D3S1292 | D3S1279 | D3S1614 | D3S1580 | |
| Map Position (cM) | 8.3 | 22.3 | 36.1 | 52.6 | 91.2 | 129.7 | 146.6 | 169.6 | 177.8 | 207.7 |
| D3S1297 | - | 0.000 | 0.001 |
| 0.000 | 0.000 | 0.000 | 0.048 | 0.000 | 0.000 |
| D3S1304 | 0.371 | - | 0.001 | 0.000 | 0.017 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 |
| D3S1263 | 0.320 | 0.293 | - |
| 0.000 | 0.006 | 0.000 | 0.039 | 0.015 | 0.000 |
| D3S1266 | 0.114 | 0.550 | 0.090 | - |
|
| 0.020 | 0.002 | 0.004 | 0.041 |
| D3S1285 | 0.616 | 0.306 | 0.388 | 0.119 | - | 0.000 | 0.000 | 0.046 | 0.001 | 0.000 |
| D3S1278 | 0.601 | 0.352 | 0.285 | 0.121 | 0.661 | - | 0.000 | 0.026 | 0.000 | 0.000 |
| D3S1292 | 0.421 | 0.419 | 0.413 | 0.290 | 0.593 | 0.493 | - | 0.000 | 0.000 | 0.000 |
| D3S1279 | 0.208 | 0.537 | 0.218 | 0.542 | 0.227 | 0.301 | 0.467 | - | 0.000 | 0.024 |
| D3S1614 | 0.426 | 0.482 | 0.242 | 0.422 | 0.509 | 0.497 | 0.502 | 0.441 | - | 0.000 |
| D3S1580 | 0.581 | 0.444 | 0.403 | 0.246 | 0.680 | 0.696 | 0.532 | 0.294 | 0.511 | - |
Values in lower triangle are Pearson's linear correlation coefficients between each pair of matrices. Values in upper triangle are p values for test of coefficients based on 5000 permutations. n.s. stands for not significant at the 0.05 level.
Figure 2Unrooted neighbor-joining trees constructed with D distances.
(A) is the full-loci dataset and (B) the full-population dataset. Numbers labelled here represent percentage of occurrence of corresponding branches in 1,000 bootstrap replicates (where only values >40% are shown). Names of populations were coloured according to their linguistic affiliations.
Figure 3Principal component analysis with normalized allele frequencies for the full-loci dataset.
Percentages of variance accounted for by the two components are indicated in labels. For better visual comparison with geographical distribution of studied populations, the plot was counter-clockwisely rotated 90°. Colouring of linguistic affiliations follows that in Figure 2a.
Figure 4Clustering analysis by structure for the full-loci dataset assuming K = 2, 3, 4.
Populations were ordered according to their respective unrooted N-J trees. Linguistic affiliations and population names are labelled above and beneath the plot, respectively. Data presented here were the results with highest posterior probabilities during 15 runs of each K setting.
Correlation coefficients among genetic (GEN), geographic (GEO), and linguistic (LING) distances.
| Dataset |
|
|
| |
| Full-loci | 2-way | 0.287(0.038) | 0.239( | 0.346( |
| 3-way | 0.224(0.078) | 0.156(0.043) | ||
| non-Chinese | 2-way | 0.286(0.048) | 0.274( | 0.417( |
| 3-way | 0.196( | 0.177(0.024) | ||
| non-Tibeto-Burman | 2-way | 0.488( | 0.309( | 0.310( |
| 3-way | 0.434( | 0.190(0.031) | ||
| non-Tai-Kadai | 2-way | 0.270(0.065) | 0.298( | 0.416( |
| 3-way | 0.168( | 0.212(0.048) | ||
| non-Mon-Khmer | 2-way | 0.317(0.033) | 0.257( | 0.413( |
| 3-way | 0.239(0.060) | 0.146(0.052) | ||
| non-Mongolian | 2-way | 0.398(0.013) | 0.274( | 0.383( |
| 3-way | 0.330(0.027) | 0.144( | ||
| non-Turkic | 2-way | 0.304(0.087) | 0.211(0.042) | 0.353( |
| 3-way | 0.251( | 0.116( | ||
| non-Iranian | 2-way | 0.072( | 0.179(0.017) | 0.316( |
| 3-way | 0.017( | 0.165(0.038) |
In parentheses are presented p values of Mantel tests by 5,000 permutations. Three-way tests were carried out by controlling the distance that does not appear in the subscript. All tests were based on the full-loci dataset, which contains information of 10 loci of 26 populations, and its derivatives by excluding populations of a specific language. Highlighted are highly significant (0.01 level) p values; n.s. stands for not significant at the 0.1 level.