| Literature DB >> 27535707 |
Guang Yao Fan1,2, Yi Ye2,3, Yi Ping Hou1.
Abstract
Detecting population structure and estimating individual biogeographical ancestry are very important in population genetics studies, biomedical research and forensics. Single-nucleotide polymorphism (SNP) has long been considered to be a primary ancestry-informative marker (AIM), but it is constrained by complex and time-consuming genotyping protocols. Following up on our previous study, we propose that a multi-insertion-deletion polymorphism (Multi-InDel) with multiple haplotypes can be useful in ancestry inference and hierarchical genetic population structures. A validation study for the X chromosome Multi-InDel marker (X-Multi-InDel) as a novel AIM was conducted. Genetic polymorphisms and genetic distances among three Chinese populations and 14 worldwide populations obtained from the 1000 Genomes database were analyzed. A Bayesian clustering method (STRUCTURE) was used to discern the continental origins of Europe, East Asia, and Africa. A minimal panel of ten X-Multi-InDels was verified to be sufficient to distinguish human ancestries from three major continental regions with nearly the same efficiency of the earlier panel with 21 insertion-deletion AIMs. Along with the development of more X-Multi-InDels, an approach using this novel marker has the potential for broad applicability as a cost-effective tool toward more accurate determinations of individual biogeographical ancestry and population stratification.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27535707 PMCID: PMC4989243 DOI: 10.1038/srep32178
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genetic diversity indices for all markers of three Chinese subject populations for females.
| Loci | Han in Chengdu (n = 117) | Tibetan in Lhasa (n = 55) | Uygur in Urumqi (n = 55) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.553 | 0.573 | 0.632 | 0.583 | 0.564 | 0.663 | 0.565 | 0.527 | 0.643 | |
| 0.537 | 0.615 | 0.611 | 0.559 | 0.582 | 0.642 | 0.581 | 0.709 | 0.661 | |
| 0.500 | 0.538 | 0.567 | 0.449 | 0.509 | 0.524 | 0.438 | 0.564 | 0.514 | |
| 0.442 | 0.504 | 0.524 | 0.430 | 0.545 | 0.496 | 0.554 | 0.509 | 0.637 | |
| 0.335 | 0.359 | 0.366 | 0.341 | 0.382 | 0.378 | 0.276 | 0.309 | 0.301 | |
| 0.516 | 0.462 | 0.591 | 0.537 | 0.673 | 0.614 | 0.551 | 0.691 | 0.634 | |
| 0.567 | 0.675 | 0.646 | 0.584 | 0.673 | 0.664 | 0.585 | 0.764 | 0.665 | |
| 0.490 | 0.530 | 0.554 | 0.480 | 0.400 | 0.546 | 0.519 | 0.473 | 0.599 | |
| 0.409 | 0.393 | 0.461 | 0.396 | 0.400 | 0.451 | 0.377 | 0.491 | 0.447 | |
| 0.456 | 0.530 | 0.547 | 0.481 | 0.400 | 0.576 | 0.555 | 0.655 | 0.637 | |
| 0.471 | 0.547 | 0.534 | 0.320 | 0.364 | 0.376 | 0.279 | 0.273 | 0.311 | |
| 0.539 | 0.590 | 0.621 | 0.573 | 0.582 | 0.655 | 0.558 | 0.745 | 0.635 | |
| 0.465 | 0.538 | 0.525 | 0.341 | 0.327 | 0.378 | 0.313 | 0.327 | 0.342 | |
| Mean | 0.483 | 0.527 | 0.552 | 0.467 | 0.492 | 0.536 | 0.473 | 0.541 | 0.540 |
| SD | 0.062 | 0.082 | 0.073 | 0.093 | 0.115 | 0.107 | 0.116 | 0.161 | 0.136 |
Figure 1Clustering analysis by structure for the full-population dataset assuming K = 2, 3, or 4.
Populations were ordered according to their respective geographic label above the plot. Population names are beneath the plot. Data presented here are the results with the highest posterior probabilities during 20 runs of each K setting from STRUCTURE (treated in CLUMPP and plotted with DISTRUCT). The upper three plots (A–C) inferred ancestry with K = 2, 3, or 4 in the “no-admixture” model, respectively. The lower three plots (D–F) inferred ancestry with K = 2, 3, or 4 in the “admixture” model, respectively.
Figure 2Geographical locations of the three Chinese subject populations.
The map was created in ArcGIS 10.2 software (ESRI Inc., Redlands, CA, USA) (http://www.esri.com/).
Averaged heterozygosities (H ) and genetic map positions for the 13 analyzed markers based on a previous study of Chinese Han subjects.
| B1 | Y1 | G3 | R3 | Y3 | G4 | Y2 | R1 | G1 | G2 | R2 | B2 | G5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6.3 | 18.5 | 29.1 | 30.4 | 42.8 | 44.0 | 90.9 | 93.6 | 120.7 | 134.1 | 136.6 | 159.8 | 162.6 | |
| 0.632 | 0.554 | 0.366 | 0.525 | 0.547 | 0.591 | 0.461 | 0.534 | 0.567 | 0.524 | 0.621 | 0.611 | 0.646 |