| Literature DB >> 34992635 |
Bin Ma1, Jinwen Chen2, Xiaomin Yang3, Jingya Bai1, Siwei Ouyang1, Xiaodan Mo1, Wangsheng Chen1, Chuan-Chao Wang2,3,4, Xiangjun Hai1.
Abstract
Northwest China is a contacting region for East and West Eurasia and an important center for investigating the migration and admixture history of human populations. However, the comprehensive genetic structure and admixture history of the Altaic speaking populations and Hui group in Northwest China were still not fully characterized due to insufficient sampling and the lack of genome-wide data. Thus, We genotyped genome-wide SNPs for 140 individuals from five Chinese Mongolic, Turkic speaking groups including Dongxiang, Bonan, Yugur, and Salar, as well as the Hui group. Analysis based on allele-sharing and haplotype-sharing were used to elucidate the population history of Northwest Chinese populations, including PCA, ADMIXTURE, pairwise Fst genetic distance, f-statistics, qpWave/qpAdm and ALDER, fineSTRUCTURE and GLOBETROTTER. We observed Dongxiang, Bonan, Yugur, Salar, and Hui people were admixed populations deriving ancestry from both East and West Eurasians, with the proportions of West Eurasian related contributions ranging from 9 to 15%. The genetic admixture was probably driven by male-biased migration- showing a higher frequency of West Eurasian related Y chromosomal lineages than that of mtDNA detected in Northwest China. ALDER-based admixture and haplotype-based GLOBETROTTER showed this observed West Eurasian admixture signal was introduced into East Eurasia approximately 700 ∼1,000 years ago. Generally, our findings provided supporting evidence that the flourish transcontinental communication between East and West Eurasia played a vital role in the genetic formation of northwest Chinese populations.Entities:
Keywords: admixture history; gansu; gene flow; genetic structure; northwest China; steppe population; trans-Eurasia; west Eurasia
Year: 2021 PMID: 34992635 PMCID: PMC8724515 DOI: 10.3389/fgene.2021.795570
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1The geographical map of our samples collection.
FIGURE 2The results of ROH calculation. (A) the length distribution; (B) the average length.
FIGURE 3Patterns of genetic relationship among published East Asian populations and our newly genotyped five populations inferred from the principal component analysis. (A) East Asians including southern populations and with the West Eurasians; (B) East Asians without southern populations and without the West Eurasians.
FIGURE 4ADMIXTURE analysis result visualization at K = 5 as the corresponding cross-validation error was the lowest. And our studied populations in Gansu were marked by red color.
FIGURE 5Phylogenetic tree among our studied populations in Gansu and reference populations in Eurasia. Our samples in Gansu province were marked with red color.
FIGURE 6The heat map of sharing haplotypes and clustering dendrogram by fineSTRUCTURE. DX = Dongxiang. (A) the dendrogram. (B) the heat map of sharing haplotypes.
FIGURE 7Heatmap results of the outgroup-f statistics of the form f (X, Y; Mbuti). The larger values indicated that they shared more genetic drifts. Here the Outgroup was Mbuti.
Admixture f3 statistics of the form (Source1, Source2; Target) with the lowest f3 values.
| Source 1 | Source 2 | Target | f_3 | Std. err | Z | SNPs |
|---|---|---|---|---|---|---|
| Kazakhstan_Andronovo.SG | Upper_YR_LN | Hui | −0.010305 | 0.001639 | −6.288 | 55271 |
| Kazakhstan_Andronovo.SG | Upper_YR_IA | Hui | −0.009701 | 0.001942 | −4.995 | 53232 |
| Kazakhstan_Kangju.SG | Shimao_LN | Hui | −0.009247 | 0.001029 | −8.99 | 161279 |
| Russia_Alan.SG | WLR_LN | Hui | −0.009221 | 0.001209 | −7.63 | 136872 |
| Russia_Alan.SG | YR_LBIA | Hui | −0.009003 | −0.000714 | −12.609 | 165864 |
| Russia_Alan.SG | WLR_LN | Dongxiang | −0.014067 | −0.001139 | −12.348 | 138809 |
| CHB.SG | Anatolia_N | Dongxiang | −0.013755 | −0.000316 | −43.553 | 172663 |
| CHB.SG | Russia_MLBA_Sintashta | Dongxiang | −0.013573 | 0.000324 | −41.928 | 171061 |
| Anatolia_N | YR_LBIA | Dongxiang | −0.013525 | −0.000628 | −21.527 | 170054 |
| Kazakhstan_Kangju.SG | Shimao_LN | Dongxiang | −0.013429 | −0.00097 | −13.847 | 163617 |
| Russia_MLBA_Sintashta | Miaozigou_MN | Bonan | −0.010536 | 0.001382 | −7.621 | 49714 |
| Kazakhstan_Andronovo.SG | Upper_YR_LN | Bonan | −0.010432 | 0.001621 | −6.436 | 55987 |
| Russia_Alan.SG | WLR_LN | Bonan | −0.010298 | 0.001191 | −8.645 | 138290 |
| Kazakhstan_Kangju.SG | Shimao_LN | Bonan | −0.010185 | −0.000996 | −10.226 | 163067 |
| CHB.SG | Anatolia_N | Bonan | −0.010179 | −0.000311 | −32.782 | 172482 |
| Kazakhstan_Andronovo.SG | Upper_YR_LN | Yugur | −0.008896 | 0.001651 | −5.388 | 55670 |
| Russia_Alan.SG | Wuzhuangguoliang | Yugur | −0.008814 | 0.002125 | −4.148 | 28513 |
| Russia_MLBA_Sintashta | Miaozigou_MN | Yugur | −0.008676 | 0.001367 | −6.346 | 49569 |
| Kazakhstan_Kangju.SG | Shimao_LN | Yugur | −0.00843 | 0.001004 | −8.393 | 162332 |
| Kazakhstan_Andronovo.SG | Upper_YR_IA | Yugur | −0.008279 | 0.001914 | −4.326 | 53651 |
| Kazakhstan_Andronovo.SG | Upper_YR_LN | Salar | −0.011922 | 0.001611 | −7.4 | 55593 |
| Kazakhstan_Andronovo.SG | Upper_YR_IA | Salar | −0.011297 | 0.001916 | −5.897 | 53574 |
| Kazakhstan_Kangju.SG | Shimao_LN | Salar | −0.011024 | −0.001022 | −10.781 | 162165 |
| Russia_Alan.SG | WLR_LN | Salar | −0.010966 | 0.001182 | −9.28 | 137507 |
| Russia_Alan.SG | Shimao_LN | Salar | −0.010849 | −0.000996 | −10.893 | 165726 |
FIGURE 8qpAdm based admixture models for the populations in our study in Gansu province. The 2-way admixture models for our Gansu samples were presented when the p values >0.05 at the rank = 1. (A) Hui, Bonan, Yugur, Salar ethnic groups. (B) Dongxiang ethnic group.
Two-way qpAdm models of studied populations in Gansu.
| Studied population | Proportion | Std. err | Proportion | Std. err |
|
|---|---|---|---|---|---|
| Russia_Andronovo.SG | YR_LBIA | ||||
| Hui | 0.091 | 0.007 | 0.909 | 0.007 | 0.206 |
| Bonan | 0.109 | 0.007 | 0.891 | 0.007 | 0.0536 |
| Yugur | 0.111 | 0.007 | 0.889 | 0.007 | 0.546 |
| Salar | 0.118 | 0.007 | 0.882 | 0.007 | 0.098 |
| Russia_Alan.SG | WLR_LN | ||||
| Dongxiang | 0.149 | 0.011 | 0.851 | 0.011 | 0.304 |
The Y-chromosome haplogroups distribution of our studied populations.
| Y Haplogroup | Frequency | |
|---|---|---|
| Hui | D1a1a1a1a2a∼ | 0.100 |
| H1a1a1a | 0.100 | |
| J2a1a | 0.100 | |
| J2a1h2b | 0.100 | |
| J2a2 | 0.100 | |
| N1a2b3 | 0.100 | |
| O2a2a1a2a1a | 0.100 | |
| O2a2b1a1a6b | 0.100 | |
| R1a1a1b2 | 0.200 | |
| Dongxiang | D1a1a1a2 | 0.133 |
| E1b1a1a1a2a1a3b1a10b∼ | 0.067 | |
| J2a1h2 | 0.200 | |
| J2a2 | 0.067 | |
| L1a2a1b2∼ | 0.067 | |
| N1a1a1a1a3a2a∼ | 0.067 | |
| N1a3∼ | 0.067 | |
| O2a2b1a1a6 | 0.133 | |
| R1a1a1b2 | 0.067 | |
| R2a2 | 0.133 | |
| Bonan | C2b1a2a2a∼ | 0.0625 |
| D1a1a1a1a2a∼ | 0.125 | |
| D1a2a1∼ | 0.0625 | |
| J2a1h2 | 0.0625 | |
| N1a2b3a∼ | 0.0625 | |
| O1b1a1a1a1b1b | 0.0625 | |
| O1b1a1a1a2 | 0.0625 | |
| O2a2b1a1a | 0.125 | |
| O2a2b1a2a1d | 0.0625 | |
| Q1b1a3b1a1∼ | 0.0625 | |
| Q2a1c1b1∼ | 0.0625 | |
| R1a1a1b2 | 0.1875 | |
| Yugur | C2b1a1 | 0.133 |
| C2b1a3b∼ | 0.067 | |
| D1a1a1a1a2a∼ | 0.133 | |
| D1a1a1a2 | 0.067 | |
| O2a2b1a1a6 | 0.067 | |
| O2a2b1a2a1a2 | 0.133 | |
| O2a2b1a2b2 | 0.133 | |
| O2a2b2a2a1 | 0.067 | |
| Q1b1a3a∼ | 0.067 | |
| Q1b2b1b2b2∼ | 0.133 | |
| Salar | I2a2a1b2a1b1b2a2∼ | 0.091 |
| J2a1 | 0.091 | |
| N1b2a2∼ | 0.091 | |
| O1b1a1a1b2 | 0.182 | |
| O2a1a1b1a2 | 0.091 | |
| O2a1c1a1a1a1a1b1a∼ | 0.091 | |
| O2a2b1a2a1a1a1 | 0.091 | |
| R1a1a1b2 | 0.273 |
The mtDNA haplogroups distribution for our studied populations.
| Hui | Dongxiang | Bonan | Yugur | Salar | |||||
|---|---|---|---|---|---|---|---|---|---|
| Haplogroup | Frequency | Haplogroup | Frequency | Haplogroup | Frequency | Haplogroup | Frequency | Haplogroup | Frequency |
| A16 | 0.041666667 | A | 0.06666667 | A | 0.03333333 | A1 | 0.1 | A | 0.115385 |
| B4c1b2c | 0.041666667 | A1 | 0.1 | B4 | 0.06666667 | A6b | 0.033333333 | A18 | 0.038462 |
| B5b2 | 0.041666667 | A6b | 0.03333333 | B5 | 0.06666667 | B4a3 | 0.033333333 | A5b1b | 0.038462 |
| C4d | 0.041666667 | B4 | 0.1 | B6a | 0.03333333 | C4 | 0.1 | A8a | 0.038462 |
| D4 | 0.083333333 | C5d2 | 0.03333333 | C4 | 0.1 | D4 | 0.4 | B4b1a2a | 0.038462 |
| D5 | 0.041666667 | D4 | 0.1 | C5b1b | 0.03333333 | D5a2a1 | 0.033333333 | C4d | 0.038462 |
| D5a2a1a1 | 0.04166667 | D5 | 0.06666667 | D4 | 0.2 | F1g | 0.1 | D4 | 0.076923 |
| F1 | 0.125 | F1 | 0.06666667 | F1g | 0.03333333 | M9a1a1c1b1a | 0.066666667 | F1 | 0.115385 |
| F4a2 | 0.041666667 | F2 | 0.1 | G2 | 0.1 | M9a1b1 | 0.033333333 | F3a1 | 0.038462 |
| G3a1′2 | 0.083333333 | F4b | 0.03333333 | H | 0.06666667 | R9b1a3 | 0.033333333 | G1a1 | 0.038462 |
| M7 | 0.125 | H15 | 0.03333333 | M10a1a1b | 0.03333333 | U4b1a1a1 | 0.033333333 | G2a | 0.076923 |
| M8 | 0.083333333 | H5 | 0.06666667 | M7b1a | 0.03333333 | U7a | 0.033333333 | H7b1 | 0.038462 |
| N9a2 | 0.04166667 | M7b1a1a3 | 0.03333333 | M8 | 0.06666667 | M11a2 | 0.038462 | ||
| Z3 | 0.083333333 | M8 | 0.06666667 | M9a1a1c1a | 0.03333333 | M21b | 0.038462 | ||
| Z4 | 0.083333333 | T2a1a | 0.03333333 | X2b4 | 0.03333333 | M9a1b1 | 0.115385 | ||
| X2 | 0.03333333 | Z3a | 0.06666667 | Z3 | 0.115385 | ||||
| Y1b1a | 0.03333333 | ||||||||
The admixture time estimation by ALDER for our studied populations.
| Population | 1-Ref weighted LD with weights Sardinian (generation) | Z-score | 1-Ref weighted LD with weights Han_HGDP (generation) | Z-score | 2-Ref weighted LD with weights Sardinian and Han_HGDP (generation) | Z-score |
|---|---|---|---|---|---|---|
| Hui | 34.98 ± 4.20 | 8.32 | 97.35 ± 35.92 | 2.71 | 31.36 ± 3.27 | 9.58 |
| Dongxiang | 28.71 ± 2.60 | 11.03 | 40.77 ± 7.40 | 5.51 | 26.73 ± 2.61 | 10.24 |
| Bonan | 33.21 ± 2.42 | 13.72 | - | - | 26.08 ± 2.50 | 10.42 |
| Yugur | 33.65 ± 4.53 | 7.42 | - | - | 25.32 ± 3.81 | 6.65 |
| Salar | 25.70 ± 3.64 | 7.07 | 33.74 ± 11.91 | 2.83 | 24.77 ± 3.83 | 6.47 |
The admixture events of our studied populations by GLOBETROTTER.
| Recipient.Population | Model | Gen.1date | Proportion.source1 | Bestmatch.event1.source1 | Bestmatch.event1.source2 | Proportion.event2.source1 | Bestmatch.event2.source1 | Bestmatch.event2.source2 | MaxR2fit.1date | Fit.quality.1event | Fit.quality.2events | Gen.2dates.date1 | Gen.2dates.date2 | Proportion.date1.source1 | Bestmatch.date1.source1 | Bestmatch.date1.source2 | Proportion.date2.source1 | Bestmatch.date2.source1 | Bestmatch.date2.source2 | MaxScore.2events |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hui_Gansu | 1-DATE | 24.88801703 | 0.18 | English | Han_NChina | 0.2 | Kinh_Vietnam | Salar_Gansu | 0.920102825 | 0.99996966 | 0.999996517 | 1.000004327 | 24.44514919 | 0.42 | Atayal | Salar_Gansu | 0.18 | English | Han_NChina | 0.111809238 |
| Dongxiang_Gansu | 1-DATE | 21.43093694 | 0.24 | English | Han_NChina | 0.41 | Mongol | Dongxiang | 0.943671053 | 0.999996881 | 0.999999612 | 1.000023459 | 26.76153143 | 0.13 | Turkmen | Tu | 0.24 | English | Han_NChina | 0.154655956 |
| Bonan_Gansu | multiple-dates | 24.95247758 | 0.19 | English | Han_NChina | 0.49 | Yugur | Bonan | 0.918212959 | 0.999969001 | 0.9999979 | 8.270277733 | 30.57366368 | 0.34 | Uyghur.DG | Bonan | 0.18 | English | Han_NChina | 0.478201465 |
| Yugur_Gansu | 1-DATE | 23.47510082 | 0.16 | English | Yugur | 0.06 | Atayal | Tibetan_Lhasa | 0.884253347 | 0.999991289 | 0.999998974 | 11.71119508 | 44.60320405 | 0.07 | English | Tu | 0.13 | English | Yugur | 0.073485128 |
| Salar_Gansu | 1-DATE | 20.80295557 | 0.2 | English | Han_NChina | 0.04 | Atayal | Hui_Gansu | 0.943365942 | 0.99999999 | 1 | 10.74070225 | 43.5990788 | 0.07 | English | Hui_Gansu | 0.16 | English | Han_NChina | 0.176636782 |