| Literature DB >> 33202591 |
Chong Chen1,2,3, Yuchun Li4, Ruiyang Tao2,5, Xiaoye Jin1, Yuxin Guo1, Wei Cui3, Anqi Chen2,6, Yue Yang2,7, Xingru Zhang1, Jingyi Zhang2, Chengtao Li2,3,5,6,7, Bofeng Zhu1,3,8.
Abstract
Mitochondrial DNA (mtDNA), coupled with maternal inheritance and relatively high mutation rates, provides a pivotal way for us to investigate the formation histories of populations. The Hui minority with Islamic faith is one of the most widely distributed ethnic groups in China. However, the exploration of Hui's genetic architecture from the complete mitochondrial genome perspective has not been detected yet. Therefore, in this study, we employed the complete mitochondrial genomes of 98 healthy and unrelated individuals from Northwest China, as well as 99 previously published populations containing 7274 individuals from all over the world as reference data, to comprehensively dissect the matrilineal landscape of Hui group. Our results demonstrated that Hui group exhibited closer genetic relationships with Chinese Han populations from different regions, which was largely attributable to the widespread of haplogroups D4, D5, M7, B4, and F1 in these populations. The demographic expansion of Hui group might occur during the Late Pleistocene. Finally, we also found that Hui group might have gene exchanges with Uygur, Tibetan, and Tajik groups in different degrees and retained minor genetic imprint of European-specific lineages, therefore, hinting the existence of multi-ethnic integration events in shaping the genetic landscape of Chinese Hui group.Entities:
Keywords: Hui minority; complete mitochondrial genome; massively parallel sequencing; mitochondrial haplogroup; population genetics
Mesh:
Substances:
Year: 2020 PMID: 33202591 PMCID: PMC7698084 DOI: 10.3390/genes11111352
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The mitochondrial haplogroup distributions of 98 Hui individuals from Northwest China. European-specific lineages represented by EL, and East Asian-specific lineages represented by EAL.
Diversity indexes and neutrality tests for the studied Hui population and interested populations based on a complete mitogenome.
| Genetic Diversities | Neutrality Tests | Mismatch Distributions | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Population | Abbreviation | n | h | Hd (SD) | S (Eta) | k | Pi (SD) | Tajima’s | Fu’s | SSD ( | HRI ( |
| Hui in Northwest China | HU | 98 | 97 | 1.000(0.002) | 640(650) | 37.06 | 0.0022(0.00006) | −2.39 (0.000) | −24.09 (0.001) | 0.0015 (0.520) | 0.0005 (0.989) |
| Beijing Han Chinese | CHB | 85 | 85 | 1.000(0.002) | 614(619) | 38.37 | 0.0023(0.00005) | −2.37 (0.000) | −24.15 (0.001) | 0.0029 (0.429) | 0.0007 (0.999) |
| Southern Han Chinese | CHS | 50 | 50 | 1.000(0.004) | 442(445) | 38.36 | 0.0023(0.00007) | −2.23 (0.001) | −21.91 (0.000) | 0.0021 (0.652) | 0.0015 (0.981) |
| Minnan Han | MIN | 50 | 48 | 0.998(0.004) | 397(400) | 36.09 | 0.0022(0.00006) | −2.16 (0.003) | −17.98 (0.000) | 0.0013 (0.744) | 0.0014 (0.983) |
| Hakka Han | HAK | 45 | 39 | 0.994(0.006) | 369(372) | 34.73 | 0.0021(0.00007) | −2.16 (0.002) | −7.38 (0.035) | 0.0022 (0.510) | 0.0046 (0.171) |
n: Number of sequences; h: Number of haplotypes; Hd: Haplotype diversity (standard deviation); S: Number of segregating sites (total number of mutations); k: Average number of pairwise nucleotide differences; Pi: Nucleotide diversity; SSD: Sum of squared deviations; HRI: Harpending’s raggedness index.
Analysis of molecular variance (AMOVA) results based on different groups for worldwide populations.
| Percentage of Variation | |||||
|---|---|---|---|---|---|
| Groupings | Number of Populations | Number of Groups | Within Populations | Among Populations with in Groups | Among Groups |
| Linguistic families of worldwide populations | 100 | 13 | 92.01 | 5.57 | 2.42 |
| Geographic distributions of worldwide populations | 100 | 9 | 92.08 | 5.95 | 1.97 |
Figure 2A principal component analysis (PCA) plot showing the genetic relationships between populations based on the haplogroup frequencies of 7372 complete mitochondrial sequences from 100 worldwide populations, the Hui group, and 99 reference populations. The red arrow refers to the studied Hui group (HU). The geographical origins of 100 populations in the plot are listed as follows: six populations from Africa, including ESN, GWD, LWK, MSL, YRI, and PGM; six from America, including ACB, PEL, MXL, CLM, PUR, and ASW; 16 from Central Asia, including LK, LU1, EPK, LU2, BRH, LTJ, PT, ST, WT, BLC, HZR, KLS, PTH, SDH, BRS, and HU; 29 from East Asia, including MOG, KHV, TSO, ATA, BUN, SAI, MAK, PAI, RUK, AMI, PUY, TAO, JPT, CHB, MIN, HAK, CHS, CDX, CDT, KMT, LZT, LST, NQT, DB, LB, NCT, SNT, SP, and ZBT; seven from Europe, including BSF, GBR, CEU, TSI, IBS, PLS, and FIN; 10 from North Asia, including UDR, EVR, EKR, YAR, AKR, BYR, HMR, NVR, KYR, and YUR; seven from South Asia, including ITU, STU, BEB, NP, GIH, PJL, and DR; 18 from Southeast Asia, including MOT, MLS, ABP, ATP, BGP, IBP, IFP, IVP, KKP, KLP, MRP, KRT, LAO, CTT, KHT, LUT, YUT, and BMS; and one from West Asia, PSI.
Figure 3A circle histogram of pairwise F values between Hui and 99 worldwide populations based on the haplogroup frequencies.
Figure 4The Bayesian skyline plot (BSP) analyses are performed with the complete mitochondrial sequences to detect the demographic history of Hui (a), CHB (b), CHS (c), and MIN (d) populations. The Y-axis represents the assumed effective population size on a logscale. The black lines in bold represent the median population size. The blue line demarcates the boundary of the 95% highest posterior density.
Figure 5The median network of mitochondrial D4 haplogroup. The left plot (a) colored by geographic origins, and the right plot (b) colored by haplogroups. The size of the nodes is proportional to the number of individuals carrying that node. The length of the branch is positively correlated with the number of different mutations, the longer the branch, the more different the mutations.
Figure 6The median network of European-specific lineages appearing in Hui group coupled with reference individuals from worldwide populations. The size of the nodes is proportional to the number of individuals carrying that node. The length of the branch is positively correlated with the number of different mutations, the longer the branch, the more different the mutations.