| Literature DB >> 35047024 |
Yan Liu1,2, Jie Xie1, Mengge Wang3,4, Changhui Liu3, Jingrong Zhu5, Xing Zou6, Wenshan Li7, Lin Wang8, Cuo Leng7, Quyi Xu3, Hui-Yuan Yeh9, Chuan-Chao Wang10,11,12, Xiaohong Wen1, Chao Liu3,4, Guanglin He9,10,11,12.
Abstract
Hmong-Mien (HM) -speaking populations, widely distributed in South China, the north of Thailand, Laos, and Vietnam, have experienced different settlement environments, dietary habits, and pathogenic exposure. However, their specific biological adaptation remained largely uncharacterized, which is important in the population evolutionary genetics and Trans-Omics for regional Precision Medicine. Besides, the origin and genetic diversity of HM people and their phylogenetic relationship with surrounding modern and ancient populations are also unknown. Here, we reported genome-wide SNPs in 52 representative Miao people and combined them with 144 HM people from 13 geographically representative populations to characterize the full genetic admixture and adaptive landscape of HM speakers. We found that obvious genetic substructures existed in geographically different HM populations; one localized in the HM clines, and others possessed affinity with Han Chinese. We also identified one new ancestral lineage specifically existed in HM people, which spatially distributed from Sichuan and Guizhou in the north to Thailand in the south. The sharing patterns of the newly identified homogenous ancestry component combined the estimated admixture times via the decay of linkage disequilibrium and haplotype sharing in GLOBETROTTER suggested that the modern HM-speaking populations originated from Southwest China and migrated southward in the historic period, which is consistent with the reconstructed phenomena of linguistic and archeological documents. Additionally, we identified specific adaptive signatures associated with several important human nervous system biological functions. Our pilot work emphasized the importance of anthropologically informed sampling and deeply genetic structure reconstruction via whole-genome sequencing in the next step in the deep Chinese Population Genomic Diversity Project (CPGDP), especially in the regions with rich ethnolinguistic diversity.Entities:
Keywords: Chinese Population Genetic Diversity Project (CPGDP); HM people; biological adaptation; genetic admixture model; genome-wide SNPs
Year: 2022 PMID: 35047024 PMCID: PMC8762323 DOI: 10.3389/fgene.2021.815160
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Genetic affinity of HM people in the context of modern and ancient eastern Eurasians. (A–C), Principal component analyses focused on the genetic diversity from East Asian, South Chinese populations, and HM-speaking populations. Included East Asian ancient populations were projected onto the modern genetic background. Populations were color-coded based on geographical and linguistic categories. (D) Model-based ADMIXTURE results with five predefined ancestral sources showed the ancestral cluttering pattern and their individual ancestral proportion.
FIGURE 2Chromosome painting between Hmong–Mien people and other East Asian reference populations. (A,B) Amount of total length of DNA fragments of modern East Asians copied from donor chromosomes related to Sichuan Miao. (C,D) Average DNA chunk of Sichuan Miao copied from other East Asians. Statistical indexes showed the results of inter-population comparisons. Only unrelated individuals are used here.
FIGURE 3Fine-scale population genetic structure based on the shared haplotype data. (A–C) PCA results based on the coancestry matrix showed a genetic relationship among modern East Asians. The color showed the re-classification of the homogenous population label. (D,E) Clustering patterns of individual-level and population-level East Asians based on the pairwise coincidence matrixes.
FIGURE 4Pairwise qpWave analysis showed the genetic heterogeneity and homogeneity among East Asians. p-values of rank1 tests larger than 0.05 showed the genetic homogeneity among two reference populations, which are marked as “++”, and p values of rank1 tests larger than 0.01 are marked as “+.”
FIGURE 5Results of qpAdm models and TreeMix-based phylogenies. (A) Two-way admixture models showed ancestry comparison in different ancestral source pairs. (B,C) TreeMix-based phylogenetic tree with two migration events showed the genetic relationship between East Asians.
FIGURE 6Deep population history reconstruction based on the best-fitted qpGraph models. Different frameworks in the qpGraph-based models adding the late Neolithic Fujian population (SEastAsia_Coastal_LN, (A), Mongolian Plateau Hunter-Gatherer (Boisman, (B), Australian (C), and Mixe (D).
FIGURE 7Allele frequency spectrum of observed maternal and paternal haplogroups of Chongqing Miao and Chongqing Han. Population comparison between Han and Miao based on the frequency distribution of the observed paternal lineages (A) and maternal lineages (B) via the Pearson and Cramer tests.
FIGURE 8Manhattan showed the natural selection signatures and enrichment analysis. (A) p-values of XPEHH in the Miao population using northern Han as the reference population. (B) Overlap among three gene lists based on gene-level and shared-term level, where blue curves link genes that belong to the same enriched ontology term. The inner-circle represents gene lists, where hits are arranged along the arc. Genes that hit multiple lists are colored in dark orange, and genes unique to a list are shown in light orange. (C) Heatmap of top twenty enriched terms across three input gene lists, colored by p-values. (D) Top 20 clusters with their representative enriched terms. (E) Network of enriched terms colored by cluster-ID.