| Literature DB >> 31827276 |
Enrico Macholdt1, Leonardo Arias1, Nguyen Thuy Duong2, Nguyen Dang Ton2, Nguyen Van Phong2, Roland Schröder1, Brigitte Pakendorf3, Nong Van Hai4, Mark Stoneking5.
Abstract
Vietnam exhibits great cultural and linguistic diversity, yet the genetic history of Vietnamese populations remains poorly understood. Previous studies focused mostly on the majority Kinh group, and thus the genetic diversity of the many other groups has not yet been investigated. Here we analyze complete mtDNA genome sequences and ~2.3 Mb sequences of the male-specific portion of the Y chromosome from the Kinh and 16 minority populations, encompassing all five language families present in Vietnam. We find highly variable levels of diversity within and between groups that do not correlate with either geography or language family. In particular, the Mang and Sila have undergone recent, independent bottlenecks, while the majority group, Kinh, exhibits low levels of differentiation with other groups. The two Austronesian-speaking groups, Giarai and Ede, show a potential impact of matrilocality on their patterns of variation. Overall, we find that isolation, coupled with limited contact involving some groups, has been the major factor influencing the genetic structure of Vietnamese populations, and that there is substantial genetic diversity that is not represented by the Kinh.Entities:
Mesh:
Year: 2019 PMID: 31827276 PMCID: PMC7171127 DOI: 10.1038/s41431-019-0557-4
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Fig. 1Map of sampling locations. Dots show average sampling locations per population.
Population labels are color coded by language family with Austro-Asiatic in purple, Tai-Kadai in red, Austronesian in orange, Hmong-Mien in yellow, and Sino-Tibetan in turquoise.
Fig. 2Diversity statistics shown as the percent difference from the mean.
a The haplotype diversity (H) and b the nucleotide diversity (π). Crosses and dots denote the MSY and mtDNA values, respectively. Population labels are color coded by language family with Austro-Asiatic in purple, Tai-Kadai in red, Austronesian in orange, Hmong-Mien in yellow, and Sino-Tibetan in turquoise. The gray line shows the mean across populations.
Fig. 3Frequency of shared haplotypes between populations.
mtDNA (upper triangle) and MSY (lower triangle) shared haplotype frequencies are represented by the blue and red color scale, respectively. White squares indicate no sharing. Population labels are color coded by language family with Austro-Asiatic in purple, Tai-Kadai in red, Austronesian in orange, Hmong-Mien in yellow, and Sino-Tibetan in turquoise.
Fig. 4MDS plots based on ΦST distances.
a mtDNA and b MSY. Stress values are in percent. Population labels are color coded by language family with Austro-Asiatic in purple, Tai-Kadai in red, Austronesian in orange, Hmong-Mien in yellow, and Sino-Tibetan in turquoise.
AMOVA results.
| Grouping | Number of groups | Marker | Percent variation explained | ||
|---|---|---|---|---|---|
| Among groups | Within groups among populations | Within populations | |||
| – | mtDNA | – | 9.95*** | 90.05*** | |
| Language family | 5 | mtDNA | 1.2 | 8.9*** | 89.9*** |
| District | 7 | mtDNA | 2.5* | 7.8*** | 89.7*** |
| Region | 4 | mtDNA | 0.02 | 9.9*** | 90.1*** |
| – | MSY | – | 22.8*** | 77.2*** | |
| Language family | 5 | MSY | 4.4* | 18.8*** | 76.8*** |
| District | 7 | MSY | 0.5 | 22.3*** | 77.2*** |
| Region | 4 | MSY | −1.0 | 23.5*** | 77.5*** |
Populations included in each group for each classification are indicated below the table. Language family: (Kinh, Mang) (Thai, Tay, Nung, Lachi, Colao) (Ede, Giarai) (Hmong, Dao, Pathen) (Sila, Lolo, Phula, Lahu, Hanhi). Region: (Mang, Gelao, Lachi, Lolo, Nung, Pathen, Dao) (Tay, Thai, Hanhi, Hmong, Lahu, Phula, Sila) (Kinh) (Giarai, Ede). Districts: (Lolo) (Giarai, Ede) (Colao, Dao, Lachi, Nung, Tay) (Kinh) (Hanhi, Lahu, Mang, Sila) (Hmong, Thai) (Pathen, Phula)
*p value < 0.05; ***p value < 0.001
Correlation coefficients obtained in the Mantel correlation tests.
| All populations | Excluding Kinh | Excluding Ede and Giarai | |
|---|---|---|---|
| mtDNA—geography | 0.26* | 0.24* | −0.22 |
| MSY—geography | −0.07 | −0.10 | −0.24 |
| mtDNA—MSY | 0.17* | 0.11 | 0.21* |
*p value < 0.05