| Literature DB >> 25377941 |
Haihua Bai1, Xiaosen Guo2, Dong Zhang3, Narisu Narisu4, Junjie Bu5, Jirimutu Jirimutu1, Fan Liang6, Xiang Zhao6, Yanping Xing3, Dingzhu Wang1, Tongda Li7, Yanru Zhang3, Baozhu Guan8, Xukui Yang6, Zili Yang3, Shuangshan Shuangshan9, Zhe Su6, Huiguang Wu1, Wenjing Li6, Ming Chen10, Shilin Zhu6, Bayinnamula Bayinnamula1, Yuqi Chang6, Ying Gao1, Tianming Lan6, Suyalatu Suyalatu1, Hui Huang6, Yan Su6, Yujie Chen1, Wenqi Li6, Xu Yang6, Qiang Feng2, Jian Wang11, Huanming Yang12, Jun Wang13, Qizhu Wu1, Ye Yin6, Huanmin Zhou3.
Abstract
Mongolians have played a significant role in modern human evolution, especially after the rise of Genghis Khan (1162[?]-1227). Although the social cultural impacts of Genghis Khan and the Mongolian population have been well documented, explorations of their genome structure and genetic imprints on other human populations have been lacking. We here present the genome of a Mongolian male individual. The genome was de novo assembled using a total of 130.8-fold genomic data produced from massively parallel whole-genome sequencing. We identified high-confidence variation sets, including 3.7 million single nucleotide polymorphisms (SNPs) and 756,234 short insertions and deletions. Functional SNP analysis predicted that the individual has a pathogenic risk for carnitine deficiency. We located the patrilineal inheritance of the Mongolian genome to the lineage D3a through Y haplogroup analysis and inferred that the individual has a common patrilineal ancestor with Tibeto-Burman populations and is likely to be the progeny of the earliest settlers in East Asia. We finally investigated the genetic imprints of Mongolians on other human populations using different approaches. We found varying degrees of gene flows between Mongolians and populations living in Europe, South/Central Asia, and the Indian subcontinent. The analyses demonstrate that the genetic impacts of Mongolians likely resulted from the expansion of the Mongolian Empire in the 13th century. The genome will be of great help in further explorations of modern human evolution and genetic causes of diseases/traits specific to Mongolians.Entities:
Keywords: Mongolian genome; de novo assembly; genetic imprints; genetic variations; patrilineal origin
Mesh:
Substances:
Year: 2014 PMID: 25377941 PMCID: PMC4540083 DOI: 10.1093/gbe/evu242
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FMapping depth, detection strategy, and variant composition (SNP and short indel) in construction of genetic variation map. (A) Depth distribution based on the alignment. (B) The strategy of SNP and short indel identification. (C, D) Composition of SNP and short indel sets of the Mongolian genome compared with variant sets of dbSNP and 1000 genomes project.
Summary of the Mongolian Genome Assembly
| Contig | Scaffold | |||
|---|---|---|---|---|
| Size (bp) | Number | Size (bp) | Number | |
| N90 | 13,773 | 52,725 | 1,461,661 | 431 |
| N80 | 24,225 | 37,548 | 2,880,171 | 292 |
| N70 | 34,160 | 27,774 | 4,350,155 | 210 |
| N60 | 44,637 | 20,547 | 5,856,852 | 154 |
| N50 | 56,244 | 14,915 | 7,632,466 | 111 |
| Longest | 517,634 | — | 35,963,476 | — |
| Total size | 2,823,488, 473 | — | 2,881,945, 563 | — |
| Total number (≥100 bp) | — | 320,927 | — | 221,013 |
| Total number (≥2 kb) | — | 84,214 | — | 3,251 |
FGenetic variation pattern of Mongolian genome. Distribution of genetic variations by chromosomes, including SNP, short indel, and SV. The sizes of window and sliding step are 1 and 0.5 Mb for SNP and short indel, 2 and 1 Mb for SV.
Risks of Mendelian Diseases Based on Functional SNPs
| Disease | Gene | SNP ID | Mutation | Pathogenicity |
|---|---|---|---|---|
| Systemic primary Carnitine deficiency | SLC22A5 | rs60376624 | c.1400C > G (p.Ser467Cys) | Pathogenic |
| Endometrial cancer | MLH3 | rs17102999 | c.2825C > T (p.Thr942Ile) | Likely pathogenic |
| Crohn’s disease | IL23R | rs76418789 | c.445G > A (p.Gly149Arg) | Benign |
| Parkinson disease | PARK7 | rs71653619 | c.293G > A (p.Arg98Gln) | Benign |
| Hypertension | XDH | rs45523133 | c.514G > A (p.Gly172Arg) | Likely benign |
| Colorectal cancer | MSH6 | rs63750252 | c.3488A > T (p.Glu1163Val) | Likely benign |
| Glaucoma 1 | OPTN | rs75654767 | c.1634G > A (p.Arg545Gln) | Likely benign |
| Carnitine palmitoyltransferase 1 deficiency | CPT1A | rs2229738 | c.823G > A (p.Ala275Thr) | Likely benign |
| Hemolytic uremic syndrome | CFH | rs62625015 | c.3226C > G (p.Gln1076Glu) | Likely benign |
| Macular dystrophy | ABCA4 | rs76258939 | c.3626T > C (p.Met1209Thr) | Likely benign |
| Hyperoxaluria | AGXT | rs34664134 | c.590G > A (p.Arg197Gln) | Likely benign |
| Idiopathic generalized epilepsy (IGE) | CLCN2 | rs111656822 | c.2063G > A (p.Arg688Gln) | Likely benign |
| Fuchs endothelial cornea dystrophy (FECD) | COL8A2 | rs75864656 | c.464G > A (p.Arg155Gln) | Likely benign |
| Nephrotic syndrome | NPHS1 | rs114849139 | c.2869G > C (p.Val957Leu) | Uncertain |
| Cardiomyopathy | MYBPC3 | rs193068692 | c.478C > T(p.Arg160Trp) | Uncertain |
| Ectopia lentis | ADAMTSL4 | rs76075180 | c.926G > A (p.Arg309Gln) | Uncertain |
aThe pathogenicity is annotated based on the recommendations of the ACMG.
FHaplogroup of Y chromosome and mitochondria genome. (A) Patrilineal transmission of Mongolian genome based on Y haplogroup. Red markers are validated as derived genotypes in the Mongolian genome and remaining ones (black) are ancestral type. The bracketed markers were not validated in the Mongolian genome. (B) Matrilineal transmission of the Mongolian genome based on mitochondria haplogroup. Red markers were confirmed to be derived in the Mongolian genome. The mutations labeled with an exclamation point are reported in the mitochondria sequence of rCRS version. L* represents all L lineages but L3, including L0, L0, L2, L4, L5, and L6; L3* represents all L3 lineages but M, including L3a to L3f, L3h, L3i, L3k, L3x, and N; M* represents all lineages prefixed “M,” D, and Q, but G; G* represents G1, G3, and G4.
FHistory and population genetic structure. (A) World regions once occupied by Mongolians from the 13th century to 19th century. (B) PCA plots of 1,042 individuals from 50 global ethnic groups. (C) The ancestry proportion plot of 1,042 individuals using the ADMIXTURE with K = 9. The Mongolian genome is marked by a red star.
D Test between Mongolians and Other Modern Humans
| Group 1 | Group 2 | Yoruba | Chimp | |||||
|---|---|---|---|---|---|---|---|---|
| SE (%) | SE (%) | |||||||
| East Asian to other (Part 1) | Han | Maya | −3.4 | 0.34 | −10 | −4.4 | 1.20 | −3.6 |
| French | Han | 13.7 | 0.49 | 28.2 | 13.3 | 1.08 | 12.3 | |
| Daur | Russian | −11.9 | 0.21 | −56.8 | −12.1 | 0.11 | −106.7 | |
| Brahui | Daur | 14.2 | 0.35 | 40.8 | 13.3 | 0.14 | 94.8 | |
| Hezhen | Sindhi | −13.6 | 0.48 | −28.6 | −13.2 | 0.31 | −42.3 | |
| Japanese | North Italian | −14.2 | 0.44 | −32.5 | −13.1 | 0.31 | −42.1 | |
| Han | Palestinian | −16.5 | 0.56 | −29.2 | −16.5 | 0.71 | −23.3 | |
| American to other (Part 2) | Maya | Russian | −9.3 | 0.57 | −16.2 | −7.8 | 0.85 | −9.2 |
| Maya | Papuan | −8.3 | 0.15 | −55.4 | −8.8 | 0.25 | −35.7 | |
| Maya | North Italian | −10.9 | 0.40 | −27.1 | −10.1 | 0.31 | −33.1 | |
| Maya | Palestinian | −13.7 | 0.58 | −23.5 | −13.3 | 0.61 | −22.0 | |
| Bedouin | Maya | 32 | 0.25 | 128.3 | 31.7 | 0.43 | 73.0 | |
| Brahui | Maya | 11.3 | 0.91 | 12.4 | 10.1 | 0.66 | 15.4 | |
| Middle East groups to other (Part 3) | Bedouin | Brahui | 23.1 | 0.82 | 28.2 | 24.2 | 0.27 | 90.3 |
| Druze | French | 1.1 | 0.23 | 4.7 | 1.6 | 0.24 | 6.7 | |
| Palestinian | Russian | 5.1 | 0.28 | 18 | 5.1 | 0.51 | 10.1 | |
| Palestinian | Sindhi | 4.4 | 0.47 | 9.4 | 4.3 | 0.38 | 11.4 | |
| In European (Part 4) | Adygei | Russian | 0.2 | 0.19 | 1.2* | −0.7 | 0.26 | −2.8* |
| French | North Italian | −0.2 | 0.17 | −1.2* | −0.3 | 0.15 | −2.2* | |
| French | Tuscan | 0 | 0.40 | −0.1* | −0.7 | 0.40 | −1.8* | |
| French | Orcadian | −0.4 | 0.24 | −1.5* | 0.7 | 0.63 | 1.2* | |
| Basque | Sardinian | −0.6 | 0.24 | −0.3* | −0.8 | 0.80 | −1.0* | |
| Adygei | French | −1.9 | 0.32 | −5.8 | −2.2 | 0.94 | −2.3 | |
| French | Sardinian | −0.3 | 0.08 | −3.8 | −1.1 | 0.23 | −4.9 | |
| In South/Central Asian (Part 5) | Hazara | Uygur | 0 | 0.26 | 0.2* | 0.1 | 0.64 | 0.1* |
| Kalash | Burusho | 0.2 | 0.47 | 0.5* | 0.6 | 0.78 | 0.8* | |
| Pathan | Burusho | 0.6 | 0.27 | 2.1* | 0.1 | 0.25 | 0.2* | |
| Balochi | Brahui | 0.2 | 0.30 | 0.7* | −0.2 | 0.10 | −1.9* | |
| Balochi | Sindhi | −1.5 | 0.99 | −1.5* | −1.6 | 0.86 | −1.9* | |
| Makrani | Sindhi | 0.5 | 0.51 | 1.0* | 0.3 | 0.24 | 1.4* | |
| Hazara | Burusho | −5.3 | 0.25 | −21.2 | −5.5 | 0.24 | −22.6 | |
| Burusho | Brahui | −2.4 | 0.23 | −10.2 | −2.3 | 0.46 | −5.1 | |
| In Indian (Part 6) | Aonaga | Siddi | 27.9 | 1.15 | 24.2 | 23.1 | 0.26 | 89.1 |
| Kashmiri Pandit | Siddi | 20.1 | 1.37 | 14.7 | 16.7 | 0.23 | 71.6 | |
| Meghawal | Siddi | 19.1 | 0.33 | 58 | 17.4 | 0.78 | 22.3 | |
| Siddi | Vysya | −19.2 | 0.39 | −48.7 | −17.4 | 0.45 | −38.8 | |
| Indian to other (Part 7) | Brahui | Siddi | −5 | 0.87 | −5.7 | −5.8 | 5.08 | −11.5 |
| Balochi | Siddi | −4.2 | 0.34 | −12.3 | −5.3 | 0.37 | −14.3 | |
| French | Siddi | −3.5 | 0.57 | −6.2 | −4.4 | 0.54 | −8.2 | |
| North Italian | Siddi | −4.5 | 0.77 | −5.9 | −4.6 | 0.34 | −13.5 | |
Note.—D, the ABBABABA test statistic value; Z, Z-score. *, Mongolians possess statistically equivalent shared alleles with group1 and group 2 in the analyses of using two out-groups simultaneously. A tested stable block size 5 Mb (supplementary table S19, Supplementary Material online) was selected to calculate the D value and standard error by using the bootstrap method. The threshold values is |Z| = 3.
Genetic Imprints of Mongolian Genome on Global Populations Based on Shared Haplotype Blocks
| Group 1 | Group 2 | |||
|---|---|---|---|---|
| Populations | Shared Blocks | Populations | Shared Blocks | |
| Han-Mongo. | 801 | French-Mongo. | 258 | <2.2E-16 |
| Han-Mongo. | 801 | Mongo.-Russian | 292 | <2.2E-16 |
| Han-Mongo. | 801 | Maya-Mongo. | 284 | <2.2E-16 |
| Han-Mongo. | 801 | Adygei-Mongo. | 273 | <2.2E-16 |
| Han-Mongo. | 801 | Mongo.-Brahui | 240 | <2.2E-16 |
| Han-Mongo. | 801 | Mongo.-Burusho | 162 | <2.2E-16 |
| Han-Mongo. | 801 | Mongo.-Palestinian | 222 | <2.2E-16 |
| Maya-Mongo. | 284 | French-Maya | 218 | 3.2E-3 |
| Maya-Mongo. | 284 | Maya-Russian | 232 | 2.2E-2 |
| Maya-Han | 289 | Maya-French | 289 | 1.6E-3 |
| Maya-Mongo. | 284 | Adygei-Maya | 240 | 5.5E-2 |
| Maya-Mongo. | 284 | Maya-Palestinian | 179 | 1.1E-6 |
| Maya-Mongo. | 284 | Maya-Burusho | 110 | <2.2E-16 |
| Maya-Mongo. | 284 | Maya-Brahui | 179 | 1.1E-6 |
| Brahui-Mongol. | 240 | Brahui-Maya | 179 | 2.9E-3 |
| Burusho-Mongo. | 162 | Burusho-Maya | 110 | 1.6E-3 |
| French-Mongo. | 258 | French-Maya | 218 | 6.7E-2 |
| Mongo.-Russian | 292 | Maya-Russian | 232 | 8.8E-3 |
Note.—Mongo., Mongolian. P value is the statistical significance based on the chi-square test (χ test, P = 0.05).
aThe shared haplotype blocks had been filtered by using the haplotype blocks of Africans.