| Literature DB >> 29636655 |
Yuchen Wang1,2, Dongsheng Lu1,2, Yeun-Jun Chung3, Shuhua Xu1,2,4,5,6.
Abstract
BACKGROUND: Han Chinese, Japanese and Korean, the three major ethnic groups of East Asia, share many similarities in appearance, language and culture etc., but their genetic relationships, divergence times and subsequent genetic exchanges have not been well studied.Entities:
Keywords: Admixture; Genetic ancestry; Han Chinese; Japanese; Korean; Population divergence; Population structure; SNP
Mesh:
Year: 2018 PMID: 29636655 PMCID: PMC5889524 DOI: 10.1186/s41065-018-0057-5
Source DB: PubMed Journal: Hereditas ISSN: 0018-0661 Impact factor: 3.271
Fig. 2Population level phylogenetic Tree and Principal component analysis (PCA). (A) The maximum likelihood tree was constructed based on pair-wise FST matrix. And the marked number are bootstrap value; (B) The top two PCs of individuals representing six East Asian populations, mapped to their corresponding geographic locations (generated by R 2.15.2 and Microsoft Excel 2010)
Fig. 1Asian samples location and population relatedness. (a) Geographic location of the sampled populations (generated by R 2.15.2 and Microsoft PowerPoint 2010); (b) Principal component analysis (PCA) of all the 10 Asian sample populations
Fig. 3Effective population size varied along time. Effective population size (Ne) was calculated from LD observations in 2500 distance classes (from 0.001 cM to 2.5 cM with the window size set as 0.001 cM) for each population. Results from 2000 generations ago (GA) to 200 GA are shown (generated by Microsoft Office 365)
Pairwise divergence time (TF)
| TF(GA) | BMON | CDX | CEU | CHB | CHS | JPRK | JPT | KHV | KOR | QHM | TIB | YRI |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BMON | 0 | 603 | 1399 | 276 | 315 | 321 | 265 | 478 | 238 | 88 | 284 | 3796 |
| CDX | 603 | 0 | 2256 | 218 | 118 | 482 | 374 | 60 | 320 | 1527 | 509 | 4518 |
| CEU | 1399 | 2256 | 0 | 1839 | 1690 | 1369 | 1676 | 1985 | 1782 | 1896 | 1589 | 3290 |
| CHB | 276 | 218 | 1839 | 0 | 24 | 238 | 122 | 149 | 47 | 1126 | 199 | 3948 |
| CHS | 315 | 118 | 1690 | 24 | 0 | 238 | 144 | 73 | 81 | 1059 | 236 | 3682 |
| JPRK | 321 | 482 | 1369 | 238 | 238 | 0 | 79 | 379 | 179 | 894 | 329 | 3156 |
| JPT | 265 | 374 | 1676 | 122 | 144 | 79 | 0 | 285 | 54 | 1030 | 279 | 3685 |
| KHV | 478 | 60 | 1985 | 149 | 73 | 379 | 285 | 0 | 235 | 1347 | 409 | 4172 |
| KOR | 238 | 320 | 1782 | 47 | 81 | 179 | 54 | 235 | 0 | 1050 | 225 | 3826 |
| QHM | 88 | 1527 | 1896 | 1126 | 1059 | 894 | 1030 | 1347 | 1050 | 0 | 998 | 3988 |
| TIB | 284 | 509 | 1589 | 199 | 236 | 329 | 279 | 409 | 225 | 998 | 0 | 3592 |
| YRI | 3796 | 4518 | 3290 | 3948 | 3682 | 3156 | 3685 | 4172 | 3826 | 3988 | 3592 | 0 |
Note: Pairwise divergence time (TF), in units of generations ago (GA), estimated by 2NeFST
Results of F4 test
| popO | popA | popB | popC | F4(A,O;C,B) | F4(B,O;C,A) |
|---|---|---|---|---|---|
| YRI | TIB | CDX | CHB | 0.01255 | 0.01977 |
| YRI | TIB | CDX | CHS | 0.01019 | 0.02478 |
| YRI | TIB | CDX | KOR | 0.01333 | 0.01780 |
| CEU | TIB | CDX | CHB | 0.01063 | 0.02007 |
| CEU | TIB | CDX | CHS | 0.00957 | 0.02605 |
| CEU | TIB | CDX | KOR | 0.01238 | 0.01917 |
| YRI | CDX | JPRK | CHB | 0.01317 | 0.00911 |
| YRI | CDX | JPRK | CHS | 0.01818 | 0.00843 |
| YRI | CDX | JPRK | JPT | 0.00723 | 0.02230 |
| YRI | CDX | JPRK | KOR | 0.01119 | 0.01547 |
| CEU | CDX | JPRK | CHB | 0.01239 | 0.00734 |
| CEU | CDX | JPRK | CHS | 0.01838 | 0.00788 |
| CEU | CDX | JPRK | JPT | 0.00728 | 0.02052 |
| CEU | CDX | JPRK | KOR | 0.01150 | 0.01442 |
| YRI | TIB | JPRK | JPT | 0.00766 | 0.02798 |
| YRI | TIB | JPRK | KOR | 0.01241 | 0.02116 |
| CEU | TIB | JPRK | JPT | 0.00769 | 0.02849 |
| CEU | TIB | JPRK | KOR | 0.01267 | 0.02239 |
Note: population A (popA) and population B (popB) are two donate populations, population C (popC) is a target population who received gene flow, population O (popO) is an out group. Additional file 14: Figure S10C shows the model of F4 test. The value of F4(A,O;C,B) stands for the size of the common part of the two pathway A to O and C to B (labeled as l) on the tree weighted by the gene flow proportion α (labeled as αl), and similarly F4(B,O;C,A) equals (1-α)m. And the calculation of F4 value noted in METHODS
Fig. 4Admixture analysis of East Asian populations together with world-wide non-Asian populations. Result of K = 5 was shown and each vertical bar represents genome makeup of an sample individual (calculated by ADMIXTURE 1.23 [1] and R 2.15.2)
Results of D test
| popW | popA | popY | popZ | D value | Z score |
|---|---|---|---|---|---|
| CHB | YRI | JPT | KOR | −0.0055 | −5.061 |
| CHS | YRI | JPT | KOR | −0.0054 | −4.865 |
| JPT | YRI | CHB | CHS | 0.0014 | 1.515 |
| KOR | YRI | CHB | CHS | 0.0016 | 1.649 |
| CHB | CEU | JPT | KOR | −0.0062 | −6.748 |
| CHS | CEU | JPT | KOR | −0.0060 | −6.596 |
| JPT | CEU | CHB | CHS | −0.0005 | −0.595 |
| KOR | CEU | CHB | CHS | −0.0004 | −0.446 |
Note: Population A (popA) is outgroup, population W (popW) is a reference population, population Y (popY) and population Z (popZ) are two potential target populations. If D value is significantly positive, it suggests that the gene flow between population W and population Y was greater than that between population W and population Z. On the contrary, if D value is significantly negative, it means the gene flow between population W and population Y was smaller than that between population W and population Z. If D value is not significantly positive or negative, it suggests that the gene flow between population W and population Y are equal to that between population W and population Z, or there is no gene flow between population W and population Y or between population W and population Z