| Literature DB >> 29091727 |
Fumihiko Takeuchi1, Tomohiro Katsuya2, Ryosuke Kimura3, Toru Nabika4, Minoru Isomura4, Takayoshi Ohkubo5, Yasuharu Tabara6, Ken Yamamoto7, Mitsuhiro Yokota8, Xuanyao Liu9,10, Woei-Yuh Saw9,11, Dolikun Mamatyusupu12, Wenjun Yang13, Shuhua Xu14,15,16, Yik-Ying Teo1,9,10,11,17,18, Norihiro Kato1.
Abstract
The contemporary Japanese populations largely consist of three genetically distinct groups-Hondo, Ryukyu and Ainu. By principal-component analysis, while the three groups can be clearly separated, the Hondo people, comprising 99% of the Japanese, form one almost indistinguishable cluster. To understand fine-scale genetic structure, we applied powerful haplotype-based statistical methods to genome-wide single nucleotide polymorphism data from 1600 Japanese individuals, sampled from eight distinct regions in Japan. We then combined the Japanese data with 26 other Asian populations data to analyze the shared ancestry and genetic differentiation. We found that the Japanese could be separated into nine genetic clusters in our dataset, showing a marked concordance with geography; and that major components of ancestry profile of Japanese were from the Korean and Han Chinese clusters. We also detected and dated admixture in the Japanese. While genetic differentiation between Ryukyu and Hondo was suggested to be caused in part by positive selection, genetic differentiation among the Hondo clusters appeared to result principally from genetic drift. Notably, in Asians, we found the possibility that positive selection accentuated genetic differentiation among distant populations but attenuated genetic differentiation among close populations. These findings are significant for studies of human evolution and medical genetics.Entities:
Mesh:
Year: 2017 PMID: 29091727 PMCID: PMC5665431 DOI: 10.1371/journal.pone.0185487
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Geographic distribution of the Japanese genetic clusters.
Each population is mapped to its geographic location. The individuals in a population are represented by circles colored according to their assigned genetic clusters. The figure map was generated using the packages maps v3.1.0 and mapdata v2.2–6 in the R software.
Fig 2Genetic clustering of an extended set of individuals from the Shimane prefecture.
a) For each municipality, the individuals therein are represented by diamonds colored according to their assigned genetic clusters. b) Re-classification of the individuals in the Shimane 1, 2 clusters of Fig 1 into the finer clusters Shimane A–E. The map of Shimane prefecture (https://commons.wikimedia.org/wiki/File%3AMap_of_Shimane_Prefecture_Ja.svg) is licensed under the Attribution-ShareAlike 3.0 Unported license. The license terms can be found on the following link: https://creativecommons.org/licenses/by-sa/3.0/.
Fig 3Ancestry profile of the Japanese genetic clusters.
For each Japanese cluster, the non-Japanese genetic components are represented as a mixture of continental Asian clusters. The bar graph indicates the proportion of continental Asian clusters contributing to a given Japanese cluster. Continental Asian clusters with ≥1% contribution to at least one Japanese cluster are shown. The map indicates geographic locations of the populations included in continental Asian clusters. See S4 Table for details. The figure map was generated using the packages maps v2.3–9 and mapdata v2.2–3 in the R software.
Admixture events detected in the Japanese clusters.
| Admixed cluster | Surrogate group approximating one admixture source | |||||||
|---|---|---|---|---|---|---|---|---|
| C282 (Korean) | C284 (Han Chinese 1) | C291 (Han Chinese 2) | C280 (Vietnamese) | |||||
| Admixture time | Admixture time | Admixture time | Admixture time | |||||
| 52.0 ± 3.6 | 4.3E-17 | 49.9 ± 3.2 | 7.1E-36 | 53.5 ± 3.4 | 6.1E-25 | 56.9 ± 7.2 | 8.4E-10 | |
| 46.5 ± 3.9 | 2.0E-10 | 51.6 ± 5.4 | 2.9E-14 | 50.6 ± 6.8 | 1.4E-12 | 55.1 ± 10.6 | 3.0E-05 | |
| 50.5 ± 10.0 | 7.3E-04 | 37.6 ± 5.0 | 4.1E-13 | 47.8 ± 7.2 | 2.9E-10 | 42.8 ± 7.4 | 2.1E-03 | |
| 58.9 ± 13.1 | 3.3E-03 | 51.5 ± 8.4 | 4.9E-02 | 58.0 ± 8.8 | 5.6E-10 | 51.5 ± 8.4 | 9.9E-09 | |
| 33.8 ± 8.3 | 1.9E-03 | 46.8 ± 13.3 | 7.2E-03 | 44.7 ± 11.0 | 5.9E-04 | 40.9 ± 8.8 | 4.2E-05 | |
| 38.1 ± 2.2 | 7.2E-49 | 38.2 ± 2.2 | 3.0E-64 | 39.3 ± 1.9 | 1.9E-60 | 43.8 ± 2.3 | 1.0E-69 | |
| 43.1 ± 2.5 | 2.0E-36 | 43.5 ± 2.6 | 1.7E-50 | 41.4 ± 3.1 | 7.1E-39 | 41.5 ± 4.4 | 1.7E-20 | |
| 47.1 ± 9.2 | 4.1E-06 | 54.4 ± 9.4 | 9.5E-08 | 53.1 ± 8.1 | 8.4E-10 | 55.4 ± 13.3 | 3.5E-04 | |
| 43.2 ± 8.2 | 3.3E-05 | 54.0 ± 10.8 | 6.2E-06 | 54.8 ± 10.7 | 3.3E-06 | 56.0 ± 12.3 | 6.8E-05 | |
Admixture time is shown in generations before present (estimate ± SE).
P-values are corrected for multiple testing by continental Asian reference clusters.
Fig 4Ancestry populations for the Japanese and continental Asians modeled using the ADMIXTURE program.
Eight ancestry populations are inferred. In this bar plot, each individual is represented by a vertical bar, which is colored by eight segments that indicate the proportion contributed by the corresponding ancestry populations. The individuals are sorted by 23 continental Asian clusters (see S4 Table for details) and nine Japanese clusters. The bar widths are halved for Japanese individuals, to fit the plot in space. The inset shows color-codes for eight ancestry populations and geographic areas of clusters, to which the individual population mainly contributes.
Fig 5Relation between locus-specific genetic differentiation and positive selection in East Asia.
a) Genetic differentiation is analyzed in four areas, consisting of different sets of populations. b) Neighbor-joining phylogenetic tree of the genetic clusters. c) Genetic similarity among the individuals is plotted by the first two principal components of EIGENSOFT PCA. d) For each area, 4-SNP windows of the genome are sorted into 20 equal-sized bins arranged according to locus-specific HFST in the corresponding populations. The proportion of positively selected windows (in vertical axis) is plotted across 20 HFST bins (in horizontal axis); the average proportion (1.3%) is indicated by a gray line. Asterisk (*) indicates significant enrichment of positively selected windows in a specific bin, compared to the whole-genome average (Fisher’s exact test P < 0.01/20). The highest (or rightmost) HFST bin in A1 and second to lowest (leftmost) HFST bins in A2 and A3 are significantly enriched. Triangles above the plot indicate the HFST bins, to which genes known to be under selection belong [29–41]. The figure map was generated using the packages maps v3.1.0 and mapdata v2.2–6 in the R software.
Fig 6Relation between locus-specific genetic differentiation and positive selection in Southeast Asia and South Asia.
a and c) Neighbor-joining phylogenetic tree of the genetic clusters. b and d) The proportion of positively selected windows (in vertical axis) is plotted across 20 HFST bins of 4-SNP windows (in horizontal axis); the average proportion is indicated by a gray line. Asterisk (*) indicates significant enrichment of positively selected windows in a specific bin, compared to the whole-genome average (Fisher’s exact test P < 0.01/20). Triangles above the plot indicate the HFST bins, to which genes known to be under selection belong [29,34–36,39].