Literature DB >> 21463511

High frequencies of Y-chromosome haplogroup O2b-SRY465 lineages in Korea: a genetic perspective on the peopling of Korea.

Won Kim1, Wook Kim2, Soon-Hee Kim1,3, Ki-Cheol Kim2, Dong-Jik Shin4, Han-Jun Jin5, Kyoung-Don Kwak6, Myun-Soo Han6, Joon-Myong Song7.   

Abstract

BACKGROUND: Koreans are generally considered a Northeast Asian group, thought to be related to Altaic-language-speaking populations. However, recent findings have indicated that the peopling of Korea might have been more complex, involving dual origins from both southern and northern parts of East Asia. To understand the male lineage history of Korea, more data from informative genetic markers from Korea and its surrounding regions are necessary. In this study, 25 Y-chromosome single nucleotide polymorphism markers and 17 Y-chromosome short tandem repeat (Y-STR) loci were genotyped in 1,108 males from several populations in East Asia.
RESULTS: In general, we found East Asian populations to be characterized by male haplogroup homogeneity, showing major Y-chromosomal expansions of haplogroup O-M175 lineages. Interestingly, a high frequency (31.4%) of haplogroup O2b-SRY465 (and its sublineage) is characteristic of male Koreans, whereas the haplogroup distribution elsewhere in East Asian populations is patchy. The ages of the haplogroup O2b-SRY465 lineages (~9,900 years) and the pattern of variation within the lineages suggested an ancient origin in a nearby part of northeastern Asia, followed by an expansion in the vicinity of the Korean Peninsula. In addition, the coalescence time (~4,400 years) for the age of haplogroup O2b1-47z, and its Y-STR diversity, suggest that this lineage probably originated in Korea. Further studies with sufficiently large sample sizes to cover the vast East Asian region and using genomewide genotyping should provide further insights.
CONCLUSIONS: These findings are consistent with linguistic, archaeological and historical evidence, which suggest that the direct ancestors of Koreans were proto-Koreans who inhabited the northeastern region of China and the Korean Peninsula during the Neolithic (8,000-1,000 BC) and Bronze (1,500-400 BC) Ages.

Entities:  

Year:  2011        PMID: 21463511      PMCID: PMC3087676          DOI: 10.1186/2041-2223-2-10

Source DB:  PubMed          Journal:  Investig Genet        ISSN: 2041-2223


Background

The Koreans are geographically a Northeast Asian group, who are thought to be most closely related to Altaic language-speaking populations. Anthropological and archaeological evidence suggests that the early Korean population was related to Mongolian ethnic groups, who inhabited the general area of the Altai Mountains and the Lake Baikal regions of southeastern Siberia [1]. Based on archaeological data, the earliest modern human lithic cultures date from 25,000 to 45,000 years ago in the Altai Mountains and southeastern Siberia and the Korean Peninsula [2,3]. According to Korea's founding myths, the Gojoseon (the first state-level society) was established around 2,333 BC in the region of southern Manchuria, but later stretched from the northeastern region of China to the northern part of the Korean Peninsula. Thus, the ancient Koreans (proto-Koreans) may have shared a common origin with the Northeast Asian groups who inhabited the general area of the southeastern Siberia and Manchuria during the Neolithic (8,000-1,000 BC) and Bronze (1,500-400 BC) Ages [1,4]. However, like many debates on the genetic history of human populations, the origin of the Korean population remains controversial. Studies of classic genetic markers have shown that, among the East Asians, Koreans tend to have the closest genetic affinity with Mongolians [5,6]. These findings support the first model, that a northeastern Asian origin is most likely, followed by a southeastward migration into the Korean Peninsula. The second model suggests a bi- and/or multidirectional route, with one migration through northeastern Asia and one through southeastern Asia. Recent surveys of genetic variation using two haploid markers (mitochondrial DNA and the Y chromosome) showed that the Korean population contains lineages typical of both Southeast and Northeast Asian populations [7-9]. These results led us to consider that the peopling of Korea might have involved multiple events, and that different aspects might be revealed by appropriate additional genetic markers and DNA samples. Based on recent studies of Y-chromosome variation, the East Asian gene pool is almost completely contained within three major Y-chromosome lineages (haplogroups C, D and K) [7,10]. East Asian populations have major expansions of haplogroup O-M175 lineages (a lineage within K), although there are significant genetic differences in other lineages between the populations [8,11]. Most Y chromosomes found in the Korean population also belong to haplogroup O-M175 and its sublineages [8,9]. The Korean population has a high frequency of the haplogroup O3-M122 lineage, which is shared mainly with Chinese and Southeast Asian populations. By contrast, the haplogroup O2b-SRY465 lineages (and its sublineage) are found with high frequency and diversity specifically among modern populations of Japan and Korea [8,12,13]. These chromosomes are absent in most populations in China, but they have been detected in some samples of Beijing-Han Chinese, Manchurians, Mongolians and Southeast Siberians [7,8,14]. Hammer and Horai [12] hypothesized that the haplogroup O2b-SRY465 and its sublineage, O2b1-47z, might be Yayoi male lineages, which contributed to the contemporary mainland Japanese population via a process of demic diffusion during the Yayoi period from the Korean Peninsula, around 2,300 years ago. Hammer et al. [13] also suggested that the haplogroup O2b1-47z mutation arose on an ancestral O2b-SRY465 chromosome during early phases of the Yayoi migration. This lineage is also distributed sporadically in the Mongol, Manchu and Southeast Siberian populations, and in Indonesia, the Philippines, Vietnam and Micronesia [8,13,14]. Lin et al. [15] suggested that the O2b1-47z Y chromosome associated with the Y2 allele might have originated from an ancestral population in Henan or the southern parts of Shanxi near the Yellow River in China. Thus, to understand the male lineage history of Korea, more data from such informative genetic markers from Korea and its surrounding regions are necessary. In the present study, 25 Y-chromosome single-nucleotide polymorphism (Y-SNP) markers and 17 Y-chromosome short tandem repeat (Y-STR) loci (Yfiler) were genotyped in 1,108 men from East Asian, several populations, to not only identify haplogroup O2b lineages and trace their migration history, but also to distinguish populations with different genetic backgrounds.

Methods

This study was approved by the Ethics Committee and institutional review boards of Institute of Bio-Science and Technology in the Dankook University. Separate written informed consent was obtained for enrollment from all participants.

Subjects

In this study, we analyzed 1,108 men, representatives of several East Asian populations (Korean, Japanese, Mongolian, Chinese, Indonesian, Filipino, Thai, Vietnamese; Table 1). The DNA samples included subsets of the samples examined by Kim et al. [16] and Jin et al. [8], although the exact number of subjects for each population occasionally varied between the studies. In addition, we included new Korean samples collected from 506 people residing in six major provinces in Korea [17]. DNA was prepared from whole blood taken from each participant by a standard method [18], or extracted from buccal cells as previously described [19].
Table 1

Sample sizes and references for East Asian populations studied (n = 1,108)

Geographic affiliationPopulation (n)Ethnic/regional group (n)ReferencesYHRD accession number
Northeast AsiaKorean (506)
Seoul/Gyeonggi (110)Kim et al. (2010) [17]YA003585
Chungcheong (72)Kim et al. (2010) [17]YA003588
Gangwon (63)Kim et al. (2010) [17]YA003587
Gyeongsang (84)Kim et al. (2010) [17]YA003586
Jeolla (90)Kim et al. (2010) [17]YA003584
Jeju (87)Kim et al. (2010) [17]YA003583
Japanese (157)
Ibaraki (50)Kim et al. (2000) [16]YA003199
Osaka/Tokyo (6)Kim et al. (2000) [16]YA003199
Tokushima (57)Kim et al. (2000) [16]YA003199
Yamaguchi (44)Kim et al. (2000) [16]YA003199
Mongolian (81)
Buryat (36)Kim et al. (2000) [16]YA003198
Khalkh (45)Jin et al. (2003) [8]YA003670
Chinese (175)
Beijing-Han (51)Kim et al. (2000) [16]YA003197
Manchurian (30)Jin et al. (2003) [8]YA003195
Xian (34)Kim et al. (2000) [16]YA003671
Southeast AsiaYunnan-Han (60)Jin et al. (2003) [8]YA003196
Indonesian (37)
Java (37)Kim et al. (2000) [16]YA003200
Filipino (64)
Cebuno speak (33)Kim et al. (2000) [16]YA003202
Davao/Mondoro (2)Kim et al. (2000) [16]YA003202
Tagalog speak (29)Kim et al. (2000) [16]YA003202
Thai (40)
Chiang Mai (32)Kim et al. (2000) [16]YA003203
Bangkok (6)Kim et al. (2000) [16]YA003203
Surin (2)Kim et al. (2000) [16]YA003203
Vietnamese (48)
Hanoi (48)Kim et al. (2000) [16]YA003201
Sample sizes and references for East Asian populations studied (n = 1,108)

Y-SNP genotyping

Initially all the samples were analyzed for 12 Y-SNP markers (M9, M45, M89, M119, M122, M174, M175, M214, P31, SRY465, 47z and RPS4Y) using a previously described protocol [17]. The samples belonging to haplogroups C, D, K, NO, O3 and P were subjected to further typing with an additional 13 Y-SNP markers to designate the subclades: two three-plex, three two-plex and one single-plex SNaPshot assay were developed for these 13 Y-SNP markers (Figure 1). The nomenclature of the haplogroups followed that of the Y Chromosome Consortium [20]. Primers for PCR and single-base extension (SBE) reactions were designed (Primer 3.0 program; http://frodo.wi.mit.edu/primer3/, Cambridge, MA, USA) (see Additional file 1, Table S1; see Additional file 2, Table S2). Conditions of the SNaPshot assays were the same as those previously described [17], with the exception of PCR purification; in our assay, the PCR products were purified by adding 2 μl of an exonuclease I-shrimp alkaline phosphatase preparation (Exo-SAP; USB Corp., Cleveland, OH, USA) to 5 μl PCR product.
Figure 1

Phylogenetic tree defined with the 25 Y-chromosome single-nucleotide polymorphisms (Y-SNPs). Gray squares represent an additional 13 haplogroups in this study. aThe O2b1 is associated with the Y2 allele [15].

Phylogenetic tree defined with the 25 Y-chromosome single-nucleotide polymorphisms (Y-SNPs). Gray squares represent an additional 13 haplogroups in this study. aThe O2b1 is associated with the Y2 allele [15].

Y-STR genotyping

Y-STRs (DYS456, DYS389I, DYS390, DYS389II, DYS458, DYS19, DYS385a/b, DYS393, DYS391, DYS439, DYS635, DYS392, YGATA H4, DYS437, DYS438 and DYS448) were amplified (AmpFlSTR Yfiler kit; Applied Biosystems, Foster City, CA, USA) and analyzed by capillary electrophoresis using a genetic analyzer (ABI Prism 3730; Applied Biosystems) and accompanying software (GeneMapper ID v3.2; Applied Biosystems). Alleles were named according to the recommendations of the International Society of Forensic Genetics [21]. The 25 Y-SNP markers and 17 Y-STR loci of the 1,108 men in this study have been submitted to the Y Chromosome Haplotype Reference Database (Table 1).

Data analysis

The genetic differentiation between different population samples and its statistical significance were assessed via FST (Y-SNP) values. The population genetic structure of the ethnic and/or regional groups was assessed by analysis of molecular variance (AMOVA) using the RST (Y-STR) values [22]. The calculations of diversity indices, FST and AMOVA were performed using Arlequin software (v3.0; http://cmpg.unibe.ch/software/arlequin3/) [23]. Population pairwise FST values were visualized by multidimensional scaling (MDS) plot analyses (PASW Statistics 17; SPSS Inc., Cary, NC, USA). Haplotype diversity and mean number of pairwise differences of 15 Y-STR markers (excluding DYS385a/b in Yfiler) linked with the haplogroups O2b*, O2b1, O3 and C were also calculated (Arlequin 3.0 package) [23]. In addition, the genetic diversity of average variance ratio for the Y-STR markers was estimated, as described previously [24]. Median-joining (MJ) network [25] of the haplogroups O2b*, O2b1, O3 and C was constructed from the data of nine Y-STRs (NETWORK 4.5 software; http://www.fluxus-technology.com). The MJ network was constructed by weighting 99 (47z), 10 (DYS438, DYS392), nine (DYS393), eight (DYS437, DYS390), five (DYS391, DYS19), two (DYS389I) and one (DYS439) [26]. Coalescence times for subsets of chromosomes defined by the SNP markers were estimated (BATWING; http://www.mas.ncl.ac.uk/~nijw/) [27], assuming a generation time of 30 years. Locus-specific STR mutation rates (father-son mutation rates) based on previously described measurements, and various demographic models, were tested in different runs of the program [28].

Results and discussion

This survey of East Asian Y-SNPs identified 18 haplogroups, 15 of which were present in our Korean group (Table 2; see Additional file 3, Table S3). In general, we found the East Asian population groups to be characterized by male haplogroup homogeneity, showing mostly expansions of haplogroup O-M175 lineages (Figure 2). Haplogroups O3a3-P201, O2b*-SRY465, O3a*-M324, C3-M217 and O2b1-47z (a sublineage within O2b) were the most frequent found in Koreans, and accounted for 28.9, 22.5, 15.0, 12.3 and 8.9% of Korean Y chromosomes, respectively.
Table 2

Distribution of Y chromosome haplogroup frequencies in the East Asian populations studied

Population (n)Samples of haplogroups in East Asian populations, n (%)

CDF*K*LTNO*NO*O1O2O3P*QR


C*C1C2C3D*D1D2O1aO2*O2b*O2b1O3*O3a*O3a3O3a4
Northeast Asia
Korean (506)1 (0.20)1 (0.20)062 (12.25)008 (1.58)003 (0.59)0023 (4.55)011 (2.17)5 (0.99)114 (22.53)45 (8.89)2 (0.40)76 (15.02)146 (28.85)007 (1.38)2 (0.40)
Japanese (157)08 (5.00)011 (6.88)0046 (29.30)000004 (2.50)03 (1.88)1 (0.63)8 (5.10)38 (23.75)011 (6.88)27 (16.88)0000
Mongolian (81)
 Buryat (36)00016 (44.44)01 (2.78)01 (2.78)00009 (25.00)0001 (2.78)001 (2.78)6 (16.67)0001 (2.78)
 Khalkh (45)00021 (46.67)01 (2.22)0000006 (13.33)0000003 (6.67)11 (24.44)001 (2.22)2 (4.44)
N.China (115)
 Beijing-Han (51)0003 (5.88)01 (1.96)1 (1.96)000001 (1.96)08 (15.69)1 (1.96)1 (1.96)03 (5.88)11 (21.57)21 (41.18)0000
 Manchurian (30)0008 (26.67)000000000001 (3.33)1 (3.33)003 (10.00)16 (53.33)0001 (3.33)
 Xian (34)0008 (23.53)03 (8.82)0000002 (5.88)03 (8.82)2 (5.88)1 (2.94)00013 (38.24)001 (2.94)1 (2.94)
Southeast Asia
South China (60)
 Yunnan-Han (60)1 (1.67)005 (8.33)01 (1.67)07 (11.67)00004 (6.67)05 (8.33)7 (11.67)001 (1.67)3 (5.00)22 (36.67)003 (5.00)1 (1.67)
Indonesian (37)1 (2.70)001 (2.70)0001 (2.70)0000005 (13.51)12 (32.43)3 (8.11)002 (5.41)9 (24.32)002 (5.41)1 (2.70)
Filipino (64)1 (1.56)001 (1.56)0005 (7.81)3 (4.69)0000024 (37.50)1 (1.56)002 (3.13)2 (3.13)21 (32.81)002 (3.13)2 (3.13)
Thai (40)0001 (2.50)01 (2.50)01 (2.50)0000001 (2.50)28 (70.00)01 (2.50)004 (10.00)001 (2.50)2 (5.00)
Vietnamese (48)0006 (12.50)01 (2.08)001 (2.08)0001 (2.08)02 (4.17)5 (10.42)2 (4.17)2 (4.17)1 (2.08)14 (29.17)13 (27.08)0000

Total (1,108)4 (0.36)9 (0.81)0143 (12.87)09 (0.81)55 (4.96)15 (1.35)4 (0.36)3 (0.27)0050 (4.50)062 (5.58)63 (5.67)131 (11.82)86 (7.74)9 (0.81)126 (11.34)309 (27.81)0017 (1.53)13 (1.17)
Figure 2

Distribution of Y haplogroup O lineages in East Asia. Circled areas are proportional to haplogroup frequency, and haplogroups are represented by different colors.

Distribution of Y chromosome haplogroup frequencies in the East Asian populations studied Distribution of Y haplogroup O lineages in East Asia. Circled areas are proportional to haplogroup frequency, and haplogroups are represented by different colors. O3-M122 was the commonest Chinese Y-chromosome haplogroup found, and its presence in Korea may originate from demic diffusion by way of south-to-north migration [9,29,30]. C-RPS4Y was the commonest Mongolian Y-chromosome haplogroup (C3 lineage), and this is shared primarily with populations in northern Asia (including Koreans) [9,31,32]. Our network and diversity analyses of C-RPS4Y and O3 may support a southern origin that migrated into East Asia via the southern route [10,29] (Figure 3, Table 3). Interestingly, our Japanese group seemed to have both C1-M105 and C3-M217 chromosomes, whereas haplogroup C1-M105 was not present in most East Asian populations (except for Koreans), consistent with the previous report of Hammer et al. [13]. Haplogroup D2-M55 was found at high frequency only in Japanese subjects (29.3%), including 1.6% of Koreans, whereas it was absent elsewhere in these East Asian populations, except for the Beijing-Han group. Haplogroup D1-M15 was present at extremely low frequencies in the other East Asian populations. The occurrence of C1 and D2 chromosomes in Korea may be considered concordant with the historical views about the recent connection between the Korean and Japanese populations.
Figure 3

Median-joining network for East Asian Y haplogroup lineages based on the minimum nine Y-chromosome short tandem repeat (Y-STR) haplotypes. (A) Network of haplogroup O2b*; (B) network of haplogroup O2b1; (C) network of haplogroup C; (D) network of haplogroup O3. Circled areas are proportional to haplotype frequency; lines represent the mutational differences between haplotypes.

Table 3

15 Y-STRs diversity of haplogroups C and O3 in East Asian populations

PopulationDiversity parameters

Haplogroup CHaplogroup O3


nHaplotype diversityMean number of pairwise differencesAllele size variation rationHaplotype diversityMean number of pairwise differencesAllele size variation ratio



NorthEast Asia
 Korean630.99 ± 0.005.89 ± 2.850.452220.99 ± 0.008.48 ± 3.940.74
 Japanese151.00 ± 0.027.17 ± 3.560.86380.99 ± 0.018.53 ± 4.030.69
 Mongolian
  Buryat140.97 ± 0.045.69 ± 2.900.3571.00 ± 0.088.52 ± 4.490.67
  Khalkh180.92 ± 0.054.81 ± 2.460.28141.00 ± 0.038.98 ± 4.400.80
 North China
  Beijing-Han31.00 ± 0.278.00 ± 5.130.60341.00 ± 0.018.89 ± 4.200.79
  Manchurian70.52 ± 0.212.57 ± 1.560.16190.95 ± 0.045.99 ± 2.990.49
  Xian71.00 ± 0.086.86 ± 3.680.47131.00 ± 0.038.10 ± 4.020.64
SouthEast Asia
 South China
  Yunnan-Han60.93 ± 0.126.47 ± 3.570.58261.00 ± 0.018.53 ± 4.070.72
 Indonesian21.00 ± 0.5010.00 ± 7.420.84111.00 ± 0.048.33 ± 4.180.80
 Filipino21.00 ± 0.5011.00 ± 8.121.42260.99 ± 0.017.49 ± 3.620.66
 Thai11.00 ± 0.000.00 ± 0.00-41.00 ± 0.188.83 ± 5.170.87
 Vietnamese61.00 ± 0.106.60 ± 3.630.58270.98 ± 0.027.98 ± 3.830.64
Median-joining network for East Asian Y haplogroup lineages based on the minimum nine Y-chromosome short tandem repeat (Y-STR) haplotypes. (A) Network of haplogroup O2b*; (B) network of haplogroup O2b1; (C) network of haplogroup C; (D) network of haplogroup O3. Circled areas are proportional to haplotype frequency; lines represent the mutational differences between haplotypes. 15 Y-STRs diversity of haplogroups C and O3 in East Asian populations Haplogroup N-M231 was present in the Korean and Mongolian groups at moderate frequencies, suggesting that the early Korean population may have shared a common origin with Mongolian ethnic groups who inhabited the general area of the Altai Mountains and Lake Baikal regions of southeastern Siberia [1]. The K-M9 defined chromosomes (L, Q and R subtypes) were also found at low frequencies in the Korean group, and these mainly occur in central and south Asia [13,33]. The presence of these haplogroups in the Korean population implies that the peopling of Korea probably involved multiple events [8,9]. Based on the result of the MDS plot (Figure 4 and Additional file 4, Table S4), the Korean population contains lineages from both the southern and northern areas of East Asia.
Figure 4

Multidimensional scaling (MDS) plot based on distances of 12 populations in East Asia (see Additional file 4, Table S4). Square = Koreans, open triangles = Northeast Asians, closed triangles = Southeast Asians.

Multidimensional scaling (MDS) plot based on distances of 12 populations in East Asia (see Additional file 4, Table S4). Square = Koreans, open triangles = Northeast Asians, closed triangles = Southeast Asians. The geographical distribution of the O2b*-SRY465 (and its sublineage; together designated O2b-SRY465) in East Asia is shown in Figure 2. Interestingly, we found the high frequency of haplogroup O2b*-SRY465 to be characteristic of Koreans, (22.5%), but its distribution elsewhere in the East Asian populations was very patchy (Table 2, Figure 2). The cluster pattern in the O2b*-SRY465 network (Figure 3) was indicative of a single origin, although people with haplogroup O2b*-SRY465 were found to be distributed widely across both northeastern and southeastern Asia. The genetic differences between the Korean and other East Asian populations were analyzed by AMOVA (Table 4). When samples were grouped into Southeast Asians (SEA) and Northeast Asians (NEA), AMOVA could not distinguish between them. In addition, Koreans were again not distinct from either SEA or NEA, indicating that a southern versus a northern origin could not be distinguished from the Y-STR based comparison (Table 4).
Table 4

Results from analysis of molecular variance for 15 Y-chromosome short tandem repeats (excluding DYS385a/b in Yfiler) in East Asian populations

Variance, % (P value)

GroupingBetween groupsBetween populations, within groupsWithin populations
Korean versus SEAa versus NEAb-1.42 (0.86)2.83 (0.18)98.59 (0.06)
Korean versus SEA-2.58 (0.71)3.55 (0.50)99.04 (0.22)
Korean versus NEA-3.18 (0.34)6.02 (0.21)97.16 (0.03)
SEA versus NEA-0.14 (0.54)0.62 (0.17)99.52 (0.17)

aSoutheast Asian: Chinese (Yunnan-Han), Indonesian, Filipino, Thai, Vietnamese.

bNortheast Asian: Korean, Japanese, Chinese (Beijing-Han, Manchurian, Mongolian, Xian).

Results from analysis of molecular variance for 15 Y-chromosome short tandem repeats (excluding DYS385a/b in Yfiler) in East Asian populations aSoutheast Asian: Chinese (Yunnan-Han), Indonesian, Filipino, Thai, Vietnamese. bNortheast Asian: Korean, Japanese, Chinese (Beijing-Han, Manchurian, Mongolian, Xian). Previous studies suggested a southeastern Asian origin for O2b-SRY465 [8,13], because the entire haplogroup O in East Asia has been proposed to have a southeastern Asian origin [10]. Under the southern origin hypothesis of the Y chromosome in Asia, no extensive bottleneck or genetic isolation are expected in far-east Asia, because there is no obvious barrier except perhaps a linguistic one (Sino-Tibetan-speaking people versus Altaic-speaking people) [34]. The very low incidence of the O2b*-SRY465 haplogroup in our Chinese population group is a substantial departure from the southern-expansion hypothesis of O2b*-SRY465, and indicates that O2b*-SRY465 has undergone apparent long-term isolation in far-east Asia. Therefore, it is unlikely that southeastern Asia is the place of the early settlement of modern humans carrying O2b chromosomes. In contrast to some other sublineages within haplogroup O, the diversity of O2b*-SRY465 calculated in a previous study [13] indicated that northern populations in East Asia are more polymorphic than southern populations, implying a northern Asian origin of O2b*-SRY465, in accordance with its highest frequency distribution pattern in far-east Asia [14,35]. However, as to the issue of whether haplotype originated in prehistoric Korea or in the Japanese Archipelago within northeastern Asia, the most likely region can again be identified on the basis of the highest frequency and the highest diversity. Although there was an expected error due to small sample size, the Y-STR diversities (for example, mean number of pairwise differences and average variance ratio) were higher in Koreans than in Japanese (Table 5), consistent with previous results [13]. Both the haplogroup diversity and haplogroup frequency of O2b*-SRY465 thus suggest its early settlement in prehistoric Korea.
Table 5

10 Diversity of Y-chromosome short tandem repeats in haplogroup O2b* and O2b1 in Korean and Japanese populations

Diversity parametersaHaplogroup O2b*Haplogroup O2b1
Population (number of samples)Population (number of samples)


Korean (114)bJapanese (8)bJapanese (22)cKorean (45)bJapanese (37)bJapanese (66)c







Haplotype diversity, mean ± SD0.91 ± 0.020.96 ± 0.080.97 ± 0.020.91 ± 0.040.92 ± 0.030.91 ± 0.03
Number of pairwise differences, mean ± SD2.82 ± 1.501.89 ± 1.202.91 ± 1.592.30 ± 1.292.06 ± 1.182.01 ± 1.15
Allele size variation ratio0.240.120.300.250.150.17

aIndices were generated from 10 Y-chromosome short tandem repeat markers (excluding DYS385a/b, DYS448, DYS456, DYS458, DYS635 and GATA from the Yfiler system) to be consistent with Nonaka et al. (2007).

bPresent study

cNonaka et al. (2007)

10 Diversity of Y-chromosome short tandem repeats in haplogroup O2b* and O2b1 in Korean and Japanese populations aIndices were generated from 10 Y-chromosome short tandem repeat markers (excluding DYS385a/b, DYS448, DYS456, DYS458, DYS635 and GATA from the Yfiler system) to be consistent with Nonaka et al. (2007). bPresent study cNonaka et al. (2007) Age estimates based on our Y-STR data provide further support for the initial expansion of the O2b*-SRY465 in prehistoric Korea, in accordance with the proto-Korean lineages. Based on the Korean demographic and population history, a constant population in three different BATWING models may give the best fit to our data. Thus, time to most recent common ancestor (TMRCA) for the O2b*-SRY465 lineages was 9,900 years, assuming constant population size (Table 6). According to the three different population expansion models, TMRCA within the Korean and Japanese populations and the whole of East Asia ranged from 6,000-10,000 years ago. This date corresponds to the early Neolithic Age in Korea. Therefore, the age of the O2b*-SRY465 and pattern of variation within the lineages suggests a Neolithic proto-Korean founder in a nearby part of northeastern Asia, followed by a population increase in the vicinity of the Korean Peninsula.
Table 6

BATWING estimates

KoreanJapaneseEast Asian



O2bO2b1O2bO2b1O2bO2b1







Constant size9,949 (4,598-20,408)4,407 (1,928-10,514)5,912 (2,438-13,800)4,143 (1,725-9,279)9,804 (4,751-18,489)4,882 (2,592-8,834)
Always expanding7,767 (3,838-17,263)4,520 (2,198-9,281)6,106 (2,324-15,051)4,891 (1,840-11,906)8,165 (3,805-16,361)5,130 (2,553-10,152)
Constant then expanding9,541 (4,490-19,404)4,851 (2,493-9,394)6,707 (1,195-27,981)5,347 (924-22,288)10,479 (3,922-27,984)5,943 (1,997-17,010)
BATWING estimates Interestingly, the O2b1-47z sublineage seems to have diverged about 4,400 years ago (Table 6) rather than in the Yayoi period, consistent with a previous estimate of 4,000 years ago [13]. Therefore, the present data support the possibility of an ancient Korean origin of O2b1-47z, rather than a Japanese origin [13]. Although O2b1-47z is at its highest haplogroup frequency in the Japanese population, the Y-STR data reveal more diversity of O2b1-47z haplotypes in Koreans, as shown by the mean number of pairwise differences and allele size variation ratio (Table 5), supporting an origin of the O2b1-47z mutation in prehistoric Korea. The Japanese samples studied here were derived from Kyushu, Shikoku and southern Honshu (the region closest to Korea), implying that the high frequency of the O2b1 lineage in Japan may be explained by genetic drift [12,16]. This finding is concordant with a previous report of Nonaka et al. [36], showing less diversity in Japan than in Korea (Table 5). TMRCA of O2b1-47z, and a star-cluster pattern in this study (Figure 3) and a previous study [13], all suggest the possible association of O2b1-47z with the peopling of Korea. However, because most of the Japanese O2b*-SRY465 and O2b1-47z samples were also in the core (or close to) of the cluster (Figure 3), it cannot exclude the possibility that the Japanese and the Koreans derive from the same proto-population outside of Korea (carrying these lineages) at roughly the same time. Thus, further studies of sufficient sampling in the vast East Asian region and genomewide genotyping should provide further insights.

Conclusions

Our results support the idea that both haplogroups O2b*-SRY465 and O2b1-47z had an in situ origin among Northeast Asians, particularly among the prehistoric Koreans, rather than in southeast Asia or Japan as previously envisaged. The combination of the O2b initial settlement (which became an indigenous proto-Korean component) in part with the relatively recent O3 and C3 lineages (which include a Chinese component) explains some of the main events formulating the current Y chromosome composition of the Korean population. Thus, our findings are consistent with linguistic, archaeological and historical evidence, which suggest that the direct ancestors of Koreans were proto-Koreans who inhabited the northeastern region of China and the Korean Peninsula during the Neolithic (8,000-1,000 BC) and Bronze (1,500-400 BC) Ages.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

SHK, WNK and WKK conceived and designed the experiments; SHK, KCK, DJS, HJJ and KDK performed the experiments:. SHK, KCK, WNK and WKK analyzed the data;. MSH, JMS, WNK and WKK contributed reagents, materials and analysis tools; and SHK, MSH, WNK and WKK wrote the paper. All authors read and approved the final manuscript.

Additional file 1

Table S1. Thirteen Y-chromosome single-nucleotide polymorphism (Y-SNP) markers and primer information for PCR amplification. Click here for file

Additional file 2

Table S2. SBE primers for the detection of 13 Y-chromosome single-nucleotide polymorphisms (Y-SNPs) studied. Click here for file

Additional file 3

Table S3. Haplotypes for 25 Y-chromosome single-nucleotide polymorphisms (Y-SNPs) and 17 Y-chromosome short tandem repeat (Y-STR) loci in 1,108 males from East Asian populations. Click here for file

Additional file 4

Table S4. FST distances from haplogroup frequencies in East Asian populations (non-significant values are underlined). Click here for file
  27 in total

1.  Y chromosomal DNA variation in east Asian populations and its potential for inferring the peopling of Korea.

Authors:  W Kim; D J Shin; S Harihara; Y J Kim
Journal:  J Hum Genet       Date:  2000       Impact factor: 3.172

2.  The genetic legacy of the Mongols.

Authors:  Tatiana Zerjal; Yali Xue; Giorgio Bertorelle; R Spencer Wells; Weidong Bao; Suling Zhu; Raheel Qamar; Qasim Ayub; Aisha Mohyuddin; Songbin Fu; Pu Li; Nadira Yuldasheva; Ruslan Ruzibakiev; Jiujin Xu; Qunfang Shu; Ruofu Du; Huanming Yang; Matthew E Hurles; Elizabeth Robinson; Tudevdagva Gerelsaikhan; Bumbein Dashnyam; S Qasim Mehdi; Chris Tyler-Smith
Journal:  Am J Hum Genet       Date:  2003-01-17       Impact factor: 11.025

3.  Y chromosome homogeneity in the Korean population.

Authors:  Soon Hee Kim; Myun Soo Han; Wook Kim; Won Kim
Journal:  Int J Legal Med       Date:  2010-08-17       Impact factor: 2.686

4.  Y-chromosome evidence of southern origin of the East Asian-specific haplogroup O3-M122.

Authors:  Hong Shi; Yong-Li Dong; Bo Wen; Chun-Jie Xiao; Peter A Underhill; Pei-Dong Shen; Ranajit Chakraborty; Li Jin; Bing Su
Journal:  Am J Hum Genet       Date:  2005-07-14       Impact factor: 11.025

5.  Three major lineages of Asian Y chromosomes: implications for the peopling of east and southeast Asia.

Authors:  Atsushi Tajima; I-Hung Pan; Goonnapa Fucharoen; Supan Fucharoen; Masafumi Matsuo; Katsushi Tokunaga; Takeo Juji; Masanori Hayami; Keiichi Omoto; Satoshi Horai
Journal:  Hum Genet       Date:  2001-11-28       Impact factor: 4.132

6.  Y-chromosomal binary haplogroups in the Japanese population and their relationship to 16 Y-STR polymorphisms.

Authors:  I Nonaka; K Minaguchi; N Takezaki
Journal:  Ann Hum Genet       Date:  2007-02-02       Impact factor: 1.670

7.  Multiplex PCR amplification from the CFTR gene using DNA prepared from buccal brushes/swabs.

Authors:  B Richards; J Skoletsky; A P Shuber; R Balfour; R C Stern; H L Dorkin; R B Parad; D Witt; K W Klinger
Journal:  Hum Mol Genet       Date:  1993-02       Impact factor: 6.150

8.  Genealogical inference from microsatellite data.

Authors:  I J Wilson; D J Balding
Journal:  Genetics       Date:  1998-09       Impact factor: 4.562

9.  Y-Chromosome evidence for a northward migration of modern humans into Eastern Asia during the last Ice Age.

Authors:  B Su; J Xiao; P Underhill; R Deka; W Zhang; J Akey; W Huang; D Shen; D Lu; J Luo; J Chu; J Tan; P Shen; R Davis; L Cavalli-Sforza; R Chakraborty; M Xiong; R Du; P Oefner; Z Chen; L Jin
Journal:  Am J Hum Genet       Date:  1999-12       Impact factor: 11.025

10.  Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR Yfiler PCR amplification kit.

Authors:  Miriam Goedbloed; Mark Vermeulen; Rixun N Fang; Maria Lembring; Andreas Wollstein; Kaye Ballantyne; Oscar Lao; Silke Brauer; Carmen Krüger; Lutz Roewer; Rüdiger Lessig; Rafal Ploski; Tadeusz Dobosz; Lotte Henke; Jürgen Henke; Manohar R Furtado; Manfred Kayser
Journal:  Int J Legal Med       Date:  2009-03-26       Impact factor: 2.686

View more
  9 in total

1.  Genetic characteristics of Y-chromosome short tandem repeat haplotypes from cigarette butt samples presumed to be smoked by North Korean men.

Authors:  Kyu-Sik Jeong; Heejin Shin; Sung-Jin Lee; Hyo-Sook Kim; Jang-Yong Kim; Myun-Soo Han; Yang-Han Lee; Ki-Won Park; Byung-Won Chun
Journal:  Genes Genomics       Date:  2018-04-25       Impact factor: 1.839

2.  A new self-learning computational method for footprints of early human migration processes.

Authors:  Z Juhász; E Dudás; Horolma Pamjav
Journal:  Mol Genet Genomics       Date:  2018-07-04       Impact factor: 3.291

3.  Y-chromosome descent clusters and male differential reproductive success: young lineage expansions dominate Asian pastoral nomadic populations.

Authors:  Patricia Balaresque; Nicolas Poulet; Sylvain Cussat-Blanc; Patrice Gerard; Lluis Quintana-Murci; Evelyne Heyer; Mark A Jobling
Journal:  Eur J Hum Genet       Date:  2015-01-14       Impact factor: 4.246

4.  Understanding the Y chromosome variation in Korea--relevance of combined haplogroup and haplotype analyses.

Authors:  Myung Jin Park; Hwan Young Lee; Woo Ick Yang; Kyoung-Jin Shin
Journal:  Int J Legal Med       Date:  2012-05-09       Impact factor: 2.686

5.  The Ami and Yami aborigines of Taiwan and their genetic relationship to East Asian and Pacific populations.

Authors:  Kai Tätte; Ene Metspalu; Helen Post; Leire Palencia-Madrid; Javier Rodríguez Luis; Maere Reidla; Anneliis Rea; Erika Tamm; Everett J Moding; Marian M de Pancorbo; Ralph Garcia-Bertrand; Mait Metspalu; Rene J Herrera
Journal:  Eur J Hum Genet       Date:  2021-03-23       Impact factor: 5.351

6.  Patrilineal perspective on the Austronesian diffusion in Mainland Southeast Asia.

Authors:  Jun-Dong He; Min-Sheng Peng; Huy Ho Quang; Khoa Pham Dang; An Vu Trieu; Shi-Fang Wu; Jie-Qiong Jin; Robert W Murphy; Yong-Gang Yao; Ya-Ping Zhang
Journal:  PLoS One       Date:  2012-05-07       Impact factor: 3.240

7.  Divergence of East Asians and Europeans estimated using male- and female-specific genetic markers.

Authors:  Yoshio Tateno; Tomoyoshi Komiyama; Toru Katoh; Batmunkh Munkhbat; Akira Oka; Yuko Haida; Hiroyuki Kobayashi; Gen Tamiya; Hidetoshi Inoko
Journal:  Genome Biol Evol       Date:  2014-03       Impact factor: 3.416

8.  Taiwan Y-chromosomal DNA variation and its relationship with Island Southeast Asia.

Authors:  Jean A Trejaut; Estella S Poloni; Ju-Chen Yen; Ying-Hui Lai; Jun-Hun Loo; Chien-Liang Lee; Chun-Lin He; Marie Lin
Journal:  BMC Genet       Date:  2014-06-26       Impact factor: 2.797

9.  The Origin and Composition of Korean Ethnicity Analyzed by Ancient and Present-Day Genome Sequences.

Authors:  Jungeun Kim; Sungwon Jeon; Jae-Pil Choi; Asta Blazyte; Yeonsu Jeon; Jong-Il Kim; Jun Ohashi; Katsushi Tokunaga; Sumio Sugano; Suthat Fucharoen; Fahd Al-Mulla; Jong Bhak
Journal:  Genome Biol Evol       Date:  2020-05-01       Impact factor: 3.416

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.