| Literature DB >> 28244283 |
Sohyung Park1, Sohee Cho2, Hee Jin Seo3, Ji Hyun Lee3, Moon Young Kim3, Soong Deok Lee2,4.
Abstract
Mitochondrial DNA (mtDNA) genome analysis has been a potent tool in forensic practice as well as in the understanding of human phylogeny in the maternal lineage. The traditional mtDNA analysis is focused on the control region, but the introduction of massive parallel sequencing (MPS) has made the typing of the entire mtDNA genome (mtGenome) more accessible for routine analysis. The complete mtDNA information can provide large amounts of novel genetic data for diverse populations as well as improved discrimination power for identification. The genetic diversity of the mtDNA sequence in different ethnic populations has been revealed through MPS analysis, but the Korean population not only has limited MPS data for the entire mtGenome, the existing data is mainly focused on the control region. In this study, the complete mtGenome data for 186 Koreans, obtained using Ion Torrent Personal Genome Machine (PGM) technology and retrieved from rather common mtDNA haplogroups based on the control region sequence, are described. The results showed that 24 haplogroups, determined with hypervariable regions only, branched into 47 subhaplogroups, and point heteroplasmy was more frequent in the coding regions. In addition, sequence variations in the coding regions observed in this study were compared with those presented in other reports on different populations, and there were similar features observed in the sequence variants for the predominant haplogroups among East Asian populations, such as Haplogroup D and macrohaplogroups M9, G, and D. This study is expected to be the trigger for the development of Korean specific mtGenome data followed by numerous future studies.Entities:
Keywords: Heteroplasmy; Ion Torrent PGM; Korean; Massive Parallel Sequencing; Mitochondrial DNA; mtDNA Haplogroup
Mesh:
Substances:
Year: 2017 PMID: 28244283 PMCID: PMC5334155 DOI: 10.3346/jkms.2017.32.4.587
Source DB: PubMed Journal: J Korean Med Sci ISSN: 1011-8934 Impact factor: 2.153
Haplogroup distribution across 186 Korean individuals in this study
| Haplogroup | No. | Frequency, % | References |
|---|---|---|---|
| D4a | 40 | 27.96 | |
| D4a3 | 3 | - | |
| D4b1 | 6 | - | |
| D5b | 3 | - | |
| A5a | 20 | 10.75 | |
| M7a1 | 9 | 10.22 | |
| M7b1 | 10 | - | |
| F1b1 | 16 | 8.6 | |
| G1a1 | 19 | 10.22 | |
| B4a | 16 | 10.75 | |
| B4c1c | 4 | - | |
| N9a2a | 11 | 5.91 | |
| M9;G;D | 17 | 9.14 | |
| Y1 | 12 | 6.45 | |
| Total | 186 | - | - |
No. = number of samples.
Details of miscalled variants on VCF report
| Variant caller | Interpretation | Haplogroup containing variants | Total No. of variants |
|---|---|---|---|
| Not reported | 71.1G | B4a | 3 |
| 247GA → 247G | 249del | F1b1 | 16 |
| Not reported | 455.1T | D4 | 3 |
| 513GCA → 513G | 523–524del | A5a, B4a, D4b, F1b1, G1a1, M7a1, M9;G;D | 73 |
| Not reported | 573.1C | A5a, M9;G;D | 3 |
| Not reported | 960.1C | N9a2a | 11 |
| 2150T → 2150TA | 2156.1A | A5a | 20 |
| 4246ATT → AT, ATC | 4248C | A5a | 20 |
| 5894A → 5894AC | 5899.1C | M7a1 | 9 |
| 8270CACCCCCTCT → 8270C | 8281–8289del | B4a, B4c | 20 |
| 9540C or 9540T/C | 9540C | D4 | 11 |
| Not reported | 16182C, 16183C | B4, F1b1, M7b, D5b | 40 |
VCF = variant call format.
Distribution of homoplasmic variants within control, coding, and entire regions of mtDNA
| Haplogroup | CR | CodR | Entire mtGenome | CodR:CR ratio | |||
|---|---|---|---|---|---|---|---|
| No. | % | No. | % | No. | % | ||
| D4a, D4a3, D4b1, D5b | 446 | 22.32 | 1,552 | 77.68 | 1,998 | 100.00 | 3.48 |
| A5a | 168 | 27.50 | 443 | 72.50 | 611 | 100.00 | 2.64 |
| M7a1, M7b1 | 168 | 21.71 | 606 | 78.29 | 774 | 100.00 | 3.61 |
| F1b1 | 212 | 34.81 | 397 | 65.19 | 609 | 100.00 | 1.87 |
| G1a1 | 171 | 22.35 | 594 | 77.65 | 765 | 100.00 | 3.47 |
| B4a, B4c1c | 204 | 39.77 | 309 | 60.23 | 513 | 100.00 | 1.51 |
| N9a2a | 97 | 32.44 | 202 | 67.56 | 299 | 100.00 | 2.08 |
| M9;G;D | 101 | 17.88 | 464 | 82.12 | 565 | 100.00 | 4.59 |
| Y1 | 115 | 31.86 | 246 | 68.14 | 361 | 100.00 | 2.14 |
| Total | 1,682 | 25.90 | 4,813 | 74.10 | 6,495 | 100.00 | 2.86 |
mtDNA = mitochondrial DNA, No. = number of samples, CodR = coding region, CR = control region, mtGenome = mtDNA genome.
Haplogroup resolution based on entire mtGenome variants
| CR (No.) | Entire mtGenome (No.) |
|---|---|
| D4a (40) | D4a (3), D4a1 (11), D4a1a1 (2), D4a1b (10), D4a1c (4), D4a2 (8), D4a2a (1), D4f1 (1) |
| D4a3 (3) | D4a3b (3) |
| D4b1 (6) | D4b1a1 (6) |
| D5b (3) | D5b1b (3) |
| A5a (20) | A5a (6), A5a1a (11), A5a1a1 (1), A5a1a2 (1), A5a3 (1) |
| M7a1 (9) | M7a1a (4), M7a1a1 (1), M7a1a5a (1), M7a1a9 (3) |
| M7b1 (10) | M7b1a1a (1), M7b1a1a1 (9) |
| F1b1 (16) | F1b1a (1), F1b1a1 (4), F1b1a1a (4), F1b1a1a1 (1), F1b1a1a2 (2), F1b1a2 (4) |
| G1a1 (19) | G1a1a (11), G1a1a1 (8) |
| B4a (16) | B4a (4), B4+16261 (2), B4a1c1a (4), B4a1c1a1 (6) |
| B4c1c (4) | B4c1c (4) |
| N9a2a (11) | N9a2a (10), N9a2a2 (1) |
| M9;G;D (17) | D4b2a2 (1), D4d (4), D4e2 (2), D4e2a (2), D4f1 (3), D4j (5) |
| Y1 (12) | Y1 (9), Y1a (1), Y1b1 (1), Y1b1a (1) |
CR = control region, No. = number of samples, mtGenome = mtDNA genome.
Distribution of PHP
| Haplogroup | No. | No. of PHP | |||
|---|---|---|---|---|---|
| Total | Individuals with PHP | CR | CodR | Entire | |
| D4a, D4a3, D4b1, D5b | 52 | 11 (21.15) | 4 | 9 | 13 |
| A5a | 20 | 4 (20.00) | 1 | 3 | 4 |
| M7a1, M7b1 | 19 | 5 (26.32) | 1 | 4 | 5 |
| F1b1 | 16 | 1 (6.25) | 0 | 3 | 3 |
| G1a1 | 19 | 4 (21.05) | 5 | 4 | 9 |
| B4a, B4c1c | 20 | 6 (30.00) | 3 | 3 | 6 |
| N9a2a | 11 | 4 (36.36) | 1 | 5 | 6 |
| M9;G;D | 17 | 5 (29.41) | 1 | 5 | 6 |
| Y1 | 12 | 0 (0.00) | 0 | 0 | 0 |
| Total | 186 | 40 (21.51) | 16 (30.77) | 36 (69.23) | 52 (100) |
Values are presented as number (%).
No. = number of samples, PHP = point heteroplasmy, CodR = coding region, CR = control region.