Literature DB >> 32358724

Mitochondrial DNA control region variation in a population sample from Thailand.

Dirk Christopher Sauer1,2, Jana Naue1,2, Uta-Dorothee Immel3,4, Sabine Lutz-Bonengel5,6.   

Abstract

Mitochondrial DNA (mtDNA) control region sequences from hair samples of 213 individuals from Thailand were analyzed using Sanger sequencing. A total of 170 different haplotypes were identified, of which 146 occurred only once (unique haplotypes). The dataset showed a random match probability of 0.87% and a haplotype diversity of 0.9960. The samples were assigned to 85 different haplogroups with B5a, F1a1a, and M being the most frequent ones. Pairwise FST-values between this and other Southeast and East Asian populations revealed significant but relatively low differences, indicating a close relation. Heteroplasmic positions were observed in 12.2% of hair samples confirming the frequent appearance of heteroplasmic positions in hairs. This dataset will complement existing data as an mtDNA reference for forensic investigations.

Entities:  

Keywords:  Control region; Forensic database; Haplogroup; Mitochondrial DNA; Thailand

Mesh:

Substances:

Year:  2020        PMID: 32358724      PMCID: PMC7417404          DOI: 10.1007/s00414-020-02303-2

Source DB:  PubMed          Journal:  Int J Legal Med        ISSN: 0937-9827            Impact factor:   2.686


Introduction

Mitochondrial DNA (mtDNA) analysis has become a routine approach in forensic casework where STR markers cannot be used. The associated estimation of the frequency of obtained mtDNA profiles in the respective population sample is based on the availability of suitable population data sets. MtDNA data from Thailand has already been published [1-5]. However, these datasets either are limited to SNPs in the mitochondrial hypervariable regions 1 and 2 [1] or the collection strategy focuses on a different priority, such as language [2-4]. Furthermore, a forensic study on a population from Thailand was done in the northern province of Chiang Mai [5]. Given that the population of Thailand is composed of different ethnolinguistic groups [6], a regional population sample cannot be considered to be representative for the whole country. In this paper, we present a population dataset of 213 individuals living in all four major regions of Thailand.

Materials and methods

Samples

Hair samples were obtained from 213 unrelated individuals of both sexes living in southern, central, northern, and northeastern Thailand (Fig. 1, Supplementary Table S1). The samples were collected from volunteer donors and anonymized. Written informed consent was obtained from all participants. Ethical approval for mtDNA sequencing analysis was given by the Ethics Committee of the University of Freiburg, Germany (398/16).
Fig. 1

Schematic map of Thailand. The four major regions of Thailand and number of collected samples from each region are provided

Schematic map of Thailand. The four major regions of Thailand and number of collected samples from each region are provided

DNA extraction, amplification, and sequencing

Total DNA of 10–12 hairs per individual was extracted with the MagCore® Genomic DNA Tissue Kit (RBC Bioscience, New Taipei City, Taiwan) at the University of Khon Kaen (Khon Kaen, Thailand). PCR and sequencing of the entire control region were performed at the University Medical Center Freiburg – University of Freiburg (Freiburg, Germany) as described in [7] using primers given in Supplementary Tables S2 and S3.

Sequence analysis and haplogroup assignment

A total of 213 full double strand control region sequences were successfully analyzed and aligned to the revised Cambridge Reference Sequence (rCRS) [8, 9] using Sequencher V5.2.2 (GeneCodes, Ann Arbor, MI, USA). Alignment was done according to the revised and extended guidelines of the ISFG [10]. Haplogroups were assigned with Haplogrep 2 (www.haplogrep.uibk.ac.at) [11] and EMPOP v4/R12 (www.empop.online) [12] based on PhyloTree, build 17 [13]. Assignments were revised manually and conservative estimates of most recent common ancestors (MRCA) were chosen. MtDNA data quality was controlled using the EMPOP tool NETWORK [14]. Further quality control was done by the team of EMPOP at Medical University of Innsbruck. All 213 sequences are incorporated into the EMPOP database under the accession number EMP00699.

Statistical analysis

Intra- and inter-population statistical analysis was done using Arlequin v3.5.2.2 [15]. Numbers of different and unique haplotypes were counted and genetic diversity indices of the population (random match probability, haplotype diversity, number of polymorphic positions, mean number of pairwise differences, and nucleotide diversity) were calculated. Length variants at nucleotide positions 16193, 309, and 573 were ignored for statistical tests. Random match probability was calculated as the sum of squared haplotype frequencies. We compared our data with six other Southeast and East Asian populations from recent studies [5, 16–20] only considering control region data (np 16024–576). We performed a molecular variance analysis (AMOVA) and calculated genetic diversity indices for the additional included studies, pairwise differences between and within populations, and pairwise FST-values.

Results and discussion

We obtained 213 high-quality mtDNA control region sequences from Thailand to establish reference data (Supplementary Table S1). Summary statistics are presented in Table 1. From a total of 170 different haplotypes, 146 were unique. The population sample had a random match probability of 0.87% and a haplotype diversity of 0.9960 ± 0.0013 revealing a high heterogeneity in the population making it useful for forensic analyses.
Table 1

Molecular diversity indices for mtDNA control region data of a population sample from Thailand

Population statistics*Thailand (n = 213)
Number of different haplotypes170
Unique haplotypes; portion of samples146 (68.5%)
Random match probability0.87%
Haplotype diversity0.9960 ± 0.0013
Number of polymorphic positions (including indels)196 of 1134
Mean number of pairwise differences12.86 ± 5.82
Nucleotide diversity0.011337 ± 0.005674

*Insertions at nucleotide positions 16193, 309, and 573 were disregarded

Molecular diversity indices for mtDNA control region data of a population sample from Thailand *Insertions at nucleotide positions 16193, 309, and 573 were disregarded We compared the detected haplotypes with those of six earlier studies of Southeast and East Asian populations including one with samples from Northern Thailand [5, 16–20]. A total of 44 of the 170 haplotypes (25.9%) were found in at least one other population (Supplementary Table S4). Accordingly, 126 haplotypes (74.1%) of our study were not observed in the other studies, including the most common haplotype of our study (9 samples) (cf. Supplementary Table S1).

Haplogroup composition

The 213 samples from Thailand were assigned to 85 different haplogroups (Supplementary Table S1). Some sequences could not be classified to a terminal branch of the PhyloTree [13] and were assigned to their MRCA such as macrohaplogroup M (8.9%). The most frequent terminally assigned haplogroups were B5a (9.4%), F1a1a (8.9%), and M (8.9%). All samples belong to macrohaplogroups R (50.7%), M (39.4%), and N (9.4%) except of one sample which could only be assigned to L3 as MRCA. In general, the haplogroup composition is comparable with other Southeast Asian population samples with F1, B5, and M being the most frequent lineages [2, 4, 5, 16–19].

Genetic distances between Southeast and East Asian populations

We compared the genetic structures of our population sample from Thailand with the six other Southeast and East Asian population samples [5, 16–20]. The total number of samples was 1789. Analysis of molecular variance (AMOVA) revealed that 98.06% of the genetic variation is due to differences within populations. Thus, only 1.94% of the total genetic variance is caused by differences between populations (Table 2). Bodner et al. (2011) had found an inter-population variance of only 0.84% in a very similar dataset, but only considering hypervariable regions (HVS-I and HVS-II) and a regional restricted population [16].
Table 2

Analysis of molecular variance (AMOVA) of 7 Southeast and East Asian populations (analyzed range: np 16024–576). AMOVA design and results

Source of variationDegrees of freedomSum of squaresVariance componentsPercentage of variation
Among populations6229.2380.12603b1.94a,b
Within populations178211,375.4676.3835498.06
Total178811,604.7056.50957

aCorresponds to FST of 0.0194

bp value < 0.001, number of permutations: 1023

Analysis of molecular variance (AMOVA) of 7 Southeast and East Asian populations (analyzed range: np 16024–576). AMOVA design and results aCorresponds to FST of 0.0194 bp value < 0.001, number of permutations: 1023 The number of mean pairwise differences (MPD) in the Thai population is 12.86, which is in the dimension of that observed in other Asian populations (Table 3). The lowest MPD value (11.75) was observed in a South Korean population, the highest (13.42) in Northern Thailand.
Table 3

Analysis of molecular variance (AMOVA) of 7 Southeast and East Asian populations (analyzed range: np 16024–576). Mean number of pairwise differences (MPD) between seven Southeast and East Asian populations

PopulationThailandNorthern ThailandLaosNorthern VietnamMyanmarHong KongSouth Korea
ReferenceThis study[5][16][17][18][19][20]
Thailand12.8613.1913.0712.9112.7513.1712.85
Northern Thailand0.0513.4213.4013.2013.0913.3613.07
Laos0.050.0913.1813.0013.0713.3313.07
Northern Thailand0.070.07− 0.01*12.8312.8713.1012.83
Myanmar0.100.150.250.2412.4513.1012.75
Hong Kong0.170.070.160.110.3013.1612.65
South Korea0.550.480.600.540.650.2011.75

Above diagonal: Average number of pairwise differences between populations (XY)

Diagonal elements: Average number of pairwise differences within population (X)

Below diagonal: Corrected average pairwise difference (XY-(X+Y)/2)

*Corrected p value > 0.05

Analysis of molecular variance (AMOVA) of 7 Southeast and East Asian populations (analyzed range: np 16024–576). Mean number of pairwise differences (MPD) between seven Southeast and East Asian populations Above diagonal: Average number of pairwise differences between populations (XY) Diagonal elements: Average number of pairwise differences within population (X) Below diagonal: Corrected average pairwise difference (XY-(X+Y)/2) *Corrected p value > 0.05 Pairwise FST-values between population samples were relatively low and similar, indicating a close relation between populations. Higher variance was found between Southeast Asian populations (Thailand, Northern Thailand, Laos, Northern Vietnam, and Myanmar) and South Korea (FST 0.037–0.051), whereas genetic variance between Hong Kong and South Korea was low (FST 0.016). All FST-values between Thailand and other population samples were significant (Table 4).
Table 4

Analysis of molecular variance (AMOVA) of 7 Southeast and East Asian populations (analyzed range: np 16024–576). Population pairwise FST-values

Population:ThailandNorthern ThailandLaosNorthern VietnamMyanmarHong KongSouth Korea
Reference:This study[5][16][17][18][19][20]
Thailand*0.0040.0040.0050.0080.0130.043
Northern Thailand0.027*0.0070.0050.0120.0050.037
Laos0.0280.006*− 0.0010.0190.0120.046
Northern Vietnam0.0150.0120.598*0.0180.0080.042
Myanmar0.0000.0000.0000.000*0.0230.051
Hong Kong0.0000.0040.0000.0000.000*0.016
South Korea0.0000.0000.0000.0000.0000.000*

Above diagonal: Population pairwise FST-values

Below diagonal: p values (significance level = 0.05; number of permutations: 1023)

Analysis of molecular variance (AMOVA) of 7 Southeast and East Asian populations (analyzed range: np 16024–576). Population pairwise FST-values Above diagonal: Population pairwise FST-values Below diagonal: p values (significance level = 0.05; number of permutations: 1023)

Heteroplasmy

We observed a total of 27 point heteroplasmies at 14 different positions in 26 hair samples (12.2%) (Supplementary Table S5). In a former study including 691 hair shaft samples, the frequency of point heteroplasmy was 11.4% [21]. Based on other studies using various types of tissue material, in Southeast and East Asian populations, the percentage of samples with point heteroplasmy was calculated as follows: Northern Thailand 2.6% (blood), Laos 3.7% (blood), Hong Kong 8.5% (blood), Myanmar 8.6% (blood), South Korea 10.3% (blood and buccal swabs), and Northern Vietnam 15.0% (buccal swabs) [5, 16–20]. These values confirm that mtDNA heteroplasmy frequency is dependent on the analyzed tissue as specified in [7]. However, it has to be considered that low level heteroplasmic positions were not detected using Sanger sequencing.

Conclusion

The sample of 213 mtDNA control region sequences will serve as a high-quality mtDNA reference for Thailand. Most of the detected haplotypes were unique within the known data and will complement the available data from Northern Thailand [5] and other Southeast Asian populations. Recently, 960 complete mtDNA genomes from Thailand originally sequenced to investigate anthropologic questions [4] were also incorporated into the EMPOP database [14]. The increasing data of Southeast Asian mtDNA sequences is leading to a reliable forensic reference for this region. (DOCX 62 kb) (XLSX 22 kb)
  15 in total

1.  Forensic and phylogeographic characterization of mtDNA lineages from northern Thailand (Chiang Mai).

Authors:  Bettina Zimmermann; Martin Bodner; Sylvain Amory; Liane Fendt; Alexander Röck; David Horst; Basil Horst; Torpong Sanguansermsri; Walther Parson; Anita Brandstätter
Journal:  Int J Legal Med       Date:  2009-11       Impact factor: 2.686

2.  Single nucleotide polymorphisms minisequencing in hypervariable regions for screening of Thais.

Authors:  Punlop Thongngam; Worraanong Leewattanapasuk; Tanin Bhoopat; Padchanee Sangthong
Journal:  Gene       Date:  2017-07-08       Impact factor: 3.688

3.  DNA Commission of the International Society for Forensic Genetics: revised and extended guidelines for mitochondrial DNA typing.

Authors:  W Parson; L Gusmão; D R Hares; J A Irwin; W R Mayr; N Morling; E Pokorak; M Prinz; A Salas; P M Schneider; T J Parsons
Journal:  Forensic Sci Int Genet       Date:  2014-07-29       Impact factor: 4.882

4.  Evidence for frequent and tissue-specific sequence heteroplasmy in human mitochondrial DNA.

Authors:  Jana Naue; Steffen Hörer; Timo Sänger; Christina Strobl; Petra Hatzer-Grubwieser; Walther Parson; Sabine Lutz-Bonengel
Journal:  Mitochondrion       Date:  2014-12-16       Impact factor: 4.160

5.  Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation.

Authors:  Mannis van Oven; Manfred Kayser
Journal:  Hum Mutat       Date:  2009-02       Impact factor: 4.878

6.  Next generation database search algorithm for forensic mitogenome analyses.

Authors:  Nicole Huber; Walther Parson; Arne Dür
Journal:  Forensic Sci Int Genet       Date:  2018-09-09       Impact factor: 4.882

7.  HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing.

Authors:  Hansi Weissensteiner; Dominic Pacher; Anita Kloss-Brandstätter; Lukas Forer; Günther Specht; Hans-Jürgen Bandelt; Florian Kronenberg; Antonio Salas; Sebastian Schönherr
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

8.  Complete mitochondrial genomes of Thai and Lao populations indicate an ancient origin of Austroasiatic groups and demic diffusion in the spread of Tai-Kadai languages.

Authors:  Wibhu Kutanan; Jatupol Kampuansai; Metawee Srikummool; Daoroong Kangwanpong; Silvia Ghirotto; Andrea Brunelli; Mark Stoneking
Journal:  Hum Genet       Date:  2016-11-11       Impact factor: 4.132

9.  Contrasting maternal and paternal genetic variation of hunter-gatherer groups in Thailand.

Authors:  Wibhu Kutanan; Jatupol Kampuansai; Piya Changmai; Pavel Flegontov; Roland Schröder; Enrico Macholdt; Alexander Hübner; Daoroong Kangwanpong; Mark Stoneking
Journal:  Sci Rep       Date:  2018-01-24       Impact factor: 4.379

10.  New insights from Thailand into the maternal genetic history of Mainland Southeast Asia.

Authors:  Wibhu Kutanan; Jatupol Kampuansai; Andrea Brunelli; Silvia Ghirotto; Pittayawat Pittayaporn; Sukhum Ruangchai; Roland Schröder; Enrico Macholdt; Metawee Srikummool; Daoroong Kangwanpong; Alexander Hübner; Leonardo Arias; Mark Stoneking
Journal:  Eur J Hum Genet       Date:  2018-02-26       Impact factor: 4.246

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.