Literature DB >> 26387877

Simultaneous Whole Mitochondrial Genome Sequencing with Short Overlapping Amplicons Suitable for Degraded DNA Using the Ion Torrent Personal Genome Machine.

Lakshmi Chaitanya¹, Arwin Ralf¹, Mannis van Oven¹, Tomasz Kupiec², Joseph Chang³, Robert Lagacé³, Manfred Kayser¹.

Abstract

Whole mitochondrial (mt) genome analysis enables a considerable increase in analysis throughput, and improves the discriminatory power to the maximum possible phylogenetic resolution. Most established protocols on the different massively parallel sequencing (MPS) platforms, however, invariably involve the PCR amplification of large fragments, typically several kilobases in size, which may fail due to mtDNA fragmentation in the available degraded materials. We introduce a MPS tiling approach for simultaneous whole human mt genome sequencing using 161 short overlapping amplicons (average 200 bp) with the Ion Torrent Personal Genome Machine. We illustrate the performance of this new method by sequencing 20 DNA samples belonging to different worldwide mtDNA haplogroups. Additional quality control, particularly regarding the potential detection of nuclear insertions of mtDNA (NUMTs), was performed by comparative MPS analysis using the conventional long-range amplification method. Preliminary sensitivity testing revealed that detailed haplogroup inference was feasible with 100 pg genomic input DNA. Complete mt genome coverage was achieved from DNA samples experimentally degraded down to genomic fragment sizes of about 220 bp, and up to 90% coverage from naturally degraded samples. Overall, we introduce a new approach for whole mt genome MPS analysis from degraded and nondegraded materials relevant to resolve and infer maternal genetic ancestry at complete resolution in anthropological, evolutionary, medical, and forensic applications.

Entities: Chemical Disease Gene Mutation Species

Keywords: MPS; NGS; massively parallel sequencing; mitochondria; mtDNA; next-generation sequencing

Mesh：

Substances：
DNA, Mitochondrial

Year: 2015 PMID： 26387877 PMCID： PMC5057296 DOI： 10.1002/humu.22905

Source DB: PubMed Journal: Hum Mutat ISSN： 1059-7794 Impact factor: 4.878

Introduction

Previous years have witnessed conspicuous progress in the establishment of the high‐copy‐number human mitochondrial DNA (mtDNA) as an imperative tool in forensic, anthropological, and medical genetics. This small, circular, double‐stranded genome has enthralled fundamental and applied geneticists with its unique features, bolstering its recognized value as a useful molecular marker when analyzing degraded or low‐copy‐number DNA samples, as often confronted at crime scenes, from old and ancient human remains, and from limited samples available in medical and anthropological applications. The higher mutation rate of mtDNA as compared with nuclear DNA, and the resulting significant sequence variability between maternally unrelated individuals, has made mtDNA an indispensable tool in various applications particularly forensics and anthropology [Wilson et al., 1995a, 1995b; Holland and Parsons, 1999; Kivisild, 2015]. Most applications employ traditional Sanger sequencing of (the hypervariable portions of) the noncoding control region of the mt genome to delineate haplotypes that enable maternal lineage identification [e.g., Gill et al., 1994; Wilson et al., 1995a; Holland and Parsons, 1999; Brandstätter et al., 2007; Palo et al., 2007]. In addition to being labor‐intensive and expensive, the control‐region Sanger sequencing approach does not always allow reliable inference of maternal haplogroups, due to the high level of homoplasy in the control region that can obscure phylogenetic signatures [Schlebusch et al., 2009; King et al., 2014] as well as the fact that many haplogroup‐defining variants are located outside the control region. Since not all the major mtDNA haplogroups can be distinguished through control‐region sequencing only, several studies [Schlebusch et al., 2009; van Oven et al., 2011; Ballantyne et al., 2012] have resorted to the use of multiplex single‐base primer extension assays for simultaneous genotyping of a limited number of mtDNA coding‐region SNPs to improve maternal haplogroup definition. However, such assays are hampered by technical limitations in terms of multiplexing capacity allowing not more than 20–40 SNPs in a single multiplex assay. Whole mtDNA genome sequencing would augment the haplotype and haplogroup resolution to the maximum possible resolution level, thereby greatly enhancing the discriminatory power, and allowing inference of matrilineal biogeographic ancestry at a greater resolution; however, whole mtDNA genome analysis via Sanger sequencing is highly labor‐intensive given the >16 kb involved [Fendt et al., 2009; Ramos et al., 2009]. Massively parallel sequencing (MPS) or next‐generation sequencing (NGS) technology in principle provides a solution to simultaneous whole mtDNA genome analysis allowing high‐throughput analysis at reduced per‐sample costs. It had been pioneered in the field of ancient DNA analysis where degradation problems are most severe. The shotgun‐sequencing approach using MPS technology has facilitated the complete mtDNA genome analysis of extinct nonhuman species such as mammoths [Gilbert et al., 2007, 2008], cave bears, and others [Miller et al., 2009; Stiller et al., 2009; Willerslev et al., 2009; Lindqvist et al., 2010; Ho & Gilbert, 2010] as well as extinct human species such as Neanderthals [Green et al., 2008]. Recently, Davis et al. (2015) used the MPS approach to analyze the hypervariable segments (HVS)‐I and II of human mtDNA using short amplicons in tissue and bone samples keeping with the previously discussed limitations of partial mtDNA analysis. However, the most established currently available MPS protocols for whole mtDNA genome sequencing are based on PCR amplification of large mtDNA fragments, typically several kilobases in size [Sosa et al., 2012; Parson et al., 2013], which fail when applied to degraded DNA as encountered in many mtDNA applications. A very recent publication [Parson et al., 2015] described a midi‐sized amplicon approach for whole mtDNA MPS analysis using 62 PCR amplicons of 300–500 bp (average about 380 bp) in two multiplex assays, which proved useful for human hair analysis. However, many DNA samples encountered in mtDNA testing, especially for forensic and anthropological purposes are more severely degraded resulting in smaller‐sized fragments. Recently, we introduced a MPS tool based on short amplicons (203 bp on average) using the Ion Torrent Personal Genome Machine (PGMTM) for simultaneous analysis of >530 Y‐chromosome SNPs covering the entire phylogenetic Y‐chromosome tree that allows classification of Y chromosomes into >430 worldwide Y haplogroups for ultra‐high‐resolution paternal lineage and paternal ancestry inference [Ralf et al., 2015]. As a maternal counterpart, with this study we introduce a short amplicon‐based MPS tiling approach for simultaneous whole mtDNA genome analysis using the Ion Torrent PGMTM allowing to obtain maximum‐resolution maternal lineage and maternal ancestry inference from degraded and nondegraded DNA. We tested the performance of the newly developed method using geographically diverse DNA samples with available whole mtDNA genome data based on Sanger sequencing, and using samples for which we generated whole mtDNA genome data based on an alternative MPS protocol. Additionally, we assessed the efficacy of sample pooling, the sensitivity, and the robustness of our new approach regarding experimental and natural DNA degradation.

Materials and Methods

Primer Design

Primer pairs were designed for a total of 161 partially overlapping amplicons in two separate primer pools (pool 1 with 80 pairs and pool 2 with 81 pairs). Each individual primer pool produces a battery of amplicons separated by gaps across the entire mt genome. By combining the PCR amplicons resulting from each primer pool, the gaps were complemented by amplicons from the other primer pool and the entire 16.5‐kb mt genome could be covered. Amplicons of comparable sizes were preferred (∼200 bp); however, some regions proved to be difficult to amplify because of, for example, high GC content or repetitive sequences. Also, highly polymorphic positions need to be avoided to overlap with primer annealing sites as much as possible; to overcome these issues, primer positions had to be shifted leading to more variation in amplicon size. Several rounds of test sequencing and primer redesigning were needed to get to the final protocol. All primers were synthesized by Life Technologies, a part of Thermo Fisher Scientific Inc., using their proprietary AmpliSeq™ modification, which allows multiplexing hundreds or even thousands of targets in one reaction. The primers were all received at a concentration of 614 μM. For each pool, 1 μl of each primer is combined and each pool is then brought to a volume of 3,070 μl. Each primer is now at a concentration of 200 nM. For the 20‐μl PCR, 10 μl of primer pool is used, so the final concentration in the PCR is 100 nM for each primer. Supp. Table S1 outlines the primer sequences used in this study without the proprietary modification. In the final version, and with the primer pools 1 and 2 together, the entire human mtDNA genome is covered via 161 overlapping amplicons with fragment size of 144–230 bp (average across amplicons: 200 bp).

DNA Samples

With the aim to cover the global variation of the mtDNA phylogeny [van Oven and Kayser 2009], 20 DNA samples known to belong to widely divergent haplogroups were selected from four different sources: the Centre d'Etude du Polymorphisme Humain (CEPH, Paris, France) Human Genome Diversity Project (HGDP) panel (http://www.ceph.fr/HGDP‐CEPH‐Panel), individuals from across The Netherlands [Lao et al., 2013], the commercially available Ethnic Diversity DNA panel (EDP‐1) manufactured by the European Collection of Cell Cultures (ECACC) (http://www.phe‐culturecollections.org.uk/products/dna/ethnicdna.jsp), and distributed by Sigma, and volunteers recruited at the Erasmus MC with informed consent.

Library Construction

DNA libraries were constructed with 10 ng of input DNA using the Ion Ampliseq™ Library Kit 2.0 (Life Technologies, a part of Thermo Fisher Scientific Inc., Foster City, CA) and 18 cycles with two separate primer pools, amplifying the 161 amplicons simultaneously, following the manufacturer's recommendations. The quantity of DNA was determined using the Qubit® dsDNA BR Quantification Kit and a Qubit® 2.0 Fluorometer (Invitrogen, Life Technologies, a part of Thermo Fisher Scientific Inc., Grand Island, NY). In order to test the feasibility of sequencing multiple samples simultaneously, Ion Xpress™ Barcode Adapters (Life Technologies, a part of Thermo Fisher Scientific Inc.) were used. To evaluate the sensitivity of the tiling approach, variable amounts of DNA input ranging from 10 ng to 100 pg were run and their sequences were evaluated. Three of the 20 DNA samples, hailing from different populations—Italy (haplogroup, T2b), The Netherlands (haplogroup, V3c), and South Africa (haplogroup, L3d3a1a), were sequenced at 10 and 1 ng, 500, 250, and 100 pg of genomic DNA input. All the DNA dilutions were quantified and confirmed in duplicates with the Quantifiler® Human DNA Quantification kit (Applied Biosystems, Foster City, CA) following the manufacturer's guidelines. The robustness of the assay to successfully type degraded materials was tested by subjecting an additional DNA sample (in this case from an Indian individual belonging to mtDNA haplogroup M35a) to DNase I treatment at different time intervals: 5, 10, 15, 20, 30, and 45 min. Three microliter of each of these DNase‐treated samples was used for the MPS. Furthermore, sample 15 (from Italy with haplogroup T2b) was subjected to two different degradation methods—exposure to ultraviolet (UV) light for 30 min using a Bio‐Link (Vilber Lourmat) at a strength of 50 J/cm2 and enzymatic shearing using the Ion Shear™ Plus Reagents Kit (Life Technologies, a part of Thermo Fisher Scientific Inc.). Three microliter of each of the degraded samples was used for the NGS. In addition, we applied our whole mt genome MPS method to DNA extracted from six different bones and teeth samples from naturally degraded human remains found in Poland. Samples 1, 2, 4, 5, and 6 were extracted from teeth, whereas sample 3 was extracted from a femoral bone. Samples 1, 3, 4, and 5 gave a full profile with the AmpFℓSTR® NGM™ (Applied Biosystems), whereas samples 2 and 6 gave partial profiles with only 4 and 6 loci, respectively, out of the 16 loci (15 STRs plus amelogenin) of a full NGM profile. For samples 4, 5, and 6, the control region was analyzed via the Sanger sequencing (unpublished study). The remains that gave rise to sample 2 are believed to be from an archaeological stand from the XV–XVI century. Sample 5 likely comes from a recently deceased individual as decomposed soft tissue was found together with the hard tissues. The remains that gave rise to sample 1 were found in soil and the time since death was reported to be approximately 7–8 months, whereas those of sample 3 were found in a river. The samples 1–6 were PCR‐quantified using the Quantifiler® Human DNA Quantification Kit (Applied Biosystems) and their concentrations were 0.05, 0.1, 0.045, 5.4, 1.82, and 0.4 ng/μl, respectively. For MPS, we used 4 μl of each of the samples. Furthermore, as a comparison to currently available NGS assays, five out of the 20 DNA samples were also sequenced using a different library preparation method. The entire mtDNA genome was amplified with two overlapping >8 kb fragments using the amplification primers from Fendt et al. (2009). The SequalPrep™ Long PCR Kit with dNTPs (Invitrogen) was used for the long‐range amplification following the manufacturer's guidelines. The amplified samples were quantified using the Qubit® dsDNA BR Quantification Kit and a Qubit® 2.0 Fluorometer (Invitrogen) and the amplified samples were normalized to 100 ng of input DNA. Libraries were constructed using the Ion Shear™ Plus Reagents Kit (Life Technologies, a part of Thermo Fisher Scientific Inc.) for enzymatic fragmentation of DNA, followed by barcoded adapter ligation using the Ion Xpress™ Barcode Adapters (Life Technologies, a part of Thermo Fisher Scientific Inc.) and Ion Plus Fragment Library Kit (Life Technologies, a part of Thermo Fisher Scientific Inc.) as per the manufacturer's protocol. The libraries were size selected using the E‐Gel® SizeSelect™ 2% Agarose Gel (Invitrogen, Carlsbad, CA).

Template Preparation

Using emulsion PCR, the generated libraries are attached to beads and further amplified. The concentration of each of the libraries was determined with qPCR using the Ion Library Quantitation Kit (Life Technologies, a part of Thermo Fisher Scientific Inc.), and the template dilution factors were calculated. High‐quality templated Ion Sphere™ particles, containing massively parallel clonally amplified DNA, were prepared for the 200 base‐read libraries using the Ion PGM™ Template OT2 200 Kit and the Ion OneTouch™ 2 System (Life Technologies, a part of Thermo Fisher Scientific Inc.). The template‐positive Ion Sphere™ particles were enriched with the Ion OneTouch™ Enrichment System (Life Technologies, a part of Thermo Fisher Scientific Inc.) as per the manufacturer's guidelines.

PGMTM Sequencing

Semiconductor sequencing on the Ion PGMTM detects the change in pH when the proton (H+) is released during nucleotide incorporation. The sequencing was conducted on the Ion 318™ Chip v2 using the Ion PGM™ Hi‐Q™ Sequencing Kit (six barcoded samples per chip) coupled with the PGMTM sequencer (Life Technologies, a part of Thermo Fisher Scientific Inc.). A chlorite cleaning, followed by 18 MΩ water wash (Thermo Fisher Scientific, Millipore, MA) was performed prior to the initialization of the Ion PGM™ System.

MPS Data Analysis

Commercially available NextGENe® software (v2.4.0.2) was used to analyze all the sequences generated on the Ion PGMTM sequencer. All the sequences were trimmed by 20 bases on both the 3′ and the 5′ ends and aligned to the mtDNA revised Cambridge Reference Sequence (rCRS) [Andrews et al., 1999] (GenBank NC_012920.1). The arbitrary threshold of 50 reads was considered as a minimum required to address full sequencing coverage to reliably report variants/polymorphisms. This threshold is the coverage minimum to be able to see heteroplasmies and to be able to clearly identify stochastic sequencing errors. A BED file that contains the amplicon regions was also uploaded for the analysis. The NextGENe® software allows visualization of sequence reads in addition to summarizing the polymorphic sites in a tabular column. The NextGENe® software uses BLAST‐like Alignment Tool to align the sequence reads to the reference. Nonetheless, in some cases, manual inspection was needed to confirm certain variants and anomalies, especially regarding length heteroplasmy. It is known that the corrected version of the Cambridge Reference Sequence (the rCRS) retained the same nucleotide numbering scheme as the original reference sequence [the CRS; Anderson et al., 1981], and to correct one of the erroneous positions, an “N” was recorded at position 3,107 to represent a deletion at that position [Andrews et al., 1999]. The NextGENe® software lists it as 3,107d in any tested sample, but as this deletion does not represent a genuine sequence variant, it shall be ignored. Additionally, this software does not adhere to the nomenclature recommendations detailed by the forensic community [Carracedo et al., 2000; Bandelt and Parson 2008]. In particular, rather than reporting the common dinucleotide deletion in the dimeric repeat region between 514 and 524 as m.523A>del and m.524C>del (placing the deletion at the most 3′ position possible), NextGENe® employs a 5′ indel alignment and notes these variants as m.513G>del and m.514G>del. Furthermore, we noticed that the two sequential transitions that sometimes co‐occur at positions 151 and 152 were instead reported as a deletion at np 151 and an insertion at 152, following the recommendations by Wilson et al. (2000a, 2000b). With the mtDNA sequence variants detected, the haplogroups were inferred using the Web‐based bioinformatics tool, MitoTool [Fan and Yao, 2011], which uses the most recent version of PhyloTree (http://www.phylotree.org; Build 16) [van Oven and Kayser, 2009].

Results and Discussion

In recent years, DNA sequencing technology has evolved from traditional Sanger sequencing of single reads to sequencing thousands of reads with high coverage in a massively parallel fashion with MPS technologies such as for analyzing human mtDNA [Sosa et al., 2012, Parson et al. 2013, 2015; Davis et al, 2015]. Contrary to the large‐fragment sequencing strategy employed mostly so far for MPS analysis of whole mt genomes [Sosa et al., 2012; Parson et al., 2013], here we introduce an MPS approach for analyzing the entire human mtDNA genome using 161 pairs of short amplicons (144–230 bp, average 200 bp) in two different primer pools. We demonstrate the whole mt genome coverage of our approach via analyzing worldwide DNA samples belonging to different mtDNA haplogroups, perform quality control via comparison with data obtained with alternative methods in the same samples and provide preliminary data on sensitivity testing, degraded DNA testing using artificially and naturally degraded DNA, and multiple DNA sample testing using barcoded adapters; the DNA variants were determined using the NextGENe® v2.4.0.2 software.

Method Application to Worldwide Samples and Quality Comparison with Alternative Methods

To test the performance of the newly designed method for simultaneous whole mt genome sequencing using the Ion Torrent PGM, we applied it to 20 carefully selected DNA samples that belong to different global mtDNA haplogroups (Table 1). For these 20 samples, the average sequencing coverage ranged from 1,302 to 5,637 reads with an average of 3,293 reads across all samples. The total number of reads per sample ranged from 211,908 to 641,486, with an average across samples of 366,327, of which 88% were successfully aligned to the rCRS (325,349 reads). Using a coverage threshold of 50 reads, 100% mt genome coverage was achieved in all the samples except three (samples 14, 17, and 20), which each miss one small piece of mtDNA sequence with our MPS approach (Table 1). In sample 17, the 9‐bp deletion at np 8,281–8,289 (which is the defining mutation of haplogroup B, among others) was not detected by the NextGENe® software in the final variant table, elucidating the current limitation in the software package used. Further improvements in the bioinformatics pipelines are needed to deal with such data. A discreet manual inspection was done to confirm the deletion. In samples 14 and 20, fragments of 66 bp (np 10,468–10,533) and 61 bp (np 10,269–10,329), respectively, were missed when using the 50‐reads threshold applied. Figure 1 illustrates the average coverage of each of the 161 amplicons across all the 20 samples. The mtDNA variants detected for all the 20 samples analyzed are detailed in Supp. Table S2.

Table 1

Performance Summary of the 20 Geographically Diverse DNA Samples for Whole mt Genome Sequencing with the MPS Tiling Approach via 161 Short Overlapping Amplicons

Sample ID	Total reads	Aligned reads	Percent of aligned reads	Maximum coverage	Average coverage	Percent of coverage
1	641,486	535,003	83.40	67,847	5,593	100%
2	502,725	427,031	84.94	54,668	4,852	100%
3	268,055	224,250	83.66	68,677	4,494	100%
4	408,522	343,567	84.10	44,394	4,285	100%
5	355,691	305,619	85.92	50,539	3,804	100%
6	493,881	485,970	98	32,082	3,143	100%
7	289,696	258,720	89.31	43,687	2,607	100%
8	264,828	232,310	87.72	23,412	1,488	100%
9	264,231	204,023	77.21	47,124	3,777	100%
10	352,588	279,159	79.17	53,959	3,496	100%
11	476,939	426,050	89.33	66,262	5,637	100%
12	268,968	235,011	87.38	37,458	2,560	100%
13	249,338	198,790	79.73	46,631	1,840	100%
14	433,566	389,219	89.77	52,840	3,358	99.59%
15	541,295	532,291	98.34	44,460	3,488	100%
16	211,908	194,369	91.72	25,927	1,302	100%
17	307,441	278,527	90.60	30,230	1,865	100%a
18	324,954	299,098	92.04	48,897	2,006	100%
19	420,730	416,278	98.94	38,839	3,644	100%
20	249,701	241,699	96.80	37,051	2,628	99.62%
Average	366,327	325,349	88	45,749	3,293

The 9‐bp deletion at np 8,281–8,289 in sample 17 is the defining mutation of haplogroup B. Hence, the coverage for sample 17 was considered to be 100% despite the 9‐bp deletion.

Figure 1

The average amplicon coverage, across all the 20 samples tested with the newly developed MPS tiling approach for whole mt genome sequencing, are presented in three plots. A: Amplicons 1–53. B: Amplicons 54–106. C: Amplicons 107–161.

Performance Summary of the 20 Geographically Diverse DNA Samples for Whole mt Genome Sequencing with the MPS Tiling Approach via 161 Short Overlapping Amplicons The 9‐bp deletion at np 8,281–8,289 in sample 17 is the defining mutation of haplogroup B. Hence, the coverage for sample 17 was considered to be 100% despite the 9‐bp deletion. The average amplicon coverage, across all the 20 samples tested with the newly developed MPS tiling approach for whole mt genome sequencing, are presented in three plots. A: Amplicons 1–53. B: Amplicons 54–106. C: Amplicons 107–161. Further, the whole mt genome data we generated via our MPS approach from the samples belonging to the CEPH–HGDP panel (samples 1–11) were compared with those previously reported by Hartmann et al. (2009) based on whole mt genome sequencing using the Sanger approach. The remaining nine samples (12–20) we analyzed via MPS were subjected to de novo Sanger sequencing of the control region (unpublished study, data not shown). Some of the positions in the coding region were further compared with the results previously obtained from the five SNaPshot multiplex assays established earlier (data not shown) [Chaitanya et al., 2014]. The data obtained via our PGM method were then compared with the data generated by alternative methods for quality control purposes. The 9‐bp deletion at np 8,281–8,289 in sample 17 was not tabulated in the variant table generated by NextGENe®. However, this deletion was evident from results obtained with the SNaPshot multiplex assay and also apparent in the NextGENe® viewer. Several mis‐calls by NextGENe® were revised after a meticulous manual inspection of the sequence in the NextGENe® viewer. Disparities observed between the Sanger sequencing/SNaPshot and MPS are tabulated in Supp. Table S3. Some of the discrepancies were observed in calling of the variants involving length heteroplasmy in the polycytosine stretches of the HVS‐I and HVS‐II. In samples 1, 6, 11, and 17, due to an erroneous shift in the bases, the variant was reported as m.16183A>M (an apparent heteroplasmy), instead of m.16183A>C (a homoplasmic transversion) (Supp. Fig. S1A). In the HVS‐II region, for samples 8, 13, 16, 19, and 20, only one insertion of the base C in the np 303–309 region was reported with Sanger sequencing. However, with MPS, two insertions of the base C were noted (Supp. Fig. S1B). Such differences will in practice not have any influence on haplogroup determination since the insertions at np 309, 315, 16,193, AC indels at 515–524, m.16182A>C, m.16183A>C, and m.16519T>C are usually not considered for phylogenetic reconstruction and are therefore not included in the PhyloTree [van Oven and Kayser, 2009]. The variant reported as 357M in the samples 2, 7, 14, and 15 is specious and appears due to a shift in the base position, possibly because of the polyadenine stretches as shown in Supp. Figure S2. Similarly, in samples 8, 12, and 17, the variant m.13128C>M is spurious due to the shift in the base because of the polycytosine stretches as shown in Supp. Figure S3. Furthermore, spurious gaps were observed in the samples 8 and 12 at position 13,128 (Supp. Fig. S3 boxed region). Samples 9, 10, and 14 report an insertion 539.1C (Supp. Fig. S4), which was not evident in the Sanger sequencing data. It has been described earlier that the PGM produces a high frequency of homopolymer sequencing errors and indels [Loman et al., 2012; Seo et al., 2013]. In samples 2 and 4, a spurious deletion was noticed at position 5,824 that was not reported in the Sanger sequencing data of Hartmann et al. (2009) (Supp. Fig. S5). Further, differences to the Sanger sequencing were observed in sample 9 at position 16,209 and in sample 10 at position 204 (Supp. Table S3; Supp. Fig. S6). Spurious single‐bp deletions were observed at some sequence positions in some reads in few of the samples (Supp. Fig. S6 boxed region) but not reported in the variant table. Such gaps did not influence the final consensus sequence and therefore did not affect the final haplogroup determination. One potential disadvantage of using a tiling approach based on small amplicons over a large fragment amplification approach is the potential detection of nuclear copies of mt sequences (NUMTs) with the tiling approach that likely are not detected with the long‐range approach simply because NUMTs typically are much smaller. To test for this, we generated comparative whole mt genome sequencing data using the previously described two overlapping 8.5‐kb amplification method [Fendt et al., 2009] in five of the samples (samples 1, 2, 3, 4, and 20) we used for PGM sequencing. It is noteworthy that the variants detected were in concordance with those detected when sequenced with the short overlapping fragments, except in sample 2. In sample 2, the polymorphisms m.10664C>Y, m.8251G>R, and m.8252C>M (Supp. Fig. S7; Table 2) were discordant with the tiling approach (m.10664C>T, m.8251G>A, and no variant at position 8,252, respectively). The erroneous calling at positions 8,251 and 8,252 could be due to the base position shifts as seen in Supp. Figure S7. The disparities observed in the HVS‐I and HVS‐II due to the length heteroplasmy were also noticed with the long‐range amplification. Hence, from the samples analyzed, we have no evidence that our tiling MPS approach picks up NUMTs.

Table 2

Differences Between the Previously Developed Long‐Range Amplification MPS Approach and the Newly Introduced MPS Tiling Approach for Whole mt Genome Analysis Both Obtained via the PGM in Sample 2

		NGS data from NextGENe® software
		%A	%C	%G	%T	%Insertions	%Deletions
Long range	m.10664C>Y	0	17.46	0	78.31	0	4.23
Tiling	m.10664C>T	0	0.34	0	99.35	0	0.3
Long range	m.8251G>R	71.43	0	28.57	0	0	0
	m.8252C>M	29.37	70.63	0	0	2.1	0
Tiling	m.8251G>A	97.27	0	2.1	0	0	0.63

Human mitochondrial genome, rCRS (GenBank NC_012920.1).

Differences Between the Previously Developed Long‐Range Amplification MPS Approach and the Newly Introduced MPS Tiling Approach for Whole mt Genome Analysis Both Obtained via the PGM in Sample 2 Human mitochondrial genome, rCRS (GenBank NC_012920.1). It has been described earlier [Parson et al., 2013; Seo et al., 2015] that false deletions were observed in the mtDNA sequencing data generated using the PGM. However, no such deletions were observed when the variants reported with our tool and the long‐range PCR approach was compared. Except for the discordant calls in sample 2, no other differences in variant calling were observed. Nonetheless, this may change when more samples are tested, as in this study only a few samples have been tested. Point heteroplasmies at 22 positions in 14 samples were reported in the NextGENe® variant table: sample 1: m.8155G>R; sample 2: m.1048C>Y, m.357A>M; sample 3: m.14423G>S; sample 4: m.4788G>R, m.14921G>R; sample 5: m.1171A>R, m.13020T>Y, m.15326A>R, m.16189T>Y; sample 6: m.7759T>Y; sample 7: m.771A>R, m.4248T>Y, m.357A>M; sample 8: m.3296T>Y, m.13128C>M; sample 9: m.10185C>Y, m.15326A>R, m.16209T>Y; sample 10: m.204T>Y, m.13928G>S, m.15326A>R, m.14831G>R; sample 12: m.13128C>M; sample 14: m.357A>M; sample 15: m.357A>M; and sample 17: m.13128C>M. Heteroplasmy threshold was set at 20% (of total coverage) [Parson et al., 2013]. Out of the 22, only seven of the point heteroplasmies (sample 1: m.8155G>R; sample 3: m.14423G>S; sample 4: m.4788G>R, m.14921G>R; sample 5: m.1171A>R, m.16189T>Y; sample 6: m.7759T>Y) were also observed in the Sanger sequencing data of Hartmann et al. (2009). A heteroplasmy at position 3,296 was not reported by Hartmann et al. (2009), contrary to the m.3296T>Y from this study. However, in a recent publication, wherein some samples from the CEPH‐HGDP panel were resequenced on the Illumina platform, a variant at 3,296 was recorded for the sample 8 as m.3296T>N [Lippold et al., 2014], suggesting that this heteroplasmy could be genuine. In general, the availability of a viewer in the NextGENe® software proved advantageous in manually evaluating the discrepancies and solving them accordingly, especially in assessing the true heteroplasmy.

mtDNA Haplogroup Assignment and Maternal Ancestry Inference

The haplogroups for the 20 samples were inferred from the obtained whole mtDNA genome data using MitoTool [Fan and Yao, 2011] and are presented in Table 3 together with the geographic sampling origin and the previously reported geographic region of haplogroup origin indicative of maternal biogeographic ancestry. All the haplogroups determined for the 20 samples were in agreement with their biogeographic origin. It is important to assert that mtDNA construes only the matrilineal ancestry information of an individual. To achieve comprehensive biogeographic ancestry inference from DNA, in addition to the global matrilineal biogeographic ancestry assignment, information about paternal ancestry using male‐specific Y‐chromosomal DNA (in the case of males) and from biparental ancestry using ancestry‐informative autosomal DNA markers have to be considered [Kayser and de Knijff, 2011].

Table 3

Haplogroups of the 20 DNA Samples Whole mt Genome Sequenced with the MPS Tiling Tool and Interpreted Using MitoTool

Sample ID	Haplogroup	Broad haplogroup	Known sampling region	Main geographic region of the (broad) haplogroup origin	References
1	L3d3a1a	L3*(xM,N)	South Africa	Africa, West Asia	Behar et al. (2008)
2	L0d1a1a (199 missing)	L0	South Africa	Southern Africa	Behar et al. (2008); Barbieri et al. (2013)
3	H1c	H	Russia (Caucasus)	West Eurasia, Northern Africa	Loogväli et al. (2004); Achilli et al. (2004); Roostalu et al. (2007)
4	U7a2	U7	Israel	West Eurasia, Central Asia, Southern Asia	Palanichamy et al. (2004); Brisighelli et al. (2009)
5	P1d1	P	New Guinea	Oceania (Papuan, Melanesian, and Australian Aborigines)	Friedlaender et al. (2007); Hudjashov et al. (2007)
6	V	HV*(xH)	Algeria	West Eurasia, Northern Africa	Achilli et al. (2005); Álvarez‐Iglesias et al. (2009)
7	A1a1 (missing 235)	A*(xA2)	China	East Asia	Kong et al. (2006); Derenko et al. (2007)
8	J2b1a	J	Italy	West Eurasia	Pala et al. (2012)
9	M7c1a2a	M*(xM1,C,D)	China	South Asia, East Asia, Southeast Asia	Kong et al., 2006; Derenko et al. (2007)
10	F4a1a	R9	China	East Asia, Southeast Asia	Kong et al. (2006)
11	X2b5	X	Orkney Islands	West Eurasia, Northern Africa, Americas	Reidla et al. (2003); Achilli et al. (2008)
12	J2b1(J2b1a1: missing16278)	J	Italy	West Eurasia	Pala et al. (2012)
13	P	P	Australia (Aborigine)	Oceania (Papuan, Melanesian, and Australian Aborigines)	Friedlaender et al. (2007); Hudjashov et al. (2007)
14	M72a	M*(xM1,C,D)	Thailand	South Asia, East Asia, Southeast Asia	Tabbada et al. (2010); Peng et al. (2010)
15	T2b	T	Italy	West Eurasia	Pala et al. (2012)
16	K1a12	U	The Netherlands	West Eurasia	Achilli et al. (2005); Behar et al. (2006)
17	B5a1a	B5	The Netherlands	East Asia	Kong et al. (2006)
18	W5a1a	W	The Netherlands	West Eurasia	Finnilä et al. (2001); Palanichamy et al. (2004)
19	V3c	HV*(xH)	The Netherlands	West Eurasia, Northern Africa	Achilli et al. (2005); Álvarez‐Iglesias et al. (2009)
20	H23	H	The Netherlands	West Eurasia, Northern Africa	Loogväli et al. (2004); Achilli et al. (2004); Roostalu et al. (2007)

Haplogroups of the 20 DNA Samples Whole mt Genome Sequenced with the MPS Tiling Tool and Interpreted Using MitoTool

Preliminary Sensitivity Testing

In order to determine the limit of detection provided the 50 reads coverage threshold used and given the sensitivity of the developed PGM assay, preliminary sensitivity tests were performed with differing starting amounts of DNA at 100, 250, and 500pg, and 1 and 10 ng (measured as genomic DNA). For this, we used three of the 20 DNA samples originating from different populations with different haplogroups—South Africa with haplogroup L3d3a1a (sample 1); Italy with T2b (sample 15); and The Netherlands with V3c (sample 19). The results of the sensitivity study on the samples are summarized in Supp. Table S4. For all the sample dilutions, it was possible to detect the correct haplogroup down to merely 100 pg input genomic DNA, even though the input manufacturer's recommendation for library construction is 10 ng. Regarding whole mt genome coverage, we achieved 100% for all sample dilutions, except for sample 19 at 250 pg (99.89% at 576 reads on average), and for all three samples at 100 pg (99.93% at 2,591 average reads, 99.91% at 1,175 average reads, and 99.74% at 462 average reads for samples 1, 15, and 19, respectively). Notably, samples were further diluted to 50 pg input, but could not proceed to sequencing as the template dilution factors calculated were below one, whereas one is recommended by the manufacturer, thus implying that the amount of DNA library generated from such low quantity DNA samples was not sufficient to perform optimal emulsion PCR [Ralf et al., 2015]. Previous studies have shown that it is possible to achieve successful sequencing results with the Sanger protocol using 50 pg of input DNA, and sometimes even at 10 pg of input DNA with minimal failures [Lyons et al., 2013; Just et al., 2014]. However, to achieve full mtDNA coverage at a high resolution with the Sanger approach, several individual sequences are required, as opposed to the parallel nature of the MPS where a high volume of data is generated in relatively short time. Though it was possible to achieve correct haplogroup information at 100 pg, increasing the number of PCR cycles could result in good coverage with input below 100 pg of DNA. Hence, further testing needs to be done to show the full‐sensitivity limits of this system.

Preliminary Analysis of Experimentally and Naturally Degraded DNA Samples

The robustness of our MPS assay regarding DNA degradation was tested by sequencing experimentally and naturally degraded DNA samples. Aliquots of a sample (1 ng genomic DNA), belonging to haplogroup M35a1, was subjected to DNase treatment at different time intervals: 5, 10, 15, 20, 30, and 45 min. The effect of DNA degradation in these samples was first monitored by analyzing them with the AmpFlSTR® Identifiler® PCR Amplification Kit (Applied Biosystems) targeting 15 autosomal STRs plus the amelogenin sex typing system, which is routinely used for human identification purposes (Fig. 2). From Figure 2, it is evident that the DNA degradation‐induced STR locus dropouts are clearly correlated with PCR fragment size, as expected. Particularly, at 5 min of DNase treatment, there were complete locus dropouts at three STRs CSF1PO (allelic fragment length 330 bp), FGA (280 bp), and D2S1338 (310 bp); at 10 min, additional dropout at D7S820 (271 bp), D16S539 (280 bp), and D18S51 (270 bp); at 15 min, additional dropouts at TPOX (238 bp) and D13S317 (222 bp); at 20 min, additional dropout at D21S11 (206 bp), TH01 (172 bp), and vWA (178 bp); at 30 min, additional dropout at D5S818 (154 bp), D8S1179 (146 bp), and D19S433 (128 bp). After 45 min of DNase treatment, all of the 16 loci covering allelic fragment sizes of about 128–330 bp dropped out, except the two loci with shortest alleles, amelogenin (108 bp) and D3S1358 (116 bp). This analysis provides a rough idea on DNA fragmentation in these experimentally degraded samples as relevant for the subsequent mtDNA genome sequencing using our MPS tiling approach.

Figure 2

STR profiles from the AmpFlSTR® Identifiler® PCR Amplification Kit (Applied Biosystems) targeting 15 autosomal STRs plus amelogenin to illustrate the degree of DNA fragmentation for the sample treated with DNase at different time intervals: 5, 10, 15, 20, 30, and 45 min. Table 4 enlists the performance summary of our whole mt genome tiling MPS approach in the DNase‐treated samples at different time intervals. Most notable, for the samples degraded for 5, 10, and 15 min, we obtained 100% mt genome coverage using a 50x threshold, at 6,810, 3,886, and 3,007 average reads, respectively. Prior STR analysis showed that eight loci with allelic fragment length from 222 to 330 bp already dropped out in these three degraded samples. For samples degraded for 20, 30, and 45 min, the coverage dropped down to 94.4%, 82.6%, and 19.25%, respectively, at 2,073, 1,183, and 487 average reads, respectively. Regarding the total number of mtDNA variants detected in these samples, after 5 min of DNase treatment, it was 37 as well as after 10 and 15 min where we obtained 100% mt genome coverage, whereas it dropped down to 30, 24, and 13 in the samples that received DNase treatment for 20, 30, and 45 min, respectively. However, despite the loss of reads, fragments, and thus DNA variants, it was still possible to determine the correct mtDNA haplogroup (M35a1) in all degraded DNA samples from 5 to 45 min of DNase treatment. Figure 3 elucidates the average amplicon coverage across all amplicons for the different time intervals of enzymatic degradation clearly showing the loss of sequence reads with increased degradation time, as expected. Figure 4 illustrates the amplicon coverage according to amplicon length for the different time intervals of enzymatic degradation, clearly demonstrating the effect of DNA degradation on amplicon length and number of reads. It is to be noted that successful sequencing achieving 100% mt genome coverage was obtained from degraded DNA samples with a maximal fragment size of about 220 bp (Fig. 2).

Table 4

Performance Summary of PGM‐Based Whole mt Genome Sequencing of DNase‐Treated Sample at Different Time Intervals

		Total reads	Aligned reads	Percent of aligned reads	Maximum coverage	Average coverage	Percent of coverage	Haplogroup
DNase‐treated sample at different time intervals	5 min	463,854	452,505	97.55%	51,989	6,810	100%	M35a1
	10 min	242,109	234,156	96.72%	45,167	3,886	100%
	15 min	163,854	152,505	93.07%	38,249	3,007	100%
	20 min	60,988	59,085	96.88%	31,262	2,073	94.40%
	30 min	83,565	80,367	96.17%	10,731	1,183	82.60%
	45 min	15,642	13,606	86.98%	9,625	487	19.25%
Sample 15	UV for 30 min	258,179	255,277	98.88%	21,901	1,362	88.65%	T2b
	Enzymatic shearing	315,030	278,648	88.45%	5,827	975	89.90%
Ancient and degraded bone and teeth samples	Sample 1	542,889	525,140	96.73%	9,572	4,042.52	87.87%	U5b2b
	Sample 2	95,396	58,311	61.13%	2,379	225.57	50.36%	H4a1
	Sample 3	538,548	512,826	95.22%	7,273	3,321.23	90.12%	H
	Sample 4	529,929	480,856	90.74%	6,048	3,141.47	88.32%	T2b
	Sample 5	477,578	450,506	94.33%	8,218	2,924.11	59.59%	U4a2
	Sample 6	175,642	140,021	79.72%	3,535	612.48	75.25%	T1a

Sample 15 subjected to three different degradation methods and six different highly degraded bone and teeth samples.

Figure 3

The average mt DNA amplicon coverage across all amplicons for the DNase‐treated sample at the different time intervals as obtained with the MPS tiling approach.

Figure 4

A: The amplicon coverage (number of times amplicons were observed in the MPS data) of all the 161 amplicons used to obtain complete mt genome coverage with our MPS approach, arranged according to amplicon length from the shortest amplicon used (144 bp) on the left‐hand side to the largest amplicon used (230 bp) on the right‐hand side, for the different time intervals of enzymatic DNA degradation: 5, 10, 15, 20, 30, and 45 min. All the amplicons below X represents the coverage below the 50 reads threshold. B: Additional zoomed‐in image of the longer amplicons from amplicon lengths of 206 bp (left‐hand side) to 230 bp (right‐hand side).

Performance Summary of PGM‐Based Whole mt Genome Sequencing of DNase‐Treated Sample at Different Time Intervals Sample 15 subjected to three different degradation methods and six different highly degraded bone and teeth samples. The average mt DNA amplicon coverage across all amplicons for the DNase‐treated sample at the different time intervals as obtained with the MPS tiling approach. A: The amplicon coverage (number of times amplicons were observed in the MPS data) of all the 161 amplicons used to obtain complete mt genome coverage with our MPS approach, arranged according to amplicon length from the shortest amplicon used (144 bp) on the left‐hand side to the largest amplicon used (230 bp) on the right‐hand side, for the different time intervals of enzymatic DNA degradation: 5, 10, 15, 20, 30, and 45 min. All the amplicons below X represents the coverage below the 50 reads threshold. B: Additional zoomed‐in image of the longer amplicons from amplicon lengths of 206 bp (left‐hand side) to 230 bp (right‐hand side). To test two additional experimental degradation methods, aliquots of 1 ng genomic DNA of sample 15, belonging to haplogroup T2b, were exposed to UV radiation for 30 min using a Bio‐Link (Vilber Lourmat) at a strength of 50 J/cm2 and enzymatic shearing using the Ion Shear™ Plus Reagents Kit (Life Technologies, a part of Thermo Fisher Scientific Inc.). The performance summary of the samples is depicted in Table 4. At 1 ng unexposed DNA, the total number of mtDNA variants detected was 41 at 100% mt genome coverage (1,865 reads on average); when the sample was exposed to UV for 30 min, only 38 of the 41 variants were detected (missed variants were m.7310T>C, m.8697G>A, and m.10463T>C) with a mt genome coverage of 88.65% at 1,362 reads on average. At position 13,368, the variant was reported as m.13368G>R (49.36% A and 50.61% G), whereas the true variant was m.13368G>A. In contrast, all the 41 variants were correctly detected when the sample was subjected to enzymatic shearing, even though only 89.9% mt genome coverage was obtained at 975 reads on average. The variant m.357A>M, which was explained earlier as a sequencing error probably because of a base shift position due to the poly‐A stretch, was detected in both degradation approaches. However, both degradation methods allowed our MPS approach to determine the correct haplogroup of sample 15 (i.e., T2b). Additionally, DNA extracts from six human remains (i.e., teeth and bones) were sequenced with our MPS tiling approach to investigate naturally degraded DNA (Table 4). Notably, samples 1, 3, 4, and 5 delivered a complete 16 loci (15 STRs plus amelogenin) DNA profile with the AmpFℓSTR® NGM™ Kit (Thermo Fisher Scientific) regularly used for human identification, whereas samples 2 and 6 delivered only a partial NGM profile with 12 and 10 of the 16 loci missing, respectively. With our MPS approach, we obtained mt genome coverage of close to 90% for samples 1, 3, and 4, whereas for samples 6, 5, and 2, our method delivered 75%, 60%, and 50% of mt genome coverage (Table 4). Notably, sample 2 with the lowest observed mt genome coverage of 50% (225 reads on average) likely originates from the XV–XVI century. For samples 4, 5, and 6, mtDNA control region HVI and HVII data were obtained successfully via Sanger sequencing and compared with those of the tiling MPS approach. Sanger sequencing of these samples was performed as part of another unpublished study and hence the protocol was not described here. These data were comparable, except that the variant m.16519T>C, which was clearly evident in the tiling MPS approach, but was not reported in the Sanger sequencing data for all the three samples. However, Sanger sequencing mtDNA control region data were not available for samples 1, 2, and 3. Although mt genome coverage was only obtained to the degree of 50%–90% from these six naturally degraded DNA samples, our MPS tiling approach allowed haplogroup assignment for all of them: U5b2b, H4a1, H, T2b, U4a2, and T1a for samples 1–6, respectively. The AmpliSeq‐based MPS approach we developed here for complete human mt genome analysis is designed for and suited to nondegraded and mildly to considerably degraded DNA leading to DNA fragmentation down to a size range of around 200 bp and larger, as often confronted with in forensic, medical, and anthropological studies. Our approach is not suitable for more severely degraded DNA as often confronted with in ancient DNA studies. For such strongly fragmented DNA, hybridization capture methods are more suitable than AmpliSeq‐based approaches and have been developed in the field of ancient DNA research [Noonan et al., 2005; Anderung et al., 2008; Briggs et al., 2009; Templeton et al., 2013]. The AmpliSeq system we used in our approach is optimized for this kind of multiplexed amplification and the workflow requires fewer steps, which reduces the chance of contamination and mix‐up. Additionally, more targets are generated with the Ampliseq PCR system than with a hybridization capture‐based system, allowing for deeper coverage, mitigating sequencing errors.

Preliminary Testing of Sample Multiplexing via DNA Barcoding

In order to test the feasibility of sequencing multiple samples simultaneously in one sequencing run, barcode adapters were used. Six DNA samples were pooled (12 libraries, as two primer pools were needed) per each sequencing run and barcoded, using the Ion Xpress™ Barcode Adapters (Life Technologies, a part of Thermo Fisher Scientific Inc.). The ability to multiplex six samples via barcoding is a valuable asset with regard to the increase in throughput and cost‐effectiveness. Further research should involve sequencing more than six samples in a single run on one chip.

Conclusion

Our study shows that analyzing the complete human mt genome in a simultaneous way via a tiling approach targeting short amplicons is feasible. With the short overlapping fragments we employed in covering the entire mt genome, and supported by our preliminary data on experimentally and naturally degraded DNA samples, we expect our approach to be particularly useful for analyzing degraded materials. As this was a preliminary study introducing the new method, future studies with a larger sample set including more degraded samples need to be conducted to explore the complete value of this tool. However, based on the preliminary data presented here, we already expect our method to be highly useful in many mtDNA applications using degraded and nondegraded DNA where maternal lineage identification and maternal ancestry inference on the maximum possible resolution level is appreciated such as in forensic, anthropological, and medical genetics. In the longer run, we envision this MPS tool for complete mt genome analysis allowing maximal maternal lineage determination and maternal ancestry inference being combined with MPS tools for ultra‐high‐resolution paternal lineage and paternal ancestry identification, such as the PGM tool we recently introduced for simultaneous analysis of >530 Y‐chromosomal SNPs allowing to detect >430 worldwide Y haplogroups [Ralf et al., 2015]. Since with the current whole mt genome tool using 161 short overlapping amplicons (144–230 bp, average 200 bp), the sequence capacity limits of the PGM (or alternative devices such as MiSeq) are not reached, further enlargements of combined targeted MPS tools may be expected from future work. For instance, we envision the additional addition of autosomal ancestry‐informative SNPs into combined tools together with Y and mtDNA allowing to not only detect lineages (as possible with mtDNA and Y analysis) but moreover to resolve individual genetic admixture, and to obtain an overall and more extensive estimate of an individual's biogeographic ancestry from all three ancestry components: maternal, paternal, and biparental. For some applications, such as to answer forensic and anthropological questions, the additional incorporation of phenotypic markers into comprehensive ancestry tool(s) such as SNPs predictive for human pigmentation traits or for other externally visible characteristics if available [Kayser, 2015] would further add to the investigative value available with such comprehensive targeted MPS tools. Disclosure statement: The authors declare no conflict of interest. Supplementary Material Click here for additional data file.

72 in total

1. The molecular dissection of mtDNA haplogroup H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene pool.

Authors: Alessandro Achilli; Chiara Rengo; Chiara Magri; Vincenza Battaglia; Anna Olivieri; Rosaria Scozzari; Fulvio Cruciani; Massimo Zeviani; Egill Briem; Valerio Carelli; Pedro Moral; Jean-Michel Dugoujon; Urmas Roostalu; Eva-Liis Loogväli; Toomas Kivisild; Hans-Jürgen Bandelt; Martin Richards; Richard Villems; A Silvana Santachiara-Benerecetti; Ornella Semino; Antonio Torroni
Journal: Am J Hum Genet Date: 2004-09-20 Impact factor: 11.025

2. Performance comparison of benchtop high-throughput sequencing platforms.

Authors: Nicholas J Loman; Raju V Misra; Timothy J Dallman; Chrystala Constantinidou; Saheer E Gharbia; John Wain; Mark J Pallen
Journal: Nat Biotechnol Date: 2012-05 Impact factor: 54.908

3. Revealing the prehistoric settlement of Australia by Y chromosome and mtDNA analysis.

Authors: Georgi Hudjashov; Toomas Kivisild; Peter A Underhill; Phillip Endicott; Juan J Sanchez; Alice A Lin; Peidong Shen; Peter Oefner; Colin Renfrew; Richard Villems; Peter Forster
Journal: Proc Natl Acad Sci U S A Date: 2007-05-11 Impact factor: 11.205

4. Fishing for ancient DNA.

Authors: Cecilia Anderung; Per Persson; Abigail Bouwman; Rengert Elburg; Anders Götherström
Journal: Forensic Sci Int Genet Date: 2007-11-19 Impact factor: 4.882

Review 5. Ancient mitogenomics.

Authors: Simon Y W Ho; M Thomas P Gilbert
Journal: Mitochondrion Date: 2009-09-27 Impact factor: 4.160

6. Simultaneous analysis of hundreds of Y-chromosomal SNPs for high-resolution paternal lineage classification using targeted semiconductor sequencing.

Authors: Arwin Ralf; Mannis van Oven; Kaiyin Zhong; Manfred Kayser
Journal: Hum Mutat Date: 2014-11-27 Impact factor: 4.878

7. Sequence and organization of the human mitochondrial genome.

Authors: S Anderson; A T Bankier; B G Barrell; M H de Bruijn; A R Coulson; J Drouin; I C Eperon; D P Nierlich; B A Roe; F Sanger; P H Schreier; A J Smith; R Staden; I G Young
Journal: Nature Date: 1981-04-09 Impact factor: 49.962

8. Recommendations for consistent treatment of length variants in the human mitochondrial DNA control region.

Authors: Mark R Wilson; Marc W Allard; Keith Monson; Kevin W P Miller; Bruce Budowle
Journal: Forensic Sci Int Date: 2002-09-10 Impact factor: 2.395

9. Intraspecific phylogenetic analysis of Siberian woolly mammoths using complete mitochondrial genomes.

Authors: M Thomas P Gilbert; Daniela I Drautz; Arthur M Lesk; Simon Y W Ho; Ji Qi; Aakrosh Ratan; Chih-Hao Hsu; Andrei Sher; Love Dalén; Anders Götherström; Lynn P Tomsho; Snjezana Rendulic; Michael Packard; Paula F Campos; Tatyana V Kuznetsova; Fyodor Shidlovskiy; Alexei Tikhonov; Eske Willerslev; Paola Iacumin; Bernard Buigues; Per G P Ericson; Mietje Germonpré; Pavel Kosintsev; Vladimir Nikolaev; Malgosia Nowak-Kemp; James R Knight; Gerard P Irzyk; Clotilde S Perbost; Karin M Fredrikson; Timothy T Harkins; Sharon Sheridan; Webb Miller; Stephan C Schuster
Journal: Proc Natl Acad Sci U S A Date: 2008-06-09 Impact factor: 11.205

10. Clinal distribution of human genomic diversity across the Netherlands despite archaeological evidence for genetic discontinuities in Dutch population history.

Authors: Oscar Lao; Eveline Altena; Christian Becker; Silke Brauer; Thirsa Kraaijenbrink; Mannis van Oven; Peter Nürnberg; Peter de Knijff; Manfred Kayser
Journal: Investig Genet Date: 2013-05-20

10 in total

1. MPS analysis of the mtDNA hypervariable regions on the MiSeq with improved enrichment.

Authors: Mitchell M Holland; Laura A Wilson; Sarah Copeland; Gloria Dimick; Charity A Holland; Robert Bever; Jennifer A McElhoe
Journal: Int J Legal Med Date: 2017-01-11 Impact factor: 2.686

2. Massively parallel sequencing-enabled mixture analysis of mitochondrial DNA samples.

Authors: Jennifer D Churchill; Monika Stoljarova; Jonathan L King; Bruce Budowle
Journal: Int J Legal Med Date: 2018-02-22 Impact factor: 2.686

3. Applications of Probe Capture Enrichment Next Generation Sequencing for Whole Mitochondrial Genome and 426 Nuclear SNPs for Forensically Challenging Samples.

Authors: Shelly Y Shih; Nikhil Bose; Anna Beatriz R Gonçalves; Henry A Erlich; Cassandra D Calloway
Journal: Genes (Basel) Date: 2018-01-22 Impact factor: 4.096

4. Optimized mtDNA Control Region Primer Extension Capture Analysis for Forensically Relevant Samples and Highly Compromised mtDNA of Different Age and Origin.

Authors: Mayra Eduardoff; Catarina Xavier; Christina Strobl; Andrea Casas-Vargas; Walther Parson
Journal: Genes (Basel) Date: 2017-09-21 Impact factor: 4.096

5. MitoRS, a method for high throughput, sensitive, and accurate detection of mitochondrial DNA heteroplasmy.

Authors: Julien Marquis; Gregory Lefebvre; Yiannis A I Kourmpetis; Mohamed Kassam; Frédéric Ronga; Umberto De Marchi; Andreas Wiederkehr; Patrick Descombes
Journal: BMC Genomics Date: 2017-04-26 Impact factor: 3.969

6. Mitochondrial DNA in human identification: a review.

Authors: António Amorim; Teresa Fernandes; Nuno Taveira
Journal: PeerJ Date: 2019-08-13 Impact factor: 2.984

7. Contribution of sarcomere gene mutations to left atrial function in patients with hypertrophic cardiomyopathy.

Authors: Hyemoon Chung; Yoonjung Kim; Chul Hwan Park; In-Soo Kim; Jong-Youn Kim; Pil-Ki Min; Young Won Yoon; Tae Hoon Kim; Byoung Kwon Lee; Bum-Kee Hong; Se-Joong Rim; Hyuck Moon Kwon; Kyung-A Lee; Eui-Young Choi
Journal: Cardiovasc Ultrasound Date: 2021-01-06 Impact factor: 2.062

8. From Forensics to Clinical Research: Expanding the Variant Calling Pipeline for the Precision ID mtDNA Whole Genome Panel.

Authors: Filipe Cortes-Figueiredo; Filipa S Carvalho; Ana Catarina Fonseca; Friedemann Paul; José M Ferro; Sebastian Schönherr; Hansi Weissensteiner; Vanessa A Morais
Journal: Int J Mol Sci Date: 2021-11-06 Impact factor: 5.923

9. Mitochondrial Sequencing of Missing Persons DNA Casework by Implementing Thermo Fisher's Precision ID mtDNA Whole Genome Assay.

Authors: Daniela Cuenca; Jessica Battaglia; Michelle Halsing; Sandra Sheehan
Journal: Genes (Basel) Date: 2020-11-04 Impact factor: 4.096

10. Developmental Validation of a MPS Workflow with a PCR-Based Short Amplicon Whole Mitochondrial Genome Panel.

Authors: Jennifer Churchill Cihlar; Christina Amory; Robert Lagacé; Chantal Roth; Walther Parson; Bruce Budowle
Journal: Genes (Basel) Date: 2020-11-13 Impact factor: 4.096

10 in total