| Literature DB >> 23948325 |
Walther Parson1, Christina Strobl, Gabriela Huber, Bettina Zimmermann, Sibylle M Gomes, Luis Souto, Liane Fendt, Rhena Delport, Reina Langit, Sharon Wootton, Robert Lagacé, Jodi Irwin.
Abstract
Insights into the human mitochondrial phylogeny have been primarily achieved by sequencing full mitochondrial genomes (mtGenomes). In forensic genetics (partial) mtGenome information can be used to assign haplotypes to their phylogenetic backgrounds, which may, in turn, have characteristic geographic distributions that would offer useful information in a forensic case. In addition and perhaps even more relevant in the forensic context, haplogroup-specific patterns of mutations form the basis for quality control of mtDNA sequences. The current method for establishing (partial) mtDNA haplotypes is Sanger-type sequencing (STS), which is laborious, time-consuming, and expensive. With the emergence of Next Generation Sequencing (NGS) technologies, the body of available mtDNA data can potentially be extended much more quickly and cost-efficiently. Customized chemistries, laboratory workflows and data analysis packages could support the community and increase the utility of mtDNA analysis in forensics. We have evaluated the performance of mtGenome sequencing using the Personal Genome Machine (PGM) and compared the resulting haplotypes directly with conventional Sanger-type sequencing. A total of 64mtGenomes (>1 million bases) were established that yielded high concordance with the corresponding STS haplotypes (<0.02% differences). About two-thirds of the differences were observed in or around homopolymeric sequence stretches. In addition, the sequence alignment algorithm employed to align NGS reads played a significant role in the analysis of the data and the resulting mtDNA haplotypes. Further development of alignment software would be desirable to facilitate the application of NGS in mtDNA forensic genetics.Entities:
Keywords: Forensic science; Heteroplasmy; Next Generation Sequencing; PGM; Sanger-type sequencing; mtDNA genomes
Mesh:
Substances:
Year: 2013 PMID: 23948325 PMCID: PMC3757157 DOI: 10.1016/j.fsigen.2013.06.003
Source DB: PubMed Journal: Forensic Sci Int Genet ISSN: 1872-4973 Impact factor: 4.882
Summary of observed differences in sequence outputs of STS and PGM mtGenome typing.
| Location | 100 bp chemistry | 200 bp chemistry | 200 bp chemistry_NextGENe | ||||||
|---|---|---|---|---|---|---|---|---|---|
| # of differences | False positives | False negatives | # of differences | False positives | False negatives | # of differences | False positives | False negatives | |
| HVS-1 (16183–16189) | 2 | 2 | 1 | 1 | 12 | 6 | 6 | ||
| HVS-1 (16190–16194) | 5 | 3 | 2 | 3 | 3 | 2 | 2 | ||
| HVS-2 (302–310) | 16 | 16 | 14 | 14 | 1 | 1 | |||
| HVS-2 (311–316) | 31 | 31 | 33 | 33 | 1 | 1 | |||
| HVS-3 (567–574) | 1 | 1 | 2 | 2 | 1 | 1 | |||
| AC-stretch | 0 | 0 | 2 | 2 | |||||
| Indels | 11 | 9 | 2 | 13 | 11 | 2 | 40 | 40 | |
| Substitutions | 29 | 26 | 3 | 15 | 1 | 14 | 6 | 2 | 4 |
| Pointheteroplasmy | 0 | 0 | 1 | 1 | |||||
| Total | 95 | 81 | 66 | ||||||
Summary of false positive and false negative substitutions between sequence outputs of STS and PGM mtGenome typing.
| Substitutions | ||
|---|---|---|
| False negatives | 100 bp chemistry | 200 bp chemistry |
| 10664T | WGS02 | WGS02 |
| 10664T | WGS04 | |
| 10664T | WGS05 | |
| 13651G | WGS34 | |
| 14374C | WGS01 | |
| 16166C | WGS01 | |
| 16172C | WGS01 | |
| 295T | WGS18 | |
| 456T | WGS15 | |
| 456T | WGS17 | |
| 456T | WGS23 | |
| 456T | WGS27 | |
| 493G | WGS34 | |
| 5442C | WGS03 | |
| 8251A | WGS02 | |
| 961C | WGS03 | |
| 17 | ||
Summary of point heteroplasmy reported with STS and PGM.
| Sample | STS | PGM – variant caller | |
|---|---|---|---|
| 100 bp chemistry | 200 bp chemistry | ||
| WGS01 | 6367Y | 6367C | 6367C |
| Var. Freq. = 25, cv = 260 | Var. Freq. = 22, cv = 245 | ||
| WGS03 | 966M | Not found | 966C |
| Var. Freq. = 66, cv = 161 | |||
| WGS05 | 204Y | Not found | nd |
| WGS09 | 8473Y | 8473C | nd |
| Var. Freq. = 41, cv = 79 | |||
| WGS11 | 16245Y | Not found | nd |
| WGS14 | 15623R | 15623A | 15623A |
| Var. Freq. = 19, cv = 643 | Var. Freq. = 26, cv = 689 | ||
| 16391R | 16391A | 16391A | |
| Var. Freq. = 48, cv = 519 | Var. Freq. = 39, cv = 593 | ||
| WGS15 | 9966R | 9966A | 9966A |
| Var. Freq. = 72, cv = 563 | Var. Freq. = 68, cv = 1248 | ||
| WGS18 | 152Y | 152C | 152C |
| Var. Freq. = 72, cv = 438 | Var. Freq. = 75, cv = 622 | ||
| 1578R | 1578R | 1578R | |
| Var. Freq. = 16, cv = 575 | Var. Freq. = 16, cv = 897 | ||
| WGS20 | 16201Y | 16201Y | 16201Y |
| Var. Freq. = 10, cv = 145 | Var. Freq. = 18, cv = 272 | ||
| WGS29 | 195Y | nd | 195C |
| Var. Freq. = 67, cv = 3743 | |||
| WGS36 | 8252Y | 8252T | 8252T |
| Var. Freq. = 31, cv = 216 | Var. Freq. = 44, cv = 371 | ||
| WGS42 | 234R | nd | 234G |
| Var. Freq. = 47, cv = 2762 | |||
nd: not determined.