| Literature DB >> 31730669 |
N R Harvey1,2, C L Albury1, S Stuart1, M C Benton1, D A Eccles1, J R Connell1, H G Sutherland1, R J N Allcock3, R A Lea1, L M Haupt1, L R Griffiths1.
Abstract
The implementation and popularity of next generation sequencing (NGS) has led to the development of various rapid whole mitochondrial genome sequencing techniques. We summarise an efficient and cost-effective NGS approach for mitochondrial genomic DNA in humans using the Ion Torrent platform, and further discuss our bioinformatics pipeline for streamlined variant calling. Ion 316 chips were utilised with the Ion Torrent semi-conductor platform Personal Genome Machine (PGM) to perform tandem sequencing of mitochondrial genomes from the core pedigree (n = 315) of the Norfolk Island Health Study. Key improvements from commercial methods focus on the initial PCR step, which currently requires extensive optimisation to ensure the accurate and reproducible elongation of each section of the complete mitochondrial genome. Dual-platform barcodes were incorporated into our protocol thereby extending its potential application onto Illumina-based systems. Our bioinformatics pipeline consists of a modified version of GATK best practices tailored for mitochondrial genomic data. When compared with current commercial methods, our method, termed high throughput mitochondrial genome sequencing (HTMGS), allows high multiplexing of samples and the use of alternate library preparation reagents at a lower cost per sample (~1.7 times) when compared to current commercial methodologies. Our HTMGS methodology also provides robust mitochondrial sequencing data (>450X average coverage) that can be applied and modified to suit various study designs. On average, we were able to identify ~30 variants per sample with 572 variants observed across 315 samples. We have developed a high throughput sequencing and analysis method targeting complete mitochondrial genomes; with the potential to be platform agnostic with analysis options that adhere to current best practices.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31730669 PMCID: PMC6857855 DOI: 10.1371/journal.pone.0224847
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Quality metric comparison between suggested commercial sequencing quality and results from both HTMGS sequencing chips.
| Chip number | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| 65 | 82 | 77 | 80 | 80 | 61 | 78 | |
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | |
| 18 | 45 | 45 | 42 | 44 | 19 | 47 | |
| 3 | 23 | 22 | 11 | 15 | 4 | 15 | |
| 1 | 0 | 0 | 1 | 0 | 5 | 0 | |
| 3,182,960 | 2,190,789 | 2,040,115 | 2,550,966 | 2,415,427 | 2,839,445 | 2,206,387 | |
| 99.7 | 97 | 97 | 96 | 97 | 98 | 96 | |
| 126 | 159 | 185 | 182 | 180 | 125 | 147 | |
| 374Mb | 303Mb | 292Mb | 371Mb | 355Mb | 331Mb | 253Mb |
*This value is dependent on the overall sequencing quality for the rest of the measurements
Fig 1Summary of quality control metrics for the reads generated by the HTMGS protocol by MultiQC.
A) Phred quality scores remained consistent per base in read when considering size selection methods; B) sequence GC content followed a normal distribution. The green line represents ‘excellent’ quality reads whereas orange lines represent ‘medium’ read quality scores; C) read length distribution for all samples was between 150–250bp due to the size selection method. The orange lines represent ‘medium’ read quality scores.
Fig 2Median coverage density plot for all 315 HTMGS runs showing consistent median coverage between all sequencing chips.
Fig 3Evidence of correct variant calling in the HTMGS method via examination of mtDNA variant distribution and concordance.
The solar plot summarises the variants detected via the HTMGS protocol. Each dot represents a detected variant. The inner ring of the plot represents the mitochondrial genome and is coloured based on genomic region as summarised in the plot legend. The X-axis represents the mitochondrial base pair location with the Y-axis representing the variant frequency in the NI population from 0 (not present in the NI sample) to 1 (total concordance of the variant in the NI sample). The location of the data points in relation to the inner ring represents the allele count with those on the centre ring correlating to 1/315 in the NI pedigree whereas variants toward the outer ring varying and those touching the outer ring representing total concordance of the variant in the NI sample (315/315).
Commercial kit verses HTMGS method cost comparisons for each major step used to complete Ion Torrent PGM whole mitochondrial sequencing.
| Long range PCR | GoTaq long PCR Master Mix | $2.45 |
| Cleaning and pooling | Qiaquick spin columns | $1.94 |
| DNA 12000 Bioanalyser kit | $3.62 | |
| 50bp ladder | $0.16 | |
| Size selection | Egel (2% size select) | $3.85 |
| Library Preparation | Next fast library prep set for ion torrent | $16.80 |
| Bioo Scientific barcodes (96) | $3.34 | |
| AMPure XP beads | $7.86 | |
| DNA 1000 Bioanalyser kit | $3.62 | |
| Long range PCR | ||
| Cleaning and pooling | Qiaquick spin columns (100) | $1.94 |
| DNA 12000 Bioanalyser kit | $3.62 | |
| 50bp ladder | $0.16 | |
| Size Selection | Egel (2% size select) | $3.85 |
| Library Preparation | ||
| AMPure XP beads | $7.86 | |
| DNA 1000 | $3.62 | |
All costs shown are current as of November 2018. All materials and prices shown in bold differ from the HTMGS protocol and other reagents were assumed to be comparable.