| Literature DB >> 23861798 |
Xinzhong Li1, Andrew J Buckton, Samuel L Wilkinson, Shibu John, Roddy Walsh, Tomas Novotny, Iveta Valaskova, Manu Gupta, Laurence Game, Paul J R Barton, Stuart A Cook, James S Ware.
Abstract
BACKGROUND: Molecular genetic testing is recommended for diagnosis of inherited cardiac disease, to guide prognosis and treatment, but access is often limited by cost and availability. Recently introduced high-throughput bench-top DNA sequencing platforms have the potential to overcome these limitations. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2013 PMID: 23861798 PMCID: PMC3701544 DOI: 10.1371/journal.pone.0067744
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of six genes included in the assay.
| Gene symbol | Gene product | Genomic locus | 1Reference sequence | Number of exons | Coding cDNA length (bp) | Number of amplicons | Disease association |
|
| IKs channel, alpha subunit | 11p15.5 | LRG_285_t1 | 16 | 2015 | 30 | LQT1 |
|
| IKr channel, alpha subunit | 7q36.1 | LRG_288_t1 | 15 | 3465 | 54 | LQT2 |
|
| Nav1.5 channel, alpha subunit | 3p21 | LRG_289_t1 | 27 | 6024 | 78 | LQT3 |
|
| IKs channel, beta subunit | 21q22.12 | LRG_290_t1 | 1 | 389 | 4 | LQT5 |
|
| IKr channel, beta subunit | 21q22.12 | LRG_291_t1 | 1 | 371 | 5 | LQT6 |
|
| Sarcoplasmic reticulum calcium-induced calcium release | 1q43 | LRG_402_t1 | 104 | 14,785 | 215 | CPVT |
| Total | 164 | 27,049 | 386 |
LRG: locus reference genomic (http://www.lrg-sequence.org), LQT: Long QT syndrome, CPVT: catecholaminergic polymorphic ventricular tachycardia.
Comparison of bench-top NGS platforms.
| MiSeq | Ion Torrent PGM | |
| NGS runs | 1 | 3 |
| Template preparation | 1 hr | 3×5.5 hr |
| Run time | 27 hr | 3×3 hr |
| Barcodes | 15 (commercial) | 3×5 (custom) |
| Theoretical sequencing output | 1.5 Gb | 3×1.27 Gb |
| Actual sequencing output | 1.23 Gb | 1.00 Gb |
| Number of sequencing reads | 8.13 M | 6.56 M |
| Read length output | 151 | 150** |
| Paired-end reads | Yes | No |
| Instrument cost | $125k | $75k |
| Sequencing cost for assay | $959 | 3x$686 |
| Per specimen sequencing cost | $64 | $137 |
316 scale chip; ** average.
Sequencing and target capture performance metrics.
| Alignment | |||||||||||
| NGS Platform | Reads | Bases (Mb) | Mean read length | Q20 Bases | Mapped Reads | Reads On Design | Reads On Target | Depth On Target | Evenness | EF | Callable |
| MiSeq | 7757916 | 889 | 115 | 95.8% | 90.7% | 99.3% | 96.7% | 1529 | 68.1% | 110111 | 97.9% |
| PGM | 6133098 | 969 | 106 | 67.5% | 100% | 96.2% | 91.2% | 1231 | 78.8% | 104915 | 96.8% |
Mean read length after trimming primer sequences and low quality bases. ReadsOnDesign/ReadsOnTarget = percentage of reads mapping to amplicon design or protein-coding target region. EF = enrichment factor.
Figure 1Coverage of target genes.
a. The percentage of each gene that is captured and sequenced (at least one read) is shown for each platform (MiSeq in red, PGM in black), for 15 samples; Three genes were consistently fully sequenced. Coverage of KCNQ1 and KCNH2 was more variable: KCNQ1 and KCNE1 were fully covered in the best performing samples, while the best performance on KCNH2 covered >97% of the gene. b. Mean sequencing depth across each gene, for 15 samples. Quartiles are shown. There is significant intra- and inter- sample variability.
Sequencing coverage of each gene.
| MiSeq (%) | PGM (%) | |||||||||||
| 1x | 10x | 20x | 30x | 50x | 100x | 1x | 10x | 20x | 30x | 50x | 100x | |
|
| 98.83 | 88.77 | 75.2 | 69.44 | 67.64 | 65.5 | 100 | 100 | 100 | 100 | 100 | 99.81 |
|
| 100 | 96.42 | 88.93 | 85.98 | 85.98 | 85.98 | 100 | 100 | 100 | 100 | 100 | 100 |
|
| 94.07 | 87.66 | 85.67 | 84.3 | 83.22 | 81.79 | 88.86 | 77.91 | 75.63 | 74.51 | 72.95 | 70.59 |
|
| 96.22 | 92.86 | 89 | 86.42 | 79.56 | 74.03 | 93.14 | 86.45 | 82.68 | 81.42 | 80.92 | 77.39 |
|
| 100 | 100 | 99.98 | 99.69 | 98.72 | 95.96 | 100 | 100 | 99.96 | 99.9 | 99.75 | 99.13 |
|
| 99.69 | 97.2 | 95.16 | 93.7 | 90.69 | 85.78 | 99.87 | 99.67 | 99.65 | 99.61 | 99.38 | 98.86 |
|
| 98.77 | 96.14 | 94.19 | 92.83 | 90.3 | 86.37 | 97.99 | 95.98 | 95.39 | 95.12 | 94.72 | 93.73 |
The coverage of the protein-coding sequence of each gene of interest is tabulated, as a percentage, for a range of sequencing depths (≥1x, 10x, 20x, 30x, 50x and 100x reads).
Figure 2Coverage of KCNQ1 and KCNH2 for the two platforms.
Mean depth of coverage for 15 samples is shown for two genes on a log scale. Regions of no coverage therefore have negative values. The blue lines indicate local GC content (calculated with a 50 bp sliding window). Regions consistently missed have high GC content, with similar patterns for both platforms. KCNQ1 exons 1 & 8 and KCNH2 exons 1, 4 & 12 are difficult to sequence. A cartoon of the exon structure is shown beneath each panel. Plus (+) and minus (-) denote gene strand. Plots for all genes are shown in Supporting Information Figure S5. a.) MiSeq b.) Ion Torrent PGM.
Detection of coding variants for each NGS platform.
| Variants detected by dHPLC/Sanger | Variants detected by MiSeq | Variants missed by MiSeq | Variants detected by PGM | Variants missed by PGM | |||||
| Total | Concordant with dHPLC/Sanger | NGS only | Total | Concordant with dHPLC/Sanger | NGS only | ||||
| Group I(dHPLC) | 38 | 64 | 32 | 32 | 6 | 74 | 38 | 36 | 0 |
| Group II(Sanger) | 36 | 33 | 33 | 0 | 3 | 36 | 34 | 2 | 2 |
| Total | 74 | 97 | 65 | 32 | 9 | 110 | 72 | 38 | 2 |
Accuracy of variant calling for NGS platforms.
| Total coding variants | Variant sites interrogated | Variants detected | False positives | Variants missed | Sensitivity | Positive predictive value | |
|
| |||||||
| Group I | 71 | 60 | 60 | 4 | 0 | 100% | 93.8% |
| Group II | 36 | 33 | 33 | 0 | 0 | 100% | 100% |
|
|
|
|
|
|
|
|
|
|
| |||||||
| Group I | 71 | 71 | 71 | 3 | 0 | 100% | 96.0% |
| Group II | 36 | 35 | 34 | 2 | 1 | 97.1% | 94.4% |
|
|
|
|
|
|
|
|
|
Group I: dHPLC with Sanger confirmation; Group II: direct Sanger sequencing 95% confidence intervals are given for sensitivity and positive predictive value.