| Literature DB >> 26888179 |
Chee Jian Pua1, Jaydutt Bhalshankar1, Kui Miao2, Roddy Walsh3,4, Shibu John3,4, Shi Qi Lim1, Kingsley Chow1, Rachel Buchan3,4, Bee Yong Soh1, Pei Min Lio1, Jaclyn Lim1, Sebastian Schafer1, Jing Quan Lim5, Patrick Tan6,7, Nicola Whiffin3,4, Paul J Barton3,4, James S Ware4,8, Stuart A Cook9,10,11,12.
Abstract
Inherited cardiac conditions (ICCs) are characterised by marked genetic and allelic heterogeneity and require extensive sequencing for genetic characterisation. We iteratively optimised a targeted gene capture panel for ICCs that includes disease-causing, putatively pathogenic, research and phenocopy genes (n = 174 genes). We achieved high coverage of the target region on both MiSeq (>99.8% at ≥ 20× read depth, n = 12) and NextSeq (>99.9% at ≥ 20×, n = 48) platforms with 100% sensitivity and precision for single nucleotide variants and indels across the protein-coding target on the MiSeq. In the final assay, 40 out of 43 established ICC genes informative in clinical practice achieved complete coverage (100 % at ≥ 20×). By comparison, whole exome sequencing (WES; ∼ 80×), deep WES (∼ 500×) and whole genome sequencing (WGS; ∼ 70×) had poorer performance (88.1, 99.2 and 99.3% respectively at ≥ 20×) across the ICC target. The assay described here delivers highly accurate and affordable sequencing of ICC genes, complemented by accessible cloud-based computation and informatics. See Editorial in this issue (DOI: 10.1007/s12265-015-9667-8 ).Entities:
Keywords: Diagnostics; Genetics; Inherited cardiac conditions; Targeted sequencing; Whole exome sequencing; Whole genome sequencing
Mesh:
Substances:
Year: 2016 PMID: 26888179 PMCID: PMC4767849 DOI: 10.1007/s12265-016-9673-5
Source DB: PubMed Journal: J Cardiovasc Transl Res ISSN: 1937-5387 Impact factor: 4.132
ICC disease genes (n = 43) categorised by primary disease association and regions not covered at 20× read depth using ICCv2 and NextSeq 500 sequencing
| Cardiac diseases | Core genes | Gene description | Mean callability at 20× coverage (95 % CI) | Base pairs (bp) with <20× read depth |
|---|---|---|---|---|
| Aortopathies | ACTA2 | Actin, alpha 2, smooth muscle, aorta | 100 (100–100) | 0 |
| COL3A1 | Collagen, type III, alpha 1 | 100 (100–100) | 0 | |
| FBN1 | Fibrillin 1 | 100 (100–100) | 0 | |
| MYH11 | Myosin, heavy chain 11, smooth muscle | 100 (100–100) | 0 | |
| TGFB2 | Transforming growth factor, beta 2 | 100 (100–100) | 0 | |
| TGFBR1 | Transforming growth factor, beta receptor 1 | 98.0 (97.8–98.2) | 97 | |
| TGFBR2 | Transforming growth factor, beta receptor II (70/80 kda) | 100 (100–100) | 0 | |
| Arrhythmogenic right ventricular cardiomyopathy (ARVC) | DSC2 | Desmocollin 2 | 100 (100–100) | 0 |
| DSG2 | Desmoglein 2 | 100 (100–100) | 0 | |
| DSP | Desmoplakin | 100 (100–100) | 0 | |
| JUP | Junction plakoglobin | 100 (100–100) | 0 | |
| PKP2 | Plakophilin 2 | 100 (100–100) | 0 | |
| Brugada syndrome (BrS) | SCN5A | Sodium channel, voltage-gated, type V, alpha subunit | 100 (100–100) | 0 |
| Catecholaminergic polymorphic ventricular tachycardia (CPVT) | CASQ2 | Calsequestrin 2 (cardiac muscle) | 100 (100–100) | 0 |
| RYR2 | Ryanodine receptor 2 (cardiac) | 100 (100–100) | 0 | |
| Dilated cardiomyopathy (DCM) | DES | Desmin | 100 (100–100) | 0 |
| LMNA | Lamin A/C | 100 (100–100) | 0 | |
| MYBPC3 | Myosin-binding protein C, cardiac | 100 (100–100) | 0 | |
| MYH7 | Myosin, heavy chain 7, cardiac muscle, beta | 100 (99.9–100) | 160 | |
| RBM20 | RNA binding motif protein 20 | 100 (100–100) | 0 | |
| TNNI3 | Troponin I type 3 (cardiac) | 100 (100–100) | 0 | |
| TNNT2 | Troponin T type 2 (cardiac) | 100 (100–100) | 0 | |
| TPM1 | Tropomyosin 1 (alpha) | 100 (100–100) | 0 | |
| TTN | Titin | 99.7 (99.7–99.8) | 1569 | |
| Familial hypercholesterolaemia (FH) | APOB | Apolipoprotein B (including Ag(x) antigen) | 100 (100–100) | 0 |
| LDLR | Low-density lipoprotein receptor | 100 (100–100) | 0 | |
| PCSK9 | Proprotein convertase subtilisin/kexin type 9 | 100 (100–100) | 0 | |
| Hypertrophic cardiomyopathy (HCM) | ACTC1 | Actin, alpha, cardiac muscle 1 | 100 (100–100) | 0 |
| CSRP3 | Cysteine and glycine-rich protein 3 (cardiac LIM protein) | 100 (100–100) | 0 | |
| MYBPC3 | Myosin-binding protein C, cardiac | 100 (100–100) | 0 | |
| MYH7 | Myosin, heavy chain 7, cardiac muscle, beta | 100 (99.9–100) | 160 | |
| MYL2 | Myosin, light chain 2, regulatory, cardiac, slow | 100 (100–100) | 0 | |
| MYL3 | Myosin, light chain 3, alkali; ventricular, skeletal, slow | 100 (100–100) | 0 | |
| TNNI3 | Troponin I type 3 (cardiac) | 100 (100–100) | 0 | |
| TNNT2 | Troponin T type 2 (cardiac) | 100 (100–100) | 0 | |
| TPM1 | Tropomyosin 1 (alpha) | 100 (100–100) | 0 | |
| Long QT syndrome (LQTS) | KCNE1 | Potassium voltage-gated channel, Isk-related family, member 1 | 100 (100–100) | 0 |
| KCNE2 | Potassium voltage-gated channel, Isk-related family, member 2 | 100 (100–100) | 0 | |
| KCNH2 | Potassium voltage-gated channel, subfamily H (eag-related), member 2 | 100 (100–100) | 0 | |
| KCNJ2 | Potassium inwardly rectifying channel, subfamily J, member 2 | 100 (100–100) | 0 | |
| KCNQ1 | Potassium voltage-gated channel, KQT-like subfamily, member 1 | 100 (100–100) | 0 | |
| SCN5A | Sodium channel, voltage-gated, type V, alpha subunit | 100 (100–100) | 0 | |
| Noonan syndrome (NS) | KRAS | V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog | 100 (100–100) | 0 |
| PTPN11 | Protein tyrosine phosphatase, non-receptor type 11 | 100 (100–100) | 0 | |
| RAF1 | V-raf-1 murine leukemia viral oncogene homolog 1 | 100 (100–100) | 0 | |
| SOS1 | Son of sevenless homolog 1 (Drosophila) | 100 (100–100) | 0 | |
| Phenocopy genes | GLA | Galactosidase, alpha | 100 (100–100) | 0 |
| LAMP2 | Lysosomal-associated membrane protein 2 | 100 (100–100) | 0 | |
| PRKAG2 | Protein kinase, AMP-activated, gamma 2 non-catalytic subunit | 100 (100–100) | 0 |
Genomic coordinates of regions with poor callability are given in Table S7
Comparison of quality metrics for ICCv2 (marketed as the TruSight Cardio Sequencing Kit) panel (M3, MiSeq; M4, NextSeq 500), WES, Deep WES and WGS
| Sequencing summary | Method 3 (M3) | Method 4 (M4) | WES | Deep WES | WGS |
|---|---|---|---|---|---|
| Nextera Rapid Capture kit | ICCv2 | ICCv2 | WES | WES | TruSeq Nano DNA |
| Sequencer | MiSeq | NextSeq 500 | HiSeq 2500 | HiSeq 2500 | HiSeq X |
| Sequencing reagent kit | MiSeq v2, 300 cycles | Mid Output v2, 300 cycles | SBS v4, 250 cycles | SBS v4, 250 cycles | V2.5, 300 cycles |
| Samples per lane | 12 | 48 | 12 | 2 | 0.5 |
| Average output per sample (GB) | 0.5 | 1.2 | 5.4 | 43.7 | 200 |
| Mean read depth of ICC target (95 % CI) | 329× (317×–342×) | 578× (568×–587×) | 74× (71×–78×) | 522× | 69.4× (65.4×–73.5×) |
| Mean ICC bases ≥20× (%) (95 % CI) | 99.8 (99.8–99.9) | 99.9 (99.9–99.9) | 88.1 (87.3–88.9) | 99.2 | 99.3 (99.2–99.5) |
| Targeted enrichment and sequencing cost per sample (USD) | 200 | 200 | 900 | 5400 | 2800 |
| Library preparation and sequencing time per run (days) | 4 | 4 | 9 | 9 | 4 |
A full comparison of methods 1–4 using ICCv1 and ICCv2 panels is shown in Table S4
Fig. 1Stringent heat map showing the percentage coverage of ICC disease genes commonly used to inform clinical practice (n = 43) at 20× read depth using M3 (MiSeq, 150 bp PE), M4 (NextSeq 500, 150 bp PE), deep whole exome sequencing (WES; HiSeq 2500, 125 bp PE), WES (HiSeq, 125 bp PE) and whole genome sequencing (WGS: HiSeq X, 150 bp PE) (gene coverage at 20×: dark red ≤98 %; dark green = 100 %)
Fig. 2Percentage coverage of all TTN exons (ENST00000589042/NM_001267550.1) at 20× read depth across methods (top four panels). Mappability score (score* [28]) and GC content in the TTN gene (bottom two panels). Error bars represent standard deviation
Comparison of variant calls for M3 (MiSeq) and M4 (NextSeq) sequencing of the N12878 reference sample with the Genome in a Bottle high confidence variant call set
| Sequencer | Variant type | TP | FP | FN | TN | Sensitivity (%)a | Precision (%)b | MCC (%)c |
|---|---|---|---|---|---|---|---|---|
| MiSeq | All | 249 | 0 | 1 | 522509 | 99.6 | 100 | 99.8 |
| MiSeq | SNVs | 245 | 0 | 0 | 522518 | 100 | 100 | 100 |
| MiSeq | Indels | 4 | 0 | 1 | 522754 | 0.80 | 100 | 89.4 |
| NextSeq | All | 249 | 1 | 1 | 522508 | 99.6 | 99.6 | 99.6 |
| NextSeq | SNVs | 245 | 1 | 0 | 522517 | 100 | 99.6 | 99.8 |
| NextSeq | Indels | 4 | 0 | 1 | 522754 | 0.80 | 100 | 89.4 |
Analysis was done over a 522,763 bp region corresponding to protein-coding region ±8 bps that overlaps with the GIAB high confidence regions
TP true positive, FP false positive, FN false negative, MCC Matthews correlation coefficient
aSensitivity = TP / (TP + FN)
bPrecision = TP / (TP + FP)
cMCC = (TP × TN) − (FP × FN) / √[(TP + FP)(TP + FN)(TN + FP)(TN + FN)]