| Literature DB >> 35882841 |
Mallory J Owen1,2, Sebastien Lefebvre3, Christian Hansen1,2, Chris M Kunard4, David P Dimmock1,2,5, Laurie D Smith1, Gunter Scharer1, Rebecca Mardach2,6, Mary J Willis1, Annette Feigenbaum2,6, Anna-Kaisa Niemi2,6, Yan Ding1,2, Luca Van Der Kraan1,2, Katarzyna Ellsworth1,2, Lucia Guidugli1,2, Bryan R Lajoie4, Timothy K McPhail4, Shyamal S Mehtalia4, Kevin K Chau1,2, Yong H Kwon1,2, Zhanyang Zhu1,2, Sergey Batalov1,2, Shimul Chowdhury1,2,5, Seema Rego1,2, James Perry2,6, Mark Speziale2,6, Mark Nespeca2,6,7, Meredith S Wright1,2,5, Martin G Reese8, Francisco M De La Vega8, Joe Azure8, Erwin Frise8, Charlene Son Rigby8, Sandy White8, Charlotte A Hobbs1,2,6, Sheldon Gilmer2, Gail Knight2,6, Albert Oriol1,2, Jerica Lenberg1,2,5, Shareef A Nahas1,2, Kate Perofsky1,2,6, Kyu Kim1,2,6, Jeanne Carroll1,2,6, Nicole G Coufal1,2,6, Erica Sanford1, Kristen Wigby1,2,6, Jacqueline Weir4, Vicki S Thomson4, Louise Fraser4, Seka S Lazare4, Yoon H Shin4, Haiying Grunenwald4, Richard Lee4, David Jones4, Duke Tran4, Andrew Gross4, Patrick Daigle4, Anne Case4, Marisa Lue4, James A Richardson4, John Reynders3, Thomas Defay3, Kevin P Hall4, Narayanan Veeraraghavan1,2, Stephen F Kingsmore9,10,11.
Abstract
While many genetic diseases have effective treatments, they frequently progress rapidly to severe morbidity or mortality if those treatments are not implemented immediately. Since front-line physicians frequently lack familiarity with these diseases, timely molecular diagnosis may not improve outcomes. Herein we describe Genome-to-Treatment, an automated, virtual system for genetic disease diagnosis and acute management guidance. Diagnosis is achieved in 13.5 h by expedited whole genome sequencing, with superior analytic performance for structural and copy number variants. An expert panel adjudicated the indications, contraindications, efficacy, and evidence-of-efficacy of 9911 drug, device, dietary, and surgical interventions for 563 severe, childhood, genetic diseases. The 421 (75%) diseases and 1527 (15%) effective interventions retained are integrated with 13 genetic disease information resources and appended to diagnostic reports ( https://gtrx.radygenomiclab.com ). This system provided correct diagnoses in four retrospectively and two prospectively tested infants. The Genome-to-Treatment system facilitates optimal outcomes in children with rapidly progressive genetic diseases.Entities:
Mesh:
Year: 2022 PMID: 35882841 PMCID: PMC9325884 DOI: 10.1038/s41467-022-31446-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Flow diagrams of the technological components of a 13.5-hour system for automated diagnosis and virtual acute management guidance of genetic diseases by rWGS.
Innovations described herein are indicated by orange boxes. A The order and duration of laboratory steps and technologies. EHR Electronic Health Record, EDTA EthyleneDiamineTetraAcetic acid, gDNA genomic DeoxyriboNucleic Acid, PCR Polymerase Chain Reaction, QA Quality Assurance, nt Nucleotide, SNV Single Nucleotide Variant, indel insertion-deletion nucleotide variant, SV Structural Variant, CNV Copy Number Variant, GTRx Genome-to-Treatment. B Diagram of the information flow from order placement in the EHR to return of diagnostic results together with specific management guidance for that genetic disease. rWGS Portal: Custom software system for rWGS ordering, accessioning, chain-of-custody, and return of results (v.3.2). LIMS Custom laboratory information management system for rWGS, short tandem repeat profiling, confirmatory testing (Sanger sequencing and Multiplex Ligation-dependent Probe Amplification), and inventory management (L7 informatics). IR Information resource, *: HL7/FHIR or Continuity of Care Documents, †: JSON. ‡: bcl, □: vcf.
Analytic performance, reproducibility, and duration of the major steps in automated diagnosis of genetic diseases by accelerated rWGS.
| Sample | 362 | 12878 | NA24385 | AG928 | AG366 | AF414 | AI003 | AH638 | CSD59F | CSD709 | ||||||||
| Run | Ref. [ | 927 | 929 | 930 | 1018 | 1020 | 1204 | 1208 | 1218 | 1026 | 1027 | 477 | 480 | 478 | 479 | |||
| Sample & Run Type | DNA/Analytic performance | Blood/Retrospective | Blood/Prospective | |||||||||||||||
| Diagnosis (Gene) | None | None | None | MT-ATP6 | ||||||||||||||
| rWGS Methods | Ref [ | Herein | Herein | Here | Std | Here | Std | Here | Std | |||||||||
| SV & CNV ID Method | None | MC | MC | MC | D3.5 | D3.5 | D3.5 | |||||||||||
| Length of steps (min) | ||||||||||||||||||
| Sample Prep. Time | 151 | 50 | 45 | 41 | 50 | 74 | 71 | 69 | 67 | 80 | 1233 | 90 | 265 | 90 | 265 | |||
| Sequencing Time | 932 | 667 | 667 | 666 | 673 | 674 | 667 | 683 | 675 | 676 | 1067 | 687 | 1050 | 687 | 1050 | |||
| 10 /20 Analysis Time | 62 | 48 | 191 | 45 | 181 | 46 | 194 | 48 | 42 | 55 | 37 | 38 | 47 | 173 | 44 | 185 | 56 | 220 |
| Tertiary Analysis | n.a. | n.a. | n.a. | n.a. | n.a. | n.a. | n.a. | n.a. | 10 | 14 | 13 | 13 | 10 | 87 | 12 | 126 | 21 | 131 |
| Total Time to Result | 1145 | 765 | 903 | 757 | 888 | 753 | 917 | 761 | 800 | 807 | 802 | 793 | 812 | 2560 | 833 | 1626 | 854 | 1666 |
| Sequence metrics | ||||||||||||||||||
| Trimmed yield (Gigabases) | 149 | 192 | 178 | 186 | 189 | 165 | 176 | 80 | 135 | 187 | 162 | 182 | 144 | 174 | 153 | |||
| Reads with quality score >30 | 90.7% | 90.5% | 88.7% | 90.8% | 91.3% | 89.2% | 91.2% | 92.5% | 87.3% | 90.5% | 92.6% | 90.9% | 89.8% | 90.1% | 89.3% | |||
| Error rate | n.a. | 0.17% | 0.21% | 0.17% | 0.14% | 0.19% | 0.16% | 0.14% | 0.29% | 0.17% | 0.15% | 0.14% | 0.14% | 0.17% | 0.16% | |||
| Reads mapped | 98.9% | 96.7% | 96.8% | 96.8% | 97.2% | 96.0% | 96.9% | 89.0% | 94.8% | 96.2% | 99.1% | 96.1% | 99.1% | 95.5% | 98.6% | |||
| Duplicate reads | 8.5% | 11.6% | 10.8% | 12.9% | 13.9% | 15.2% | 15.5% | 23.2% | 14.5% | 13.7% | 11.4% | 15.8% | 10.4% | 14.9% | 13.6% | |||
| Mean insert size (Nt) | 345 | 395 | 438 | 449 | 445 | 440 | 426 | 496 | 468 | 465 | 423 | 491 | 467 | 502 | 460.5 | |||
| Average genome coverage | 47.5 | 52.3 | 49.1 | 49.9 | 50.5 | 44.5 | 47.44 | 19.46 | 36.7 | 49.1 | 45.7 | 46.9 | 40.7 | 45.1 | 41.5 | |||
| MIM genes w. >10X coverage of all coding domain Nt. | 95.8% | 97.6% | 96.4% | 96.7% | 97.1% | 94.9% | 95.8% | 4.2% | 92.2% | 90.1% | 95.5% | 94.6% | 94.5% | 94.7% | 94.6% | |||
| Variant metrics | ||||||||||||||||||
| Nt Variants (1000 s) | 4733 | 4834 | 4838 | 4838 | 4837 | 4857 | 3789 | 3904 | 4851 | 4691 | 4690 | 4852 | 4852 | 4916 | 4910 | |||
| Variants passing Quality Metrics | 96.8% | 98.9% | 99.1% | 99.1% | 99.0% | 99.0% | 99.0% | 98.4% | 98.6% | 98.9% | 98.9% | 99.0% | 98.9% | 98.9% | 98.9% | |||
| Coding domain variants | 0.58% | 0.51% | 0.52% | 0.52% | 0.52% | 0.53% | 0.52% | 0.52% | 0.51% | 0.52% | 0.52% | 0.52% | 0.52% | 0.53% | 0.53% | |||
| Nt insertions & deletions | 17.5% | 19.7% | 19.7% | 19.7% | 19.6% | 19.5% | 19.6% | 18.9% | 19.4% | 19.6% | 19.6% | 19.7% | 19.7% | 19.7% | 19.7% | |||
| Transition/transversion ratio | 2.02 | 2.03 | 2.02 | 2.02 | 2.02 | 2.03 | 2.03 | 2.03 | 2.03 | 2.03 | 2.03 | 2.03 | 2.03 | 2.03 | 2.03 | |||
Analytic and diagnostic reproducibility were examined for sample 362 from 19.5-h rWGS (16), reference samples NA12878 and NA24385, four retrospective samples/diagnoses (AG928/Hereditary fructose intolerance (compound heterozygous, pathogenic (P) SNVs in aldolase B [ALDOB c.448 G > C, c.524 C > A]); AG366/Ornithine transcarbamylase deficiency (hemizygous, de novo, P, SNV in ornithine transcarbamylase [OTC c.275 G > A]); AF414/Propionic acidemia (homozygous, likely pathogenic (LP) indel in α-subunit of propionyl-CoA carboxylase [PCCA c.1899 + 4_1899 + 7del]); AI003/Developmental and epileptic encephalopathy 11 (heterozygous, de novo, LP SNV in the α2-subunit of the voltage-gated sodium channel [SCN2A c.4437 G > C]), and three prospective samples (AH638/Thiamine metabolism dysfunction syndrome 2 (homozygous, P, frame-shift variant in solute carrier 19, member 3 [SLC19A3 c.597dup]), CSD59F (heteroplasmic, P, SNV in the mitochondrial ATP synthase 6 gene [MT-ATP6 m.8993 T > C]), and CSD709/ Geleophysic dysplasia (compound heterozygous SNVs in ADAMTS-like 2 [ADAMTSL2 c.338 G > T and c.1851C > A]), which received rWGS both with the 13.5-h method (Herein) and standard, singleton or trio, clinical rWGS (Std) (Table S2). Ref.:[16] Reference [16]. Sample 12878: Sample NA12878. ID: Identification. Here: Herein. 10/20 analysis time: Conversion of raw data from base call to FASTQ format, read alignment to the reference genomes and variant calling. Tertiary analysis: Time of automated interpretation to provisional diagnosis (most rapid of three systems run in parallel (MOON, Illumina TruSight Software Suite and GEM). SV and CNV detection methods: MC: Manta and CNVnator. : DRAGEN version 3.7. D3.5: DRAGEN version 3.5.3. MIM: Mendelian inheritance in man. Nt: Nucleotide. Gene symbols are shown in italics. Variant section headers are shown in bold.
Comparison of the analytic performance of standard, clinical rWGS, and the 13.5-h method.
| Variant type | Performance metric | NA12878 | NA24385 | ||||
|---|---|---|---|---|---|---|---|
| Variant number | v.2.5 | Variant number | MC | ||||
| SNV | Precision | 3,258,654 | 99.8% | 99.9% | 3,440,606 | n.a. | 99.7% |
| Recall | 99.7% | 99.9% | n.a. | 99.3% | |||
| indel | Precision | 490,488 | 99.0% | 99.6% | 553,766 | n.a. | 99.4% |
| Recall | 95.5% | 99.4% | n.a. | 98.6% | |||
| SV deletion | Precision | n.a. | n.a. | n.a. | 4203 | 91.7% | 97.1% |
| Recall | n.a. | n.a. | 57.3% | 61.7% | |||
| SV insertion | Precision | n.a. | n.a. | n.a. | 5444 | 99.0% | 98.4% |
| Recall | n.a. | n.a. | 27.4% | 49.3% | |||
| CNV deletion | Precision | n.a. | n.a. | n.a. | 33 | 83.3% | 100.0% |
| Recall | n.a. | n.a. | 9.1% | 87.9% | |||
The analytic performance of DRAGEN v.3.7 () for SNVs and indels was compared with DRAGEN v2.5, the prior method, in reference samples NA12878 and NA24385, using NIST benchmark genotypes[16]. The analytic performance of DRAGEN v.3.7 for SVs and CNVs was compared with Manta and CNVnator (MC) in triplicate libraries in reference sample NA24385, using NIST benchmark genotypes. SV and CNV evaluations used Witty.Er, with default settings except event reporting [–em cts])[35]. SVs were of size >50 nt and CNVs >10 kb.
Fig. 2Flowchart of the development of Genome-To-Treatment (GTRx), a virtual system for acute management guidance for rare genetic diseases.
Phase 1 - Compilation of a comprehensive gene-genetic disease list for severe, childhood-onset conditions in which an established treatment was available. Phase 2, integration of 13 information resources pertaining to rare genetic diseases. Phase 3, development of the GTRx web resource containing the integrated information resources. Phase 4, automated, artificial intelligence (AI)-based searching and manual curation of published evidence of treatments for each condition by three companies. Phase 5, development of a custom REDCap system for structured assessment of genes, disorders, and therapeutic interventions. Phase 6a, independent manual review of curated interventions and assertions for the first 15 pilot gene-disease pairs by five experts. Phase 6b, primary and secondary reviews of the remaining gene-disease pairs. Phase 7, round-table discussion of records lacking consensus. Phase 8, upload of retained consensus records to the GTRx web resource.
Fig. 3GTRx disease, gene, and literature filtering, and final content.
A A modified PRISMA flowchart showing filtering steps and summarizing results of review of 563 unique disease-gene dyads herein[86]. B Genetic disease types and disease genes featured in the first 100 GTRx genes reviewed herein.
Fig. 4Clinical course and diagnostic timeline of two critically ill infants who received 13.5-h rWGS and confirmatory standard diagnostic rWGS.
Clinical (a and c, dark blue circles) and diagnostic timelines (b and d, light blue circles) of infants AH638 (a, b) and CSD59F (c, d), who received both standard, clinical rWGS and the 13.5-h methods. ED Emergency Department, EEG Electroencephalogram, AI Artificial intelligence, DOL Day of life. Circles with vertical lines indicate interactions between neonatology, genomics, and biochemical genetics.
Fig. 5Decreasing cost of research WGS (red line) and time to provisional diagnosis of rapid, clinical WGS (blue line) of WGS, 2005–2021.
[13, 15–17] (https://www.genome.gov/about-genomics/fact-sheets/Sequencing-Human-Genome-cost). Source data are provided as a Source Data file.