| Literature DB >> 29323667 |
Caroline F Wright1,2, Jeremy F McRae3, Stephen Clayton3, Giuseppe Gallone3, Stuart Aitken4, Tomas W FitzGerald3, Philip Jones3, Elena Prigmore3, Diana Rajan3, Jenny Lord3, Alejandro Sifrim3, Rosemary Kelsell3, Michael J Parker5, Jeffrey C Barrett3, Matthew E Hurles3, David R FitzPatrick5, Helen V Firth3,6.
Abstract
PURPOSE: Given the rapid pace of discovery in rare disease genomics, it is likely that improvements in diagnostic yield can be made by systematically reanalyzing previously generated genomic sequence data in light of new knowledge.Entities:
Keywords: diagnostic yield; exome sequencing; reanalysis; reclassification; recontact
Mesh:
Year: 2018 PMID: 29323667 PMCID: PMC5912505 DOI: 10.1038/gim.2017.246
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Potential analytical sources of missed diagnoses and corresponding improvements made to the DDD workflow since 2014.
| Step | Purpose | Potential sources of missed diagnoses | Changes to DDD workflow |
|---|---|---|---|
| Sequence data is mapped to the human genome reference, and variation called relative to that reference | Low depth sequence data Incorrect reference sequence Incorrect mapping Variant detection/genotyping failed Variant class not considered (e.g. triplet repeats) | Updated versions of BWA, SAMtools, GATK and DeNovoGear Multi-sample variant calling Additional variant detection algorithms | |
| Stringent filters are applied to exclude low quality, common and non-coding variants that are unlikely to be clinically relevant | Low quality variant discarded Incorrect annotation of allele frequency Incorrect annotation of consequence Variant filtering thresholds too stringent | Updated version of VEP Updated MAF data Updated filtering thresholds (lower MAF, exclusion of benign inherited missense variants) | |
| Evidence-based, disease-specific ‘virtual’ gene panels are applied to limit variants to those with a relevant genotype (heterozygous/homozygous) and inheritance (dominant/recessive) in proven disease-causing genes | Incorrect disease mechanism Incorrect inheritance or family history Incomplete penetrance Phenotype not recorded Known gene missing from panel Causal gene not yet discovered |
Updated DDG2P (November 2013 freeze used previously; June 2016 freeze used here, including 286 additional genes) Plausibly pathogenic variants shared via DECIPHER Research Track Reviewed parental phenotypes | |
| Clinical assessment of the pathogenicity and contribution of specific variants to disease in a specific individual/family | Patient phenotype differs from previously published cases Phenotype not yet developed Evidence for pathogenicity is unclear | Candidate variants re-reviewed by core DDD clinical team and/or referring clinician Some patients clinically assessed again |
BWA=Burrows-Wheeler Aligner. GATK=Genome Analysis Toolkit. MAF=Minor Allele Frequency. VEP=Variant Effect Predictor. DDG2P=Developmental Disorder Gene-to-Phenotype database.
Figure 1Outline of DDD variant filtering and reporting workflow.
Details of thresholds are outlined in the Methods section. The entire workflow is automated until the final stage, which requires detailed clinical review of any candidate variants in light of the child’s specific developmental phenotype.
Summary of diagnoses and detection methods in the 454 diagnosed probands.
Reported variants that were considered by our clinical teams to explain all or part of a patient’s phenotype are summarised here; the variants themselves are in available with associated phenotypes through DECIPHER (https://decipher.sanger.ac.uk). All variants are in published developmental disorder genes with sufficient evidence to merit inclusion on our clinician-curated gene-to-phenotype database (https://www.ebi.ac.uk/gene2phenotype/). Note that although most variants have been analytically validated in an accredited diagnostic laboratory, functional studies have not been systematically performed to confirm clinical pathogenicity.
| Variant type | Analysis Method | #Diagnoses |
|---|---|---|
| Chromosomal aneuploidy | Chromosome read depth counter | 2 |
| Copy Number Variants | CNsolidate/CoNVex/CIPHER | 50 |
| DeNovoGear | 232 | |
| DeNovoGear/Discovery | 58 | |
| DeNovoGear/DDD Research Variant Track | 5 | |
| GATK candidate | 4 | |
| Inherited SNVs/indels in known genes | GATK Mendelian filter | 82 |
| Inherited SNVs/indels in new DDD genes | GATK Mendelian filter/Discovery | 4 |
| Large insertions/deletions | Soft-clipped reads | 4 |
| Mosaic structural variants | triPOD | 5 |
| Mosaic inherited SNVs/indels | Parental mosaicism | 4 |
| Non-essential splice variants | Splicing analysis | 4 |
| Uniparental disomy | UPDio | 6 |
Includes 6 dual diagnoses
Discovery indicates that a new developmental gene was found and published by the DDD Study.12,13,31
SNV=single nucleotide polymorphism; Indel=insertion/deletion; GATK=Genome Analysis Toolkit
Figure 2Summary of reported and diagnostic variants in 1133 trios.
The total number of candidate variants per proband using the 2017 analysis pipeline is indicated (black bars), along with the number of full or partially diagnostic variants per proband in 2017 (striped dark grey bars) and 2014 (light grey bars).
Figure 3Pathogenicity assessments of reported variants by inheritance class.
All variants (including SNVs, indels, CNVs, SVs, UPD and aneuploidies) that were classified by clinical teams as definitely/likely pathogenic were considered diagnostic, while those considered uncertain/likely benign/benign were not. The likelihood that a rare, functional de novo mutation in a dominant DDG2P gene is considered pathogenic is >80%, while the diagnostic yield from reported inherited variants is substantially less (10-30%). Note that variants of unknown and mosaic inheritance are excluded from the diagram due to low numbers (n<10).