| Literature DB >> 25156663 |
Michael M Segal1, Mostafa Abdellateef2, Ayman W El-Hattab3, Brian S Hilbush4, Francisco M De La Vega5, Gerard Tromp6, Marc S Williams6, Rebecca A Betensky7, Joseph Gleeson2.
Abstract
We describe an "integrated genome-phenome analysis" that combines both genomic sequence data and clinical information for genomic diagnosis. It is novel in that it uses robust diagnostic decision support and combines the clinical differential diagnosis and the genomic variants using a "pertinence" metric. This allows the analysis to be hypothesis-independent, not requiring assumptions about mode of inheritance, number of genes involved, or which clinical findings are most relevant. Using 20 genomic trios with neurologic disease, we find that pertinence scores averaging 99.9% identify the causative variant under conditions in which a genomic trio is analyzed and family-aware variant calling is done. The analysis takes seconds, and pertinence scores can be improved by clinicians adding more findings. The core conclusion is that automated genome-phenome analysis can be accurate, rapid, and efficient. We also conclude that an automated process offers a methodology for quality improvement of many components of genomic analysis.Entities:
Keywords: diagnosis; diagnostic decision support; whole exome sequencing
Mesh:
Year: 2014 PMID: 25156663 PMCID: PMC4339658 DOI: 10.1177/0883073814545884
Source DB: PubMed Journal: J Child Neurol ISSN: 0883-0738 Impact factor: 1.987
Identifying Pertinent Genes Beginning With Variants (Averages for n = 14).a
| Trio with family-aware calling | Trio | Proband only | ||
|---|---|---|---|---|
| Frequency data | Variome | Variome | Variome | 1000G |
| Variants | 9461 | 7126 | 7126 | 7126 |
| Genes with severity >0 | 807 | 981 | 1050 | 1398 |
| Genes after comparisonsb | 89 | 206 | 798 | 1335 |
| Known genesc | 13 | 27 | 92 | 147 |
| Gene rank of diagnosisd | 1.0 | 1.1 | 1.2 | 1.4 |
| Pertinence for no. 1 diagnosis | 99.9% | 90.8% | 89.3% | 81.6% |
| Correct gene pertinente | 100% | 79% | 79% | 64% |
| No. 1 gene = causal | 100% | 93% | 93% | 86% |
aThe analysis includes the 14 cases in which a causal gene variant was identified that corresponded to a published human germline phenotype.
bGenes after compound heterozygosity comparisons (and novelty analysis for trio).
cGenes for which the zygosity found corresponded to a known human phenotype.
dGenes with known phenotype to be evaluated manually to reach diagnosis.
ePertinence ≥99%.
Figure 1.Pertinent genes using trios with family-aware variant calling with variome and clinical information; n = 14 (dotted lines are cases with artifactual numbers of de novo variants).
Figure 2.Pertinent genes using proband-only with 1000 Genome and clinical information; n = 14 (dotted lines are cases with artifactual numbers of de novo variants).
Comparison of Genome-Phenome Analysis to Filtering Using a Key Finding.a
| Gene rank of diagnosis using 1 finding | ||||
|---|---|---|---|---|
| Gene | Genome-phenome analysis rank | Best rank | Average rank (of nonmisses) | % misses |
|
| 1.0 | 1.5 | 5.8 | 37.5 |
|
| 1.0 | 1.0 | 2.3 | 38.5 |
|
| 1.0 | 1.0 | 1.5 | 60.0 |
|
| 4.0 | 5.0 | 5.5 | 66.7 |
|
| 1.0 | 1.0 | 1.4 | 0.0 |
|
| 1.0 | 1.0 | 1.8 | 40.0 |
|
| 1.0 | 1.0 | 2.1 | 50.0 |
|
| 1.0 | 1.0 | 1.8 | 25.0 |
|
| 1.0 | 1.0 | 5.0 | 33.3 |
|
| 1.0 | 2.0 | 10.2 | 0.0 |
|
| 1.0 | 1.0 | 5.7 | 0.0 |
|
| 1.0 | 1.0 | 1.0 | 40.0 |
|
| 1.0 | 1.0 | 1.0 | 20.0 |
|
| 1.0 | 1.0 | 1.2 | 25.0 |
|
|
|
|
|
|
| CLARITY 1 | ||||
| | 1.0 | 2.0 | 9.1 | 28.6 |
| | 7.0 | 9.0 | 9.0 | 85.7 |
| CLARITY 2 | ||||
| | 2.0 | 4.0 | 4.0 | 50.0 |
| | 1.0 | 4.0 | 6.0 | 0.0 |
aAnalyses are using proband only, and for the non-CLARITY cases, the Middle Eastern variome.
Effects of Functional and Conservation Scores (Averages for n = 14).
| Enhanced processinga | Base processinga | |||
|---|---|---|---|---|
| Pertinence (%) | Rank | Pertinence (%) | Rank | |
| Both used | 99.9 | 1.0 | 81.6 | 1.4 |
| Functional off | 99.9 | 1.0 | 81.5 | 1.6 |
| Conservation off | 92.7 | 8.1 | 80.0 | 1.6 |
| Neither used | 99.8 | 1.0 | 77.7 | 1.6 |
aEnhanced processing includes a trio with family-aware variant calling and use of the variome. In base processing, a proband and 1000 Genome data were used.
Phenotype Rankings Using Clinical Information and the Variome (Averages for n = 14).
| Genomes | Trio | Trio | Proband | None |
|---|---|---|---|---|
| Family aware calling | Yes | No | No | N/A |
| Rank | 1.2 | 1.5 | 1.6 | 13.4 |
| Rank with nonvariants | 1.1 | 1.4 | 1.4 | 13.4 |
Gene Discovery Candidates of Monoallelic and Biallelic Zygosity (Averages for n = 15).
| Trio with family-aware calling | Trio without family-aware calling | Reduction with family-aware calling | Proband only | Reduction with family-aware calling | |
|---|---|---|---|---|---|
| Total monoallelic | 5.1 | 62.8 | 92% | 554.2 | 99.1% |
| Total biallelic | 40.1 | 57.9 | 31% | 103.1 | 61.1% |
Gene Discovery Ranking of the Causal Variant (Averages for n = 3).
| Trio with family-aware calling | Trio | Proband only | ||
|---|---|---|---|---|
| Frequency data | Variome | Variome | Variome | 1000G |
| Rank | 6.7 | 30.7 | 198.0 | 392.7 |