| Literature DB >> 32698834 |
Michael M Segal1, Renee George2,3, Peter Waltman4,5, Ayman W El-Hattab6, Kiely N James2,3, Valentina Stanley2,3, Joseph Gleeson2,3.
Abstract
BACKGROUND: In diagnosis of rare genetic diseases we face a decision as to the degree to which the sequencing lab offers one or more diagnoses based on clinical input provided by the clinician, or the clinician reaches a diagnosis based on the complete set of variants provided by the lab. We tested a software approach to assist the clinician in making the diagnosis based on clinical findings and an annotated genomic variant table, using cases already solved using less automated processes.Entities:
Keywords: Artificial intelligence; Copy number variation; Diagnostic decision support system; Genomic analysis; Rare disease diagnosis
Mesh:
Year: 2020 PMID: 32698834 PMCID: PMC7374885 DOI: 10.1186/s13023-020-01461-1
Source DB: PubMed Journal: Orphanet J Rare Dis ISSN: 1750-1172 Impact factor: 4.123
The 81 cases with 216 individuals used in the study
| Type of case | Cases | Known gene | Discovery gene | Individuals per case | Rank of correct (known gene cases) | Number of zygosities (known gene cases) | |||
|---|---|---|---|---|---|---|---|---|---|
| Average | Range | Average | Range | Average | Range | ||||
| SNV in nuclear family | 18 | 15 | 3 | 2.39 | 2–4 | 1.13 | 1–3 | 12.60 | 1–21 |
| SNV variant shared | 19 | 19 | 0 | 3.11 | 2–6 | 1.16 | 1–4 | 2.63 | 1–10 |
| SNV gene shared | 20 | 15 | 5 | 3.60 | 2–6 | 1.00 | 1 | 1.27 | 1–2 |
| CNV in nuclear families | 24 | 21 | 3 | 1.75 | 1–3 | 1.14 | 1–3 | 87.00 | 1–790 |
| TOTAL | 81 | 70 | 11 | 2.67 | 1–6 | 1.11 | 1–4 | 29.79 | 1–790 |
There were 57 cases with SNVs only, divided into 3 groups depending on familial relationships: nuclear family (all were beyond the trio by virtue of having more than one sibling), variant shared (beyond nuclear families but with the same pathogenic variant), and gene shared (unrelated, with different variants in the same gene). Cases with CNVs were all within nuclear families, but 7 were beyond the trio by virtue of including a sibling. The number of zygosities (i.e., monoallelic versus biallelic) in the genome-phenome clinical correlation (e.g., Fig. 1) and rank of the gene zygosity that was correct (1 = top) are shown only for “known gene cases”; i.e., cases with a known gene-phenotype association in which a genome-phenome correlation can be done
Fig. 1Genome-Phenome Analysis display including both SNV and CNV results. Display of gene zygosities that fit with the variants and affected status of all individuals used. Numbers to the left are severity scores for each zygosity (e.g., NBAS gene variants, biallelic, with “c” denoting compound heterozygote). Zygosities are not ranked by severity score; instead they are ranked by the pertinence metric, here 100% for biallelic GLDC gene variants (denoted by light green shading) and 0% for the other zygosities and chromosomal abnormalities shown that represent other possible genetic diagnoses but are much lower in pertinence. The pertinence metric depends on both the severity of the gene zygosity and the clinical findings entered for the proband. The GLDC and PRSS1 variants were derived from a deletion region and the NBAS variants were derived from SNPs. Clicking on the “Show the 1 GLDC variant” button shows a mini variant table with that one variant location and an explanation of how the severity score was determined for that variant (not shown here). The check marks denote variants that were found, using the convention used for all findings, where for example, @6 m for a clinical finding would denote that the clinician had entered that finding as having onset at 6 months of age (not shown here). The boxes between the check marks and the zygosities are used to denote the clinician’s choice of a gene zygosity to report as pathogenic (not used in this illustration)
Fig. 2Gene discovery display. This display shows gene zygosities not associated with any published genetic condition. In the case shown here, only 2 discovery gene candidates were found, ranked by severity score (no pertinence metric is possible for gene zygosities with no known clinical phenotype). The PPIL1 gene (biallelic) variants were reported as causative
Fig. 3Pertinence values for positives and negatives. Positives are the 66 cases in which the correct (reported) gene was #1 in the genome-phenome analysis output (e.g., Fig. 1); in 67% of these, pertinence was 100.0. Negatives are the 15 cases in which the correct gene was not ranked # in the genome-phenome analysis (11 in which it was in the gene discovery display (e.g., Fig. 2) and 4 in which it was in the genome-phenome analysis output by not ranked #1)
Fig. 4Receiver operating characteristic (ROC) curve for the diagnostic ability of the analysis. All 81 cases were ranked by their pertinence scores and the true positive rate and the false positive rate are displayed. The area under the ROC curve is 0.93