| Literature DB >> 30727941 |
Imane Boudellioua1,2, Maxat Kulmanov1,2, Paul N Schofield3, Georgios V Gkoutos4,5,6,7,8,9, Robert Hoehndorf10,11.
Abstract
BACKGROUND: Prioritization of variants in personal genomic data is a major challenge. Recently, computational methods that rely on comparing phenotype similarity have shown to be useful to identify causative variants. In these methods, pathogenicity prediction is combined with a semantic similarity measure to prioritize not only variants that are likely to be dysfunctional but those that are likely involved in the pathogenesis of a patient's phenotype.Entities:
Keywords: Machine learning; Ontology; Phenotype; Variant prioritization
Mesh:
Year: 2019 PMID: 30727941 PMCID: PMC6364462 DOI: 10.1186/s12859-019-2633-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The DeepPVP neural network model
Comparison of top ranks of ClinVar variants as recovered from WES data; variants with MAF > 1% are filtered
| Top hit | Top 10 hits | Total | ROC AUC | AUPR | |
|---|---|---|---|---|---|
| DeepPVP | 4060 (71.40%) | 4750 (83.54%) | 5686 | 0.94 | 0.66 |
| DeepPVP-RF | 3520 (61.91%) | 4321 (75.99%) | 5686 | 0.95 | 0.55 |
| PVP 1.1 | 3619 (63.65%) | 4076 (71.68%) | 5686 | 0.95 | 0.55 |
| Exomiser | 2910 (51.18%) | 3608 (63.45%) | 5686 | 0.89 | 0.43 |
| Exomiser-CADD | 2926 (51.46%) | 3621 (63.68%) | 5686 | 0.89 | 0.43 |
| CADD | 1060 (18.64%) | 2429 (42.72%) | 5686 | 0.94 | 0.14 |
| DANN | 170 (2.99%) | 1322 (23.25%) | 5686 | 0.90 | 0.03 |
| GWAVA | 63 (1.11%) | 264 (4.64%) | 5686 | 0.66 | 0.01 |