| Literature DB >> 28968714 |
Mark F Rogers1, Hashem A Shihab2, Matthew Mort3, David N Cooper3, Tom R Gaunt2, Colin Campbell1.
Abstract
Summary: We present FATHMM-XF, a method for predicting pathogenic point mutations in the human genome. Drawing on an extensive feature set, FATHMM-XF outperforms competitors on benchmark tests, particularly in non-coding regions where the majority of pathogenic mutations are likely to be found. Availability and implementation: The FATHMM-XF web server is available at http://fathmm.biocompute.org.uk/fathmm-xf/, and as tracks on the Genome Tolerance Browser: http://gtb.biocompute.org.uk. Predictions are provided for human genome version GRCh37/hg19. The data used for this project can be downloaded from: http://fathmm.biocompute.org.uk/fathmm-xf/. Contact: mark.rogers@bristol.ac.uk or c.campbell@bristol.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.Entities:
Mesh:
Year: 2018 PMID: 28968714 PMCID: PMC5860356 DOI: 10.1093/bioinformatics/btx536
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
FATHMM-XF yields state-of-the-art accuracy on unseen ClinVar examples in both non-coding regions and coding regions.
| Non-coding regions | ||||||
|---|---|---|---|---|---|---|
| Method | Acc. | AUC | Sens. | Spec. | MCC | PPV |
| FATHMM-XF | 0.89 | 0.97 | 0.95 | 0.84 | 0.53 | 0.36 |
| FATHMM-MKL | 0.88 | 0.95 | 0.94 | 0.82 | 0.49 | 0.33 |
| GAVIN | 0.87 | — | 0.82 | 0.93 | 0.61 | 0.52 |
| CADD (v1.3) | 0.64 | 0.95 | 0.98 | 0.30 | 0.18 | 0.12 |
| DANN | 0.61 | 0.95 | 0.99 | 0.23 | 0.15 | 0.11 |
| Coding regions | Acc. | AUC | Sens. | Spec. | MCC | PPV |
| FATHMM-XF | 0.88 | 0.96 | 0.84 | 0.92 | 0.76 | 0.83 |
| GAVIN | 0.89 | — | 0.90 | 0.87 | 0.74 | 0.76 |
| FATHMM-MKL | 0.80 | 0.90 | 0.91 | 0.70 | 0.56 | 0.58 |
| CADD (v1.3) | 0.63 | 0.91 | 0.98 | 0.29 | 0.30 | 0.38 |
| DANN | 0.60 | 0.89 | 0.99 | 0.20 | 0.25 | 0.36 |
Note: (Top) FATHMM-XF yields the highest accuracy on unseen ClinVar examples for non-coding regions, outperforming its nearest competitor, FATHMM-MKL. Cautious classification yields exceptionally high scores, yielding predictions for 31% of examples. (Bottom) FATHMM-XF yields higher accuracy, AUC, MCC and PPV scores than competitors on unseen ClinVar examples in coding regions. The lone exception is GAVIN, with nominally higher accuracy. Cautious classification again achieves extremely high scores, yielding predictions for more than 42% of test examples.