| Literature DB >> 24895432 |
Felicity Allen1, Allison Pon2, Michael Wilson2, Russ Greiner2, David Wishart2.
Abstract
CFM-ID is a web server supporting three tasks associated with the interpretation of tandem mass spectra (MS/MS) for the purpose of automated metabolite identification: annotation of the peaks in a spectrum for a known chemical structure; prediction of spectra for a given chemical structure and putative metabolite identification--a predicted ranking of possible candidate structures for a target spectrum. The algorithms used for these tasks are based on Competitive Fragmentation Modeling (CFM), a recently introduced probabilistic generative model for the MS/MS fragmentation process that uses machine learning techniques to learn its parameters from data. These algorithms have been extensively tested on multiple datasets and have been shown to out-perform existing methods such as MetFrag and FingerId. This web server provides a simple interface for using these algorithms and a graphical display of the resulting annotations, spectra and structures. CFM-ID is made freely available at http://cfmid.wishartlab.com.Entities:
Mesh:
Year: 2014 PMID: 24895432 PMCID: PMC4086103 DOI: 10.1093/nar/gku436
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Summary of the three tasks provided by the CFM-ID web server: Spectrum Prediction, Peak Assignment and Compound Identification. Examples of possible inputs and the corresponding graphical output are shown for each.
Summary of Compound Identification results
| Querying KEGG (# cand. ≈ 22) | Querying PubChem (# cand. ≈ 1025) | |||||
|---|---|---|---|---|---|---|
| Data Set | R = 1 | R ≤ 5 | R = 1 | R ≤ 10 | MF = 1 | |
| CFM-ID | Metlin (+) | 76.5% | 96.2% | 10.9% | 40.7% | 88.9% |
| MassBank | 72.8% | 97.5% | 7.3% | 46.9% | 93.2% | |
| HMDB | 23.1% | 58.1% | 4.1% | 24.9% | 88.4% | |
| Metlin (−) | 72.1% | 96.5% | 13.4% | 51.4% | 93.8% | |
| MetFrag | Metlin (+) | 51.9% | 89.9% | 5.7% | 30.5% | 82.6% |
| MassBank | 48.1% | 88.9% | 4.7% | 20.8% | 85.4% | |
| HMDB | 13.3% | 43.6% | 2.6% | 13.4% | 88.0% | |
| Metlin (−) | 44.7% | 80.7% | 7.5% | 28.8% | 81.8% | |
| FingerID | Metlin (+) | 8.7% | 36.1% | 1.3% | 9.3% | 67.7% |
| MassBank | 14.8% | 37.0% | 0.5% | 5.7% | 71.9% | |
# cand. ≈ N : the median number of molecules in the candidate list.
R : ranking of the correct molecule in the candidate list.
MF : ranking of the correct molecular formula.