Nozomu Sakurai1, Takafumi Narise1, Joon-Soo Sim2, Chang-Muk Lee2, Chiaki Ikeda1, Nayumi Akimoto1, Shigehiko Kanaya3, Oliver Stegle.
Abstract
Summary: For metabolite annotation in metabolomics, variations in the registered states of compounds (charged molecules and multiple components, such as salts) and their redundancy among compound databases can cause misannotations and hamper immediate recognition of the uniqueness of metabolites when searching by mass values measured using mass spectrometry. We developed a search system named UC2 (Unique Connectivity of Uncharged Compounds), in which compounds are tentatively neutralized into uncharged states and stored on the basis of their unique connectivity of atoms after removing their stereochemical information using the first block in the hash of the IUPAC International Chemical Identifier. With this approach, false-positive hits are remarkably reduced, both charged and uncharged compounds are properly searched in a single query, and records having a unique connectivity are compiled in a single search result.
Availability and implementation: The UC2 search tool is available free of charge as a REST web service (http://webs2.kazusa.or.jp/mfsearcher) and a Java-based GUI tool.
Contact: sakurai@kazusa.or.jp.
Supplementary information: Supplementary data are available at Bioinformatics online.
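The grouping step described in the Summary can be illustrated with a minimal sketch. It assumes that each record already carries a standard InChIKey computed for the neutralized structure, and it simply groups records by the first 14-character block of that key. The function names and the three example records are hypothetical and are not taken from the UC2 implementation; neutralization itself is not shown.

```python
from collections import defaultdict


def connectivity_block(inchikey: str) -> str:
    """Return the first (14-character) block of a standard InChIKey."""
    block = inchikey.strip().split("-")[0]
    if len(block) != 14 or not block.isalpha() or not block.isupper():
        raise ValueError(f"not a standard InChIKey: {inchikey!r}")
    return block


def group_by_connectivity(records):
    """Group (name, InChIKey) pairs that share the same first block."""
    groups = defaultdict(list)
    for name, key in records:
        groups[connectivity_block(key)].append(name)
    return dict(groups)


# Illustrative input only: two stereoisomers and one unrelated compound.
records = [
    ("L-alanine", "QNAYBMKLOCPYGJ-REOHSLJMSA-N"),
    ("D-alanine", "QNAYBMKLOCPYGJ-UWTATZPHSA-N"),
    ("glycine", "DHMQDGOQFOQNFH-UHFFFAOYSA-N"),
]
groups = group_by_connectivity(records)
for block, names in groups.items():
    print(block, names)
```

In this sketch the two alanine stereoisomers fall into one group because their keys differ only after the first block, which mirrors the idea of compiling records with a unique connectivity into a single search result.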
Year: 2018 PMID: 29040459 PMCID: PMC5860614 DOI: 10.1093/bioinformatics/btx649
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
The number of the peaks (mass values) searched by the conventional search and the UC2 search
| | Tomato (positive) | Tomato (negative) | Urine (positive) | Urine (negative) | Random mass values (positive) | Random mass values (negative) |
|---|---|---|---|---|---|---|
| Total | 510 | 359 | 1264 | 1475 | 6491 | 6379 |
| Results found | 277 | 167 | 967 | 1092 | 1000 | 1000 |
| of which in conventional search | 277 | 164 | 967 | 1091 | 998 | 984 |
| of which in UC2 search | 220 | 139 | 906 | 1012 | 556 | 553 |
| Results found only in the conventional search | 57 (20.6%) | 28 (16.8%) | 61 (6.3%) | 80 (7.3%) | 444 (44.4%) | 447 (44.7%) |
| of which false positives | 57 | 28 | 61 | 80 | 444 | 447 |
| Results found only in the UC2 search | 0 (0%) | 3 (1.8%) | 0 (0%) | 1 (0.1%) | 2 (0.2%) | 16 (1.6%) |
| of which true positives | 0 | 3 | 0 | 1 | 1 | 14 |
| of which false positives | 0 | 0 | 0 | 0 | 1 | 2 |
Metabolites (160 peaks) detected in both positive and negative modes are counted under the positive mode.
[M+H]+ and [M−H]− were assumed for positive and negative modes, respectively, in the search with randomly generated mass values.
Queries whose results were found only in the UC2 search and that matched charged or fragmented entries were defined as true positives.
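The ion-species assumption in the footnotes ([M+H]+ for positive mode, [M−H]− for negative mode) can be sketched as a conversion from a measured m/z value to the mass of the uncharged compound, followed by a tolerance check. This is only an illustration: the function names and the 5 ppm default tolerance are assumptions, not parameters reported for the UC2 search.

```python
import math

PROTON_MASS = 1.007276  # Da; hydrogen atom mass minus one electron


def neutral_mass(mz: float, mode: str) -> float:
    """Convert a singly charged m/z value to the uncharged monoisotopic mass."""
    if mode == "positive":  # assumes [M+H]+
        return mz - PROTON_MASS
    if mode == "negative":  # assumes [M-H]-
        return mz + PROTON_MASS
    raise ValueError(f"unknown ion mode: {mode!r}")


def matches(query_mass: float, compound_mass: float, ppm: float = 5.0) -> bool:
    """True if the query lies within a ppm window (assumed value) of the compound mass."""
    return abs(query_mass - compound_mass) <= compound_mass * ppm * 1e-6


ALANINE = 89.047678  # monoisotopic mass of C3H7NO2 in Da

m_pos = neutral_mass(90.054954, "positive")
m_neg = neutral_mass(88.040402, "negative")
print(matches(m_pos, ALANINE), matches(m_neg, ALANINE))
```

Because both ion modes are converted to the same uncharged mass before lookup, one neutral entry can answer queries from either mode, which fits a database that stores compounds in their uncharged state.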