| Literature DB >> 31510666 |
Azat M Tagirdzhanov1,2, Alexander Shlemov1, Alexey Gurevich1.
Abstract
MOTIVATION: Peptidic natural products (PNPs) are considered a promising compound class that has many applications in medicine. Recently developed mass spectrometry-based pipelines are transforming PNP discovery into a high-throughput technology. However, the current computational methods for PNP identification via database search of mass spectra are still in their infancy and could be substantially improved.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31510666 PMCID: PMC6612854 DOI: 10.1093/bioinformatics/btz374
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.MS-GF+, and results on Spectra. The curves display the number of identified PSMs at different FDR levels. The dashed curves correspond to Dereplicator runs without P-value estimation
Fig. 2.Split of into training and testing datasets
Fig. 3.and results on Spectra. The curves display the number of identified PSMs at different FDR levels
Summary on and results on natural products spectral datasets
| Method | Number of PSMs | Number of PNPs | Number of PNP families | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
|
| |||||||||
|
| 8544 | 4538 | 6972 | 351 | 231 | 304 | 108 | 80 | 92 |
|
| 10 504 | 8811 | 10 287 | 395 | 290 | 378 | 132 | 101 | 129 |
|
| |||||||||
|
| 1978 | 177 | 177 | 429 | 13 | 13 | 196 | 8 | 8 |
|
| 2650 | 412 | 783 | 497 | 33 | 82 | 218 | 14 | 37 |
|
| 233 | 230 | 233 | 21 | 20 | 21 | 10 | 10 | 10 |
|
| 250 | 250 | 250 | 24 | 24 | 24 | 12 | 12 | 12 |
|
| |||||||||
|
| 220 | 15 | 15 | 78 | 2 | 2 | 52 | 2 | 2 |
|
| 402 | 174 | 186 | 138 | 30 | 33 | 66 | 12 | 14 |
|
| 83 | 83 | 83 | 13 | 13 | 13 | 6 | 6 | 6 |
|
| 179 | 179 | 179 | 29 | 29 | 29 | 10 | 10 | 10 |
|
| |||||||||
|
| 5311 | 280 | 949 | 290 | 32 | 68 | 145 | 15 | 30 |
|
| 6562 | 2495 | 5405 | 256 | 143 | 226 | 121 | 58 | 102 |
|
| 1380 | 814 | 1380 | 36 | 29 | 36 | 12 | 11 | 12 |
|
| 1881 | 1881 | 1881 | 42 | 42 | 42 | 11 | 11 | 11 |
Note: , FDR0 and FDR1 stand for the number of identified PSMs, unique PNPs or unique PNP families with P-value below , at FDR 0% and 1%, respectively. Here NPS stands for the rank-based NPS model, the finally best overall intensity-aware approach.
Fig. 4.and results on (a) , (b) , (c) Spectra and (d) the combination of the three. The curves display the number of identified PSMs (a–c) and PNP families (d) at different FDR levels