| Literature DB >> 29882266 |
Yasset Perez-Riverol1, Juan Antonio Vizcaíno1, Johannes Griss1,2.
Abstract
In this article, current and future applications of spectral clustering are discussed in the context of mass spectrometry-based proteomics approaches. First of all, the main algorithms and tools that can currently be used to perform spectral clustering are introduced. In addition, its main applications and their use in current computational proteomics workflows are explained, including the generation of spectral libraries and spectral archives. Finally, possible future directions for spectral clustering, including its potential use to achieve a deeper coverage of the proteome and the discovery of novel post-translational modifications and single amino acid variants.Entities:
Keywords: algorithms; computational proteomics; mass spectrometry; spectral clustering
Mesh:
Substances:
Year: 2018 PMID: 29882266 PMCID: PMC6099476 DOI: 10.1002/pmic.201700454
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984
Figure 1Spectral clustering in proteomics. The input data for any clustering algorithm consists of A) publicly available mass spectra data in proteomics repositories (unidentified, correctly identified, and/or incorrectly identified spectra); B) identified spectra from small‐scale experiments. After the spectral clustering process one main output is expected: C) spectral archives. The spectral archives contain two types of clusters: D) clusters with identified spectra (spectral libraries) and clusters of unidentified spectra. Multiple applications are represented: E) by clustering high‐quality peptide identifications with low‐quality ones, quality assessment of possible false positive identifications can be performed. F) Spectral clustering can help to infer identifications for unidentified spectra, by clustering identified and unidentified spectra together. G) Detection of clusters of unidentified spectra. The resulting clusters should be analyzed with alternative methods such as de novo or open modification searches. H) The combination of database searches with spectral library searches can be useful to increase the number of identifications. I) Finally, spectral libraries in DIA analysis algorithms where spectral assays are designed from previous spectral libraries generated from DDA data.