Literature DB >> 32053352

Full-Spectrum Prediction of Peptides Tandem Mass Spectra using Deep Neural Network.

Kaiyuan Liu1, Sujun Li1, Lei Wang1, Yuzhen Ye1, Haixu Tang1.   

Abstract

The ability to predict tandem mass (MS/MS) spectra from peptide sequences can significantly enhance our understanding of the peptide fragmentation process and could improve peptide identification in proteomics. However, current approaches for predicting high-energy collisional dissociation (HCD) spectra are limited to predict the intensities of expected ion types, that is, the a/b/c/x/y/z ions and their neutral loss derivatives (referred to as backbone ions). In practice, backbone ions only account for <70% of total ion intensities in HCD spectra, indicating many intense ions are ignored by current predictors. In this paper, we present a deep learning approach that can predict the complete spectra (both backbone and nonbackbone ions) directly from peptide sequences. We made no assumptions or expectations on which kind of ions to predict but instead predicting the intensities for all possible m/z. Training this model needs no annotations of fragment ion nor any prior knowledge of the fragmentation rules. Our analyses show that the predicted 2+ and 3+ HCD spectra are highly similar to the experimental spectra, with average full-spectrum cosine similarities of 0.820 (±0.088) and 0.786 (±0.085), respectively, very close to the similarities between the experimental replicated spectra. In contrast, the best-performed backbone only models can only achieve an average similarity below 0.75 and 0.70 for 2+ and 3+ spectra, respectively. Furthermore, we developed a multitask learning (MTL) approach for predicting spectra of insufficient training samples, which allows our model to make accurate predictions for electron transfer dissociation (ETD) spectra and HCD spectra of less abundant charges (1+ and 4+).

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32053352      PMCID: PMC8057055          DOI: 10.1021/acs.analchem.9b04867

Source DB:  PubMed          Journal:  Anal Chem        ISSN: 0003-2700            Impact factor:   8.008


  29 in total

1.  Connecting genomic alterations to cancer biology with proteomics: the NCI Clinical Proteomic Tumor Analysis Consortium.

Authors:  Matthew J Ellis; Michael Gillette; Steven A Carr; Amanda G Paulovich; Richard D Smith; Karin K Rodland; R Reid Townsend; Christopher Kinsinger; Mehdi Mesri; Henry Rodriguez; Daniel C Liebler
Journal:  Cancer Discov       Date:  2013-10       Impact factor: 39.397

2.  pDeep: Predicting MS/MS Spectra of Peptides with Deep Learning.

Authors:  Xie-Xuan Zhou; Wen-Feng Zeng; Hao Chi; Chunjie Luo; Chao Liu; Jianfeng Zhan; Si-Min He; Zhifei Zhang
Journal:  Anal Chem       Date:  2017-11-21       Impact factor: 6.986

3.  Extending a Tandem Mass Spectral Library to Include MS2 Spectra of Fragment Ions Produced In-Source and MSn Spectra.

Authors:  Xiaoyu Yang; Pedatsur Neta; Stephen E Stein
Journal:  J Am Soc Mass Spectrom       Date:  2017-07-18       Impact factor: 3.109

4.  On the accuracy and limits of peptide fragmentation spectrum prediction.

Authors:  Sujun Li; Randy J Arnold; Haixu Tang; Predrag Radivojac
Journal:  Anal Chem       Date:  2010-12-22       Impact factor: 6.986

5.  Predicting intensity ranks of peptide fragment ions.

Authors:  Ari M Frank
Journal:  J Proteome Res       Date:  2009-05       Impact factor: 4.466

6.  Building ProteomeTools based on a complete synthetic human proteome.

Authors:  Daniel P Zolg; Mathias Wilhelm; Karsten Schnatbaum; Johannes Zerweck; Tobias Knaute; Bernard Delanghe; Derek J Bailey; Siegfried Gessulat; Hans-Christian Ehrlich; Maximilian Weininger; Peng Yu; Judith Schlegl; Karl Kramer; Tobias Schmidt; Ulrike Kusebauch; Eric W Deutsch; Ruedi Aebersold; Robert L Moritz; Holger Wenschuh; Thomas Moehring; Stephan Aiche; Andreas Huhmer; Ulf Reimer; Bernhard Kuster
Journal:  Nat Methods       Date:  2017-01-30       Impact factor: 28.547

7.  MS-GF+ makes progress towards a universal database search tool for proteomics.

Authors:  Sangtae Kim; Pavel A Pevzner
Journal:  Nat Commun       Date:  2014-10-31       Impact factor: 14.919

8.  Assembling the Community-Scale Discoverable Human Proteome.

Authors:  Mingxun Wang; Jian Wang; Jeremy Carver; Benjamin S Pullman; Seong Won Cha; Nuno Bandeira
Journal:  Cell Syst       Date:  2018-08-29       Impact factor: 10.304

9.  The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013.

Authors:  Juan Antonio Vizcaíno; Richard G Côté; Attila Csordas; José A Dianes; Antonio Fabregat; Joseph M Foster; Johannes Griss; Emanuele Alpi; Melih Birim; Javier Contell; Gavin O'Kelly; Andreas Schoenegger; David Ovelleiro; Yasset Pérez-Riverol; Florian Reisinger; Daniel Ríos; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

10.  ProteomeXchange provides globally coordinated proteomics data submission and dissemination.

Authors:  Juan A Vizcaíno; Eric W Deutsch; Rui Wang; Attila Csordas; Florian Reisinger; Daniel Ríos; José A Dianes; Zhi Sun; Terry Farrah; Nuno Bandeira; Pierre-Alain Binz; Ioannis Xenarios; Martin Eisenacher; Gerhard Mayer; Laurent Gatto; Alex Campos; Robert J Chalkley; Hans-Joachim Kraus; Juan Pablo Albar; Salvador Martinez-Bartolomé; Rolf Apweiler; Gilbert S Omenn; Lennart Martens; Andrew R Jones; Henning Hermjakob
Journal:  Nat Biotechnol       Date:  2014-03       Impact factor: 54.908

View more
  6 in total

Review 1.  Prediction of peptide mass spectral libraries with machine learning.

Authors:  Jürgen Cox
Journal:  Nat Biotechnol       Date:  2022-08-25       Impact factor: 68.164

Review 2.  Maximizing Depth of PTM Coverage: Generating Robust MS Datasets for Computational Prediction Modeling.

Authors:  Anthony A Iannetta; Leslie M Hicks
Journal:  Methods Mol Biol       Date:  2022

3.  High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry Data.

Authors:  Muhammad Haseeb; Fahad Saeed
Journal:  Nat Comput Sci       Date:  2021-08-20

4.  Positional SHAP (PoSHAP) for Interpretation of machine learning models trained from biological sequences.

Authors:  Quinn Dickinson; Jesse G Meyer
Journal:  PLoS Comput Biol       Date:  2022-01-28       Impact factor: 4.779

Review 5.  Deep learning neural network tools for proteomics.

Authors:  Jesse G Meyer
Journal:  Cell Rep Methods       Date:  2021-05-17

6.  DIAmeter: matching peptides to data-independent acquisition mass spectrometry data.

Authors:  Yang Young Lu; Jeff Bilmes; Ricard A Rodriguez-Mias; Judit Villén; William Stafford Noble
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.