Literature DB >> 31745383

Learning Concave Conditional Likelihood Models for Improved Analysis of Tandem Mass Spectra.

John T Halloran1, David M Rocke2.   

Abstract

The most widely used technology to identify the proteins present in a complex biological sample is tandem mass spectrometry, which quickly produces a large collection of spectra representative of the peptides (i.e., protein subsequences) present in the original sample. In this work, we greatly expand the parameter learning capabilities of a dynamic Bayesian network (DBN) peptide-scoring algorithm, Didea [25], by deriving emission distributions for which its conditional log-likelihood scoring function remains concave. We show that this class of emission distributions, called Convex Virtual Emissions (CVEs), naturally generalizes the log-sum-exp function while rendering both maximum likelihood estimation and conditional maximum likelihood estimation concave for a wide range of Bayesian networks. Utilizing CVEs in Didea allows efficient learning of a large number of parameters while ensuring global convergence, in stark contrast to Didea's previous parameter learning framework (which could only learn a single parameter using a costly grid search) and other trainable models [12, 13, 14] (which only ensure convergence to local optima). The newly trained scoring function substantially outperforms the state-of-the-art in both scoring function accuracy and downstream Fisher kernel analysis. Furthermore, we significantly improve Didea's runtime performance through successive optimizations to its message passing schedule and derive explicit connections between Didea's new concave score and related MS/MS scoring functions.

Entities:  

Year:  2018        PMID: 31745383      PMCID: PMC6863516     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  20 in total

1.  Semi-supervised learning for peptide identification from shotgun proteomics datasets.

Authors:  Lukas Käll; Jesse D Canterbury; Jason Weston; William Stafford Noble; Michael J MacCoss
Journal:  Nat Methods       Date:  2007-10-21       Impact factor: 28.547

2.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

Authors:  J K Eng; A L McCormack; J R Yates
Journal:  J Am Soc Mass Spectrom       Date:  1994-11       Impact factor: 3.109

3.  Computing exact p-values for a cross-correlation shotgun proteomics score function.

Authors:  J Jeffry Howbert; William Stafford Noble
Journal:  Mol Cell Proteomics       Date:  2014-06-02       Impact factor: 5.911

4.  Faster SEQUEST searching for peptide identification from tandem mass spectra.

Authors:  Benjamin J Diament; William Stafford Noble
Journal:  J Proteome Res       Date:  2011-07-29       Impact factor: 4.466

5.  Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra.

Authors:  Ajit P Singh; John Halloran; Jeff A Bilmes; Katrin Kirchoff; William S Noble
Journal:  Uncertain Artif Intell       Date:  2012-08

6.  Learning Peptide-Spectrum Alignment Models for Tandem Mass Spectrometry.

Authors:  John T Halloran; Jeff A Bilmes; William S Noble
Journal:  Uncertain Artif Intell       Date:  2014

7.  A proteomics search algorithm specifically designed for high-resolution tandem mass spectra.

Authors:  Craig D Wenger; Joshua J Coon
Journal:  J Proteome Res       Date:  2013-01-31       Impact factor: 4.466

8.  Statistical calibration of the SEQUEST XCorr function.

Authors:  Aaron A Klammer; Christopher Y Park; William Stafford Noble
Journal:  J Proteome Res       Date:  2009-04       Impact factor: 4.466

9.  MS-GF+ makes progress towards a universal database search tool for proteomics.

Authors:  Sangtae Kim; Pavel A Pevzner
Journal:  Nat Commun       Date:  2014-10-31       Impact factor: 14.919

10.  Improved False Discovery Rate Estimation Procedure for Shotgun Proteomics.

Authors:  Uri Keich; Attila Kertesz-Farkas; William Stafford Noble
Journal:  J Proteome Res       Date:  2015-07-27       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.