Literature DB >> 30638379

A Markov Chain Monte Carlo Method for Estimating the Statistical Significance of Proteoform Identifications by Top-Down Mass Spectrometry.

Qiang Kou1, Zhe Wang2, Rachele A Lubeckyj3, Si Wu2, Liangliang Sun3, Xiaowen Liu1,4.   

Abstract

Top-down mass spectrometry is capable of identifying whole proteoform sequences with multiple post-translational modifications because it generates tandem mass spectra directly from intact proteoforms. Many software tools, such as ProSightPC, MSPathFinder, and TopMG, have been proposed for identifying proteoforms with modifications. In these tools, various methods are employed to estimate the statistical significance of identifications. However, most existing methods are designed for proteoform identifications without modifications, and the challenge remains for accurately estimating the statistical significance of proteoform identifications with modifications. Here we propose TopMCMC, a method that combines a Markov chain random walk algorithm and a greedy algorithm for assigning statistical significance to matches between spectra and protein sequences with variable modifications. Experimental results showed that TopMCMC achieved high accuracy in estimating E-values and false discovery rates of identifications in top-down mass spectrometry. Coupled with TopMG, TopMCMC identified more spectra than the generating function method from an MCF-7 top-down mass spectrometry data set.

Entities:  

Keywords:  Markov chain Monte Carlo; statistical significance estimation; top-down mass spectrometry

Mesh:

Substances:

Year:  2019        PMID: 30638379      PMCID: PMC6484843          DOI: 10.1021/acs.jproteome.8b00562

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  36 in total

1.  Automated reduction and interpretation of high resolution electrospray mass spectra of large molecules.

Authors:  D M Horn; R A Zubarev; F W McLafferty
Journal:  J Am Soc Mass Spectrom       Date:  2000-04       Impact factor: 3.109

2.  A hypergeometric probability model for protein identification and validation using tandem mass spectral data and protein sequence databases.

Authors:  Rovshan G Sadygov; John R Yates
Journal:  Anal Chem       Date:  2003-08-01       Impact factor: 6.986

3.  A complex standard for protein identification, designed by evolution.

Authors:  Marc Vaudel; Julia M Burkhart; Daniela Breiter; René P Zahedi; Albert Sickmann; Lennart Martens
Journal:  J Proteome Res       Date:  2012-09-13       Impact factor: 4.466

4.  The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.

Authors:  Sangtae Kim; Nikolai Mischerikow; Nuno Bandeira; J Daniel Navarro; Louis Wich; Shabaz Mohammed; Albert J R Heck; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2010-09-09       Impact factor: 5.911

5.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry.

Authors:  Joshua E Elias; Steven P Gygi
Journal:  Nat Methods       Date:  2007-03       Impact factor: 28.547

6.  Sensitive and specific identification of wild type and variant proteins from 8 to 669 kDa using top-down mass spectrometry.

Authors:  N Murat Karabacak; Long Li; Ashutosh Tiwari; Lawrence J Hayward; Pengyu Hong; Michael L Easterling; Jeffrey N Agar
Journal:  Mol Cell Proteomics       Date:  2008-12-15       Impact factor: 5.911

7.  Single-Shot Top-Down Proteomics with Capillary Zone Electrophoresis-Electrospray Ionization-Tandem Mass Spectrometry for Identification of Nearly 600 Escherichia coli Proteoforms.

Authors:  Rachele A Lubeckyj; Elijah N McCool; Xiaojing Shen; Qiang Kou; Xiaowen Liu; Liangliang Sun
Journal:  Anal Chem       Date:  2017-11-07       Impact factor: 6.986

Review 8.  Top Down proteomics: facts and perspectives.

Authors:  Adam D Catherman; Owen S Skinner; Neil L Kelleher
Journal:  Biochem Biophys Res Commun       Date:  2014-02-17       Impact factor: 3.575

9.  Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore.

Authors:  Qiang Kou; Binhai Zhu; Si Wu; Charles Ansong; Nikola Tolić; Ljiljana Paša-Tolić; Xiaowen Liu
Journal:  J Proteome Res       Date:  2016-07-01       Impact factor: 4.466

Review 10.  Computational and statistical analysis of protein mass spectrometry data.

Authors:  William Stafford Noble; Michael J MacCoss
Journal:  PLoS Comput Biol       Date:  2012-01-26       Impact factor: 4.475

View more
  1 in total

1.  TopPIC Gateway: A Web Gateway for Top-Down Mass Spectrometry Data Interpretation.

Authors:  In Kwon Choi; Eroma Abeysinghe; Eric Coulter; Suresh Marru; Marlon Pierce; Xiaowen Liu
Journal:  PEARC20 (2020)       Date:  2020-07
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.