
Mass spectrum sequential subtraction speeds up searching large peptide MS/MS spectra datasets against large nucleotide databases for proteogenomics.

Mohamed Helmy, Naoyuki Sugiyama, Masaru Tomita, Yasushi Ishihama.

Abstract

We have developed a novel bioinformatics method, mass spectrum sequential subtraction (MSSS), for searching large peptide spectra datasets produced by liquid chromatography-tandem mass spectrometry (LC-MS/MS) against protein databases and large nucleotide sequence databases. The main principle of MSSS is to search the peptide spectra set against the protein database first, then remove the spectra corresponding to the identified peptides, leaving a smaller set of spectra to search against the nucleotide sequence database. Reducing the number of spectra to be searched in this way limits the peptide search space. A comparison of MSSS with the conventional search approach, using a dataset of 27 LC-MS/MS runs of rice cultured cells, indicated that MSSS reduced the search queries to 50% and the search time to 75% on average. In addition, MSSS had no effect on the identification false-positive rate (FPR) or on the ability to identify novel peptide sequences. We used MSSS to analyze another dataset of 34 LC-MS/MS runs, which identified 74 additional novel peptides. Proteogenomic analysis with these additional peptides yielded 47 new genomic features in 24 rice genes, plus 24 intergenic peptides. These results show the utility of MSSS for searching large databases with large MS/MS datasets in proteogenomics.
© 2012 The Authors Journal compilation © 2012 by the Molecular Biology Society of Japan/Blackwell Publishing Ltd.
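
The subtraction step described in the abstract can be sketched roughly as follows. This is a minimal illustration only, not the authors' implementation: the `Spectrum` and `SearchFn` types, the `sequential_subtraction` function, and the two toy search functions are all hypothetical stand-ins for a real database search engine, and the peptide strings are made up.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a spectrum is (spectrum_id, precursor_mz); a search
# function maps a list of spectra to {spectrum_id: peptide} for accepted hits.
Spectrum = Tuple[str, float]
SearchFn = Callable[[List[Spectrum]], Dict[str, str]]


def sequential_subtraction(
    spectra: List[Spectrum],
    search_protein_db: SearchFn,
    search_nucleotide_db: SearchFn,
) -> Tuple[Dict[str, str], Dict[str, str], List[Spectrum]]:
    """Search the protein database first, then search only the leftover
    spectra against the (much larger) nucleotide database."""
    # Step 1: search every spectrum against the protein database.
    protein_hits = search_protein_db(spectra)
    # Step 2: subtract the spectra that already have a peptide identification.
    remaining = [s for s in spectra if s[0] not in protein_hits]
    # Step 3: search the reduced set against the nucleotide database.
    nucleotide_hits = search_nucleotide_db(remaining)
    return protein_hits, nucleotide_hits, remaining


# Toy stand-ins for a real search engine (assumed, for illustration only).
def toy_protein_search(spectra: List[Spectrum]) -> Dict[str, str]:
    known = {"s1": "PEPTIDEK", "s3": "SAMPLEK"}
    return {sid: known[sid] for sid, _ in spectra if sid in known}


def toy_nucleotide_search(spectra: List[Spectrum]) -> Dict[str, str]:
    known = {"s2": "TESTK"}
    return {sid: known[sid] for sid, _ in spectra if sid in known}


spectra = [("s1", 450.2), ("s2", 612.8), ("s3", 582.3), ("s4", 701.4)]
protein_hits, nucleotide_hits, remaining = sequential_subtraction(
    spectra, toy_protein_search, toy_nucleotide_search
)
print(len(spectra), "spectra in,", len(remaining), "sent to the nucleotide search")
```

In this toy run, two of the four spectra are identified in the protein search, so only the other two reach the nucleotide search. The saving in a real analysis depends on how many spectra the protein database explains.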


Year: 2012   PMID: 22686349   DOI: 10.1111/j.1365-2443.2012.01615.x

Source DB: PubMed   Journal: Genes Cells   ISSN: 1356-9597   Impact factor: 1.891


