Literature DB >> 31364357

pClean: An Algorithm To Preprocess High-Resolution Tandem Mass Spectra for Database Searching.

Yamei Deng1,2,3, Zhe Ren3,4, Qingfei Pan1,2,3, Da Qi3,4, Bo Wen, Yan Ren3,4, Huanming Yang3,4,5, Lin Wu1, Fei Chen1, Siqi Liu1,3,4.   

Abstract

Database searches of MS/MS spectra are the main approach to peptide/protein identification in proteomics. Since most database search engines only utilize a small portion of the original MS/MS signals for peptide detection, how to improve the quality of MS/MS signals is a primary concern for enhancement of the peptide/protein identification rate. A fundamental issue is that some noise MS signals, informative or uninformative, have to be filtered out prior to database searching. Herein, an integrative preprocessing algorithm was designed, termed pClean, which incorporates three modules to preprocess MS/MS spectra, such as the removal of isobaric-labeling related ions, the reduction in isotopic peaks, the deconvolution of ions with higher charges, and the clearance of uninformative MS/MS signals. In contrast to the currently available approaches to MS/MS data preprocessing, pClean enables treatment of MS/MS spectra with high mass accuracy and favors filtering for the labeling or nonlabeling of peptides. Data sets at various scales gained from mass spectrometers with high resolution were used to assess the quality of peptides identified after pClean treatment and to compare the pClean improvement with those of other software programs. On the basis of the analysis of peptides identified and the Mascot ion score, pClean was proven to be effective in the removal of mass spectral noise and the reduction of random matching. Compared with other software programs, pClean appeared to be beneficial in terms of preprocessing performances for the enhancement of confidence scores and the increase in peptides identified. pClean is available at https://github.com/AimeeD90/pClean_release .

Keywords:  MS/MS; bioinformatics; database search; pClean; preprocessing; proteomics

Year:  2019        PMID: 31364357     DOI: 10.1021/acs.jproteome.9b00141

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  2 in total

1.  High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry Data.

Authors:  Muhammad Haseeb; Fahad Saeed
Journal:  Nat Comput Sci       Date:  2021-08-20

Review 2.  Inferring early-life host and microbiome functions by mass spectrometry-based metaproteomics and metabolomics.

Authors:  Veronika Kuchařová Pettersen; Luis Caetano Martha Antunes; Antoine Dufour; Marie-Claire Arrieta
Journal:  Comput Struct Biotechnol J       Date:  2021-12-20       Impact factor: 7.271

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.