Literature DB >> 28369237

Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?

Thilo Muth1, Bernhard Y Renard1.   

Abstract

While peptide identifications in mass spectrometry (MS)-based shotgun proteomics are mostly obtained using database search methods, high-resolution spectrum data from modern MS instruments nowadays offer the prospect of improving the performance of computational de novo peptide sequencing. The major benefit of de novo sequencing is that it does not require a reference database to deduce full-length or partial tag-based peptide sequences directly from experimental tandem mass spectrometry spectra. Although various algorithms have been developed for automated de novo sequencing, the prediction accuracy of proposed solutions has been rarely evaluated in independent benchmarking studies. The main objective of this work is to provide a detailed evaluation on the performance of de novo sequencing algorithms on high-resolution data. For this purpose, we processed four experimental data sets acquired from different instrument types from collision-induced dissociation and higher energy collisional dissociation (HCD) fragmentation mode using the software packages Novor, PEAKS and PepNovo. Moreover, the accuracy of these algorithms is also tested on ground truth data based on simulated spectra generated from peak intensity prediction software. We found that Novor shows the overall best performance compared with PEAKS and PepNovo with respect to the accuracy of correct full peptide, tag-based and single-residue predictions. In addition, the same tool outpaced the commercial competitor PEAKS in terms of running time speedup by factors of around 12-17. Despite around 35% prediction accuracy for complete peptide sequences on HCD data sets, taken as a whole, the evaluated algorithms perform moderately on experimental data but show a significantly better performance on simulated data (up to 84% accuracy). Further, we describe the most frequently occurring de novo sequencing errors and evaluate the influence of missing fragment ion peaks and spectral noise on the accuracy. Finally, we discuss the potential of de novo sequencing for now becoming more widely used in the field.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 28369237     DOI: 10.1093/bib/bbx033

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  14 in total

1.  Precision De Novo Peptide Sequencing Using Mirror Proteases of Ac-LysargiNase and Trypsin for Large-scale Proteomics.

Authors:  Hao Yang; Yan-Chang Li; Ming-Zhi Zhao; Fei-Lin Wu; Xi Wang; Wei-Di Xiao; Yi-Hao Wang; Jun-Ling Zhang; Fu-Qiang Wang; Feng Xu; Wen-Feng Zeng; Christopher M Overall; Si-Min He; Hao Chi; Ping Xu
Journal:  Mol Cell Proteomics       Date:  2019-01-08       Impact factor: 5.911

Review 2.  Proteomic Approaches to Unravel Mechanisms of Antibiotic Resistance and Immune Evasion of Bacterial Pathogens.

Authors:  Eva Torres-Sangiao; Alexander Dyason Giddey; Cristina Leal Rodriguez; Zhiheng Tang; Xiaoyun Liu; Nelson C Soares
Journal:  Front Med (Lausanne)       Date:  2022-05-02

3.  Combining High-Resolution and Exact Calibration To Boost Statistical Power: A Well-Calibrated Score Function for High-Resolution MS2 Data.

Authors:  Andy Lin; J Jeffry Howbert; William Stafford Noble
Journal:  J Proteome Res       Date:  2018-10-18       Impact factor: 4.466

4.  Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment.

Authors:  F Welker
Journal:  BMC Evol Biol       Date:  2018-02-20       Impact factor: 3.260

5.  pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework.

Authors:  Hao Yang; Hao Chi; Wen-Feng Zeng; Wen-Jing Zhou; Si-Min He
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

Review 6.  The Role of Mass Spectrometry and Proteogenomics in the Advancement of HLA Epitope Prediction.

Authors:  Amanda L Creech; Ying S Ting; Scott P Goulding; John F K Sauld; Dominik Barthelme; Michael S Rooney; Terri A Addona; Jennifer G Abelin
Journal:  Proteomics       Date:  2018-02-23       Impact factor: 3.984

7.  A high-speed search engine pLink 2 with systematic evaluation for proteome-scale identification of cross-linked peptides.

Authors:  Zhen-Lin Chen; Jia-Ming Meng; Yong Cao; Ji-Li Yin; Run-Qian Fang; Sheng-Bo Fan; Chao Liu; Wen-Feng Zeng; Yue-He Ding; Dan Tan; Long Wu; Wen-Jing Zhou; Hao Chi; Rui-Xiang Sun; Meng-Qiu Dong; Si-Min He
Journal:  Nat Commun       Date:  2019-07-30       Impact factor: 14.919

8.  Combination of Proteogenomics with Peptide De Novo Sequencing Identifies New Genes and Hidden Posttranscriptional Modifications.

Authors:  B Blank-Landeshammer; I Teichert; R Märker; M Nowrousian; U Kück; A Sickmann
Journal:  mBio       Date:  2019-10-15       Impact factor: 7.867

9.  Improved Identification of Small Open Reading Frames Encoded Peptides by Top-Down Proteomic Approaches and De Novo Sequencing.

Authors:  Bing Wang; Zhiwei Wang; Ni Pan; Jiangmei Huang; Cuihong Wan
Journal:  Int J Mol Sci       Date:  2021-05-22       Impact factor: 5.923

Review 10.  Prospects and challenges of cancer systems medicine: from genes to disease networks.

Authors:  Mohammad Reza Karimi; Amir Hossein Karimi; Shamsozoha Abolmaali; Mehdi Sadeghi; Ulf Schmitz
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.