Literature DB >> 16873487

Peptide sequence tag-based blind identification of post-translational modifications with point process model.

Chunmei Liu1, Bo Yan, Yinglei Song, Ying Xu, Liming Cai.   

Abstract

UNLABELLED: An important but difficult problem in proteomics is the identification of post-translational modifications (PTMs) in a protein. In general, the process of PTM identification by aligning experimental spectra with theoretical spectra from peptides in a peptide database is very time consuming and may lead to high false positive rate. In this paper, we introduce a new approach that is both efficient and effective for blind PTM identification. Our work consists of the following phases. First, we develop a novel tree decomposition based algorithm that can efficiently generate peptide sequence tags (PSTs) from an extended spectrum graph. Sequence tags are selected from all maximum weighted antisymmetric paths in the graph and their reliabilities are evaluated with a score function. An efficient deterministic finite automaton (DFA) based model is then developed to search a peptide database for candidate peptides by using the generated sequence tags. Finally, a point process model-an efficient blind search approach for PTM identification, is applied to report the correct peptide and PTMs if there are any. Our tests on 2657 experimental tandem mass spectra and 2620 experimental spectra with one artificially added PTM show that, in addition to high efficiency, our ab-initio sequence tag selection algorithm achieves better or comparable accuracy to other approaches. Database search results show that the sequence tags of lengths 3 and 4 filter out more than 98.3% and 99.8% peptides respectively when applied to a yeast peptide database. With the dramatically reduced search space, the point process model achieves significant improvement in accuracy as well. AVAILABILITY: The software is available upon request.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16873487     DOI: 10.1093/bioinformatics/btl226

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  De novo sequencing and homology searching.

Authors:  Bin Ma; Richard Johnson
Journal:  Mol Cell Proteomics       Date:  2011-11-16       Impact factor: 5.911

Review 2.  The significance, development and progress of high-throughput combinatorial histone code analysis.

Authors:  Nicolas L Young; Peter A Dimaggio; Benjamin A Garcia
Journal:  Cell Mol Life Sci       Date:  2010-08-04       Impact factor: 9.261

3.  Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra.

Authors:  Sangtae Kim; Nitin Gupta; Nuno Bandeira; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2008-08-14       Impact factor: 5.911

4.  A novel approach for untargeted post-translational modification identification using integer linear optimization and tandem mass spectrometry.

Authors:  Richard C Baliban; Peter A DiMaggio; Mariana D Plazas-Mayorca; Nicolas L Young; Benjamin A Garcia; Christodoulos A Floudas
Journal:  Mol Cell Proteomics       Date:  2010-01-26       Impact factor: 5.911

5.  DeltAMT: a statistical algorithm for fast detection of protein modifications from LC-MS/MS data.

Authors:  Yan Fu; Li-Yun Xiu; Wei Jia; Ding Ye; Rui-Xiang Sun; Xiao-Hong Qian; Si-Min He
Journal:  Mol Cell Proteomics       Date:  2011-02-14       Impact factor: 5.911

Review 6.  A face in the crowd: recognizing peptides through database search.

Authors:  Jimmy K Eng; Brian C Searle; Karl R Clauser; David L Tabb
Journal:  Mol Cell Proteomics       Date:  2011-08-29       Impact factor: 5.911

Review 7.  Quantitative proteomic analysis of histone modifications.

Authors:  He Huang; Shu Lin; Benjamin A Garcia; Yingming Zhao
Journal:  Chem Rev       Date:  2015-02-17       Impact factor: 60.622

Review 8.  A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors:  Alexey I Nesvizhskii
Journal:  J Proteomics       Date:  2010-09-08       Impact factor: 4.044

9.  PILOT_PROTEIN: identification of unmodified and modified proteins via high-resolution mass spectrometry and mixed-integer linear optimization.

Authors:  Richard C Baliban; Peter A Dimaggio; Mariana D Plazas-Mayorca; Benjamin A Garcia; Christodoulos A Floudas
Journal:  J Proteome Res       Date:  2012-07-26       Impact factor: 4.466

10.  Liquid Chromatography Mass Spectrometry-Based Proteomics: Biological and Technological Aspects.

Authors:  Yuliya V Karpievitch; Ashoka D Polpitiya; Gordon A Anderson; Richard D Smith; Alan R Dabney
Journal:  Ann Appl Stat       Date:  2010       Impact factor: 2.083

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.