Literature DB >> 22103967

Protein identification using customized protein sequence databases derived from RNA-Seq data.

Xiaojing Wang1, Robbert J C Slebos, Dong Wang, Patrick J Halvey, David L Tabb, Daniel C Liebler, Bing Zhang.   

Abstract

The standard shotgun proteomics data analysis strategy relies on searching MS/MS spectra against a context-independent protein sequence database derived from the complete genome sequence of an organism. Because transcriptome sequence analysis (RNA-Seq) promises an unbiased and comprehensive picture of the transcriptome, we reason that a sample-specific protein database derived from RNA-Seq data can better approximate the real protein pool in the sample and thus improve protein identification. In this study, we have developed a two-step strategy for building sample-specific protein databases from RNA-Seq data. First, the database size is reduced by eliminating unexpressed or lowly expressed genes according to transcript quantification. Second, high-quality nonsynonymous coding single nucleotide variations (SNVs) are identified based on RNA-Seq data, and corresponding protein variants are added to the database. Using RNA-Seq and shotgun proteomics data from two colorectal cancer cell lines SW480 and RKO, we demonstrated that customized protein sequence databases could significantly increase the sensitivity of peptide identification, reduce ambiguity in protein assembly, and enable the detection of known and novel peptide variants. Thus, sample-specific databases from RNA-Seq data can enable more sensitive and comprehensive protein discovery in shotgun proteomics studies.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22103967      PMCID: PMC3727138          DOI: 10.1021/pr200766z

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  47 in total

Review 1.  Structure and mechanism of the Hsp90 molecular chaperone machinery.

Authors:  Laurence H Pearl; Chrisostomos Prodromou
Journal:  Annu Rev Biochem       Date:  2006       Impact factor: 23.643

2.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

3.  Detection of alternative splice variants at the proteome level in Aspergillus flavus.

Authors:  Kung-Yen Chang; D Ryan Georgianna; Steffen Heber; Gary A Payne; David C Muddiman
Journal:  J Proteome Res       Date:  2010-03-05       Impact factor: 4.466

Review 4.  A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors:  Alexey I Nesvizhskii
Journal:  J Proteomics       Date:  2010-09-08       Impact factor: 4.044

5.  Mutant p53 gain of function: reduction of tumor malignancy of human cancer cell lines through abrogation of mutant p53 expression.

Authors:  G Bossi; E Lapi; S Strano; C Rinaldo; G Blandino; A Sacchi
Journal:  Oncogene       Date:  2006-01-12       Impact factor: 9.867

6.  A bioinformatics workflow for variant peptide detection in shotgun proteomics.

Authors:  Jing Li; Zengliu Su; Ze-Qiang Ma; Robbert J C Slebos; Patrick Halvey; David L Tabb; Daniel C Liebler; William Pao; Bing Zhang
Journal:  Mol Cell Proteomics       Date:  2011-03-09       Impact factor: 5.911

Review 7.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

8.  The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment.

Authors:  Kang Ning; Alexey I Nesvizhskii
Journal:  BMC Bioinformatics       Date:  2010-12-14       Impact factor: 3.169

9.  Novel peptide identification from tandem mass spectra using ESTs and sequence database compression.

Authors:  Nathan J Edwards
Journal:  Mol Syst Biol       Date:  2007-04-17       Impact factor: 11.429

10.  RAId_DbS: mass-spectrometry based peptide identification web server with knowledge integration.

Authors:  Gelio Alves; Aleksey Y Ogurtsov; Yi-Kuo Yu
Journal:  BMC Genomics       Date:  2008-10-27       Impact factor: 3.969

View more
  74 in total

1.  A proteogenomics approach integrating proteomics and ribosome profiling increases the efficiency of protein identification and enables the discovery of alternative translation start sites.

Authors:  Alexander Koch; Daria Gawron; Sandra Steyaert; Elvis Ndah; Jeroen Crappé; Sarah De Keulenaer; Ellen De Meester; Ming Ma; Ben Shen; Kris Gevaert; Wim Van Criekinge; Petra Van Damme; Gerben Menschaert
Journal:  Proteomics       Date:  2014-10-02       Impact factor: 3.984

2.  Large-scale mass spectrometric detection of variant peptides resulting from nonsynonymous nucleotide differences.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Brian L Frey; Mark Scalf; Lloyd M Smith
Journal:  J Proteome Res       Date:  2013-11-11       Impact factor: 4.466

3.  Proteogenomic strategies for identification of aberrant cancer peptides using large-scale next-generation sequencing data.

Authors:  Sunghee Woo; Seong Won Cha; Seungjin Na; Clark Guest; Tao Liu; Richard D Smith; Karin D Rodland; Samuel Payne; Vineet Bafna
Journal:  Proteomics       Date:  2014-11-17       Impact factor: 3.984

4.  Identification of gene fusions from human lung cancer mass spectrometry data.

Authors:  Han Sun; Xiaobin Xing; Jing Li; Fengli Zhou; Yunqin Chen; Ying He; Wei Li; Guangwu Wei; Xiao Chang; Jia Jia; Yixue Li; Lu Xie
Journal:  BMC Genomics       Date:  2013-12-09       Impact factor: 3.969

5.  Top-down-assisted bottom-up method for homologous protein sequencing: hemoglobin from 33 bird species.

Authors:  Yang Song; Ünige A Laskay; Inger-Marie E Vilcins; Alan G Barbour; Vicki H Wysocki
Journal:  J Am Soc Mass Spectrom       Date:  2015-06-26       Impact factor: 3.109

Review 6.  Improving protein identification from tandem mass spectrometry data by one-step methods and integrating data from other platforms.

Authors:  Sinjini Sikdar; Ryan Gill; Susmita Datta
Journal:  Brief Bioinform       Date:  2015-07-03       Impact factor: 11.622

Review 7.  Protein analysis by shotgun/bottom-up proteomics.

Authors:  Yaoyang Zhang; Bryan R Fonslow; Bing Shan; Moon-Chang Baek; John R Yates
Journal:  Chem Rev       Date:  2013-02-26       Impact factor: 60.622

8.  Deep proteome coverage based on ribosome profiling aids mass spectrometry-based protein and peptide discovery and provides evidence of alternative translation products and near-cognate translation initiation events.

Authors:  Gerben Menschaert; Wim Van Criekinge; Tineke Notelaers; Alexander Koch; Jeroen Crappé; Kris Gevaert; Petra Van Damme
Journal:  Mol Cell Proteomics       Date:  2013-02-21       Impact factor: 5.911

9.  Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Brian L Frey; Lloyd M Smith
Journal:  Mol Cell Proteomics       Date:  2013-04-29       Impact factor: 5.911

Review 10.  Clinical potential of mass spectrometry-based proteogenomics.

Authors:  Bing Zhang; Jeffrey R Whiteaker; Andrew N Hoofnagle; Geoffrey S Baird; Karin D Rodland; Amanda G Paulovich
Journal:  Nat Rev Clin Oncol       Date:  2019-04       Impact factor: 66.675

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.