
A bioinformatics workflow for variant peptide detection in shotgun proteomics.

Jing Li, Zengliu Su, Ze-Qiang Ma, Robbert J C Slebos, Patrick Halvey, David L Tabb, Daniel C Liebler, William Pao, Bing Zhang.

Abstract

Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from being identified. Including known coding variations in protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow to data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines: Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics.
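The two ideas in the abstract, adding variant sequences to the search database and estimating error rates specifically for variant identifications, can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how the false discovery rate estimate was modified, so the sketch assumes one common approach: a target-decoy estimate computed only over the variant-peptide matches. All function names, the dictionary keys (`score`, `is_decoy`, `is_variant`), the simplified trypsin rule, and the example sequence (an N-terminal fragment of KRAS with a G12D substitution, chosen purely for illustration) are assumptions.

```python
import re


def tryptic_peptides(sequence):
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def apply_substitution(sequence, position, ref, alt):
    """Apply a single amino acid substitution at a 1-based position."""
    if sequence[position - 1] != ref:
        raise ValueError("reference residue does not match the sequence")
    return sequence[:position - 1] + alt + sequence[position:]


def variant_only_peptides(sequence, position, ref, alt):
    """Return tryptic peptides that exist only in the variant sequence."""
    reference = set(tryptic_peptides(sequence))
    variant = tryptic_peptides(apply_substitution(sequence, position, ref, alt))
    return [p for p in variant if p not in reference]


def variant_fdr(psms, threshold):
    """Target-decoy FDR restricted to variant peptide-spectrum matches.

    Assumption: FDR is estimated as decoy hits / target hits among variant
    matches scoring at or above the threshold.
    """
    accepted = [m for m in psms if m["is_variant"] and m["score"] >= threshold]
    decoys = sum(1 for m in accepted if m["is_decoy"])
    targets = len(accepted) - decoys
    return decoys / targets if targets else 0.0


# Hypothetical example: one substitution yields one new searchable peptide.
new_peptides = variant_only_peptides("MTEYKLVVVGAGGVGK", 12, "G", "D")

# Hypothetical matches: only the variant subset enters the FDR estimate.
matches = [
    {"score": 50, "is_decoy": False, "is_variant": True},
    {"score": 45, "is_decoy": False, "is_variant": True},
    {"score": 40, "is_decoy": False, "is_variant": True},
    {"score": 35, "is_decoy": False, "is_variant": True},
    {"score": 32, "is_decoy": True, "is_variant": True},
    {"score": 60, "is_decoy": False, "is_variant": False},
    {"score": 10, "is_decoy": True, "is_variant": True},
]
fdr_at_30 = variant_fdr(matches, threshold=30)
```

Restricting the estimate to variant matches matters because variant peptides make up a small fraction of the database, so a global FDR threshold can hide a much higher error rate within that subset.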

Year: 2011 | PMID: 21389108 | PMCID: PMC3098595 | DOI: 10.1074/mcp.M110.006536

Source DB: PubMed | Journal: Mol Cell Proteomics | ISSN: 1535-9476 | Impact factor: 5.911

