Literature DB >> 25263569

Proteogenomic strategies for identification of aberrant cancer peptides using large-scale next-generation sequencing data.

Sunghee Woo1, Seong Won Cha, Seungjin Na, Clark Guest, Tao Liu, Richard D Smith, Karin D Rodland, Samuel Payne, Vineet Bafna.   

Abstract

Cancer is driven by the acquisition of somatic DNA lesions. Distinguishing the early driver mutations from subsequent passenger mutations is key to molecular subtyping of cancers, understanding cancer progression, and the discovery of novel biomarkers. The advances of genomics technologies (whole-genome exome, and transcript sequencing, collectively referred to as NGS (next-generation sequencing)) have fueled recent studies on somatic mutation discovery. However, the vision is challenged by the complexity, redundancy, and errors in genomic data, and the difficulty of investigating the proteome translated portion of aberrant genes using only genomic approaches. Combination of proteomic and genomic technologies are increasingly being employed. Various strategies have been employed to allow the usage of large-scale NGS data for conventional MS/MS searches. This paper provides a discussion of applying different strategies relating to large database search, and FDR (false discovery rate) -based error control, and their implication to cancer proteogenomics. Moreover, it extends and develops the idea of a unified genomic variant database that can be searched by any MS sample. A total of 879 BAM files downloaded from TCGA repository were used to create a 4.34 GB unified FASTA database that contained 2787062 novel splice junctions, 38,464 deletions, 1,105 insertions, and 182,302 substitutions. Proteomic data from a single ovarian carcinoma sample (439,858 spectra) was searched against the database. By applying the most conservative FDR measure, we have identified 524 novel peptides and 65,578 known peptides at 1% FDR threshold. The novel peptides include interesting examples of doubly mutated peptides, frame-shifts, and nonsample-recruited mutations, which emphasize the strength of our approach.
© 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  Cancer; MS; Mutated peptide identification; Ovarian cancer; Proteogenomics

Mesh:

Substances:

Year:  2014        PMID: 25263569      PMCID: PMC4256132          DOI: 10.1002/pmic.201400206

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  25 in total

1.  Correlation of mRNA and protein abundance in the developing maize leaf.

Authors:  Lalit Ponnala; Yupeng Wang; Qi Sun; Klaas J van Wijk
Journal:  Plant J       Date:  2014-04-02       Impact factor: 6.417

2.  An automated proteogenomic method uses mass spectrometry to reveal novel genes in Zea mays.

Authors:  Natalie E Castellana; Zhouxin Shen; Yupeng He; Justin W Walley; California Jack Cassidy; Steven P Briggs; Vineet Bafna
Journal:  Mol Cell Proteomics       Date:  2013-10-18       Impact factor: 5.911

3.  Proteogenomic database construction driven from large scale RNA-seq data.

Authors:  Sunghee Woo; Seong Won Cha; Gennifer Merrihew; Yupeng He; Natalie Castellana; Clark Guest; Michael MacCoss; Vineet Bafna
Journal:  J Proteome Res       Date:  2013-07-17       Impact factor: 4.466

4.  Protein identification using customized protein sequence databases derived from RNA-Seq data.

Authors:  Xiaojing Wang; Robbert J C Slebos; Dong Wang; Patrick J Halvey; David L Tabb; Daniel C Liebler; Bing Zhang
Journal:  J Proteome Res       Date:  2011-12-14       Impact factor: 4.466

5.  De novo derivation of proteomes from transcriptomes for transcript and protein identification.

Authors:  Vanessa C Evans; Gary Barker; Kate J Heesom; Jun Fan; Conrad Bessant; David A Matthews
Journal:  Nat Methods       Date:  2012-11-11       Impact factor: 28.547

6.  Proteogenomic analysis of bacteria and archaea: a 46 organism case study.

Authors:  Eli Venter; Richard D Smith; Samuel H Payne
Journal:  PLoS One       Date:  2011-11-17       Impact factor: 3.240

7.  TopHat-Fusion: an algorithm for discovery of novel fusion transcripts.

Authors:  Daehwan Kim; Steven L Salzberg
Journal:  Genome Biol       Date:  2011-08-11       Impact factor: 13.583

8.  customProDB: an R package to generate customized protein databases from RNA-Seq data for proteomics search.

Authors:  Xiaojing Wang; Bing Zhang
Journal:  Bioinformatics       Date:  2013-09-20       Impact factor: 6.937

9.  Comprehensive molecular portraits of human breast tumours.

Authors: 
Journal:  Nature       Date:  2012-09-23       Impact factor: 49.962

10.  Ensembl 2013.

Authors:  Paul Flicek; Ikhlak Ahmed; M Ridwan Amode; Daniel Barrell; Kathryn Beal; Simon Brent; Denise Carvalho-Silva; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Laurent Gil; Carlos García-Girón; Leo Gordon; Thibaut Hourlier; Sarah Hunt; Thomas Juettemann; Andreas K Kähäri; Stephen Keenan; Monika Komorowska; Eugene Kulesha; Ian Longden; Thomas Maurel; William M McLaren; Matthieu Muffato; Rishi Nag; Bert Overduin; Miguel Pignatelli; Bethan Pritchard; Emily Pritchard; Harpreet Singh Riat; Graham R S Ritchie; Magali Ruffier; Michael Schuster; Daniel Sheppard; Daniel Sobral; Kieron Taylor; Anja Thormann; Stephen Trevanion; Simon White; Steven P Wilder; Bronwen L Aken; Ewan Birney; Fiona Cunningham; Ian Dunham; Jennifer Harrow; Javier Herrero; Tim J P Hubbard; Nathan Johnson; Rhoda Kinsella; Anne Parker; Giulietta Spudich; Andy Yates; Amonida Zadissa; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2012-11-30       Impact factor: 16.971

View more
  25 in total

1.  A proteogenomic approach to understand splice isoform functions through sequence and expression-based computational modeling.

Authors:  Hong-Dong Li; Gilbert S Omenn; Yuanfang Guan
Journal:  Brief Bioinform       Date:  2016-01-06       Impact factor: 11.622

2.  Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins.

Authors:  Sondos Samandi; Annie V Roy; Vivian Delcourt; Jean-François Lucier; Jules Gagnon; Maxime C Beaudoin; Benoît Vanderperre; Marc-André Breton; Julie Motard; Jean-François Jacques; Mylène Brunelle; Isabelle Gagnon-Arsenault; Isabelle Fournier; Aida Ouangraoua; Darel J Hunting; Alan A Cohen; Christian R Landry; Michelle S Scott; Xavier Roucou
Journal:  Elife       Date:  2017-10-30       Impact factor: 8.140

3.  A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra.

Authors:  Qiang Kou; Si Wu; Nikola Tolic; Ljiljana Paša-Tolic; Yunlong Liu; Xiaowen Liu
Journal:  Bioinformatics       Date:  2017-05-01       Impact factor: 6.937

4.  Response to "Mass spectrometrists should search for all peptides, but assess only the ones they care about".

Authors:  William Stafford Noble; Uri Keich
Journal:  Nat Methods       Date:  2017-06-29       Impact factor: 28.547

5.  FusionPro, a Versatile Proteogenomic Tool for Identification of Novel Fusion Transcripts and Their Potential Translation Products in Cancer Cells.

Authors:  Chae-Yeon Kim; Keun Na; Saeram Park; Seul-Ki Jeong; Jin-Young Cho; Heon Shin; Min Jung Lee; Gyoonhee Han; Young-Ki Paik
Journal:  Mol Cell Proteomics       Date:  2019-06-17       Impact factor: 5.911

6.  Proteogenomic Annotation of Chinese Hamsters Reveals Extensive Novel Translation Events and Endogenous Retroviral Elements.

Authors:  Shangzhong Li; Seong Won Cha; Kelly Heffner; Deniz Baycin Hizal; Michael A Bowen; Raghothama Chaerkady; Robert N Cole; Vijay Tejwani; Prashant Kaushik; Michael Henry; Paula Meleady; Susan T Sharfstein; Michael J Betenbaugh; Vineet Bafna; Nathan E Lewis
Journal:  J Proteome Res       Date:  2019-05-08       Impact factor: 4.466

7.  Brute-Force Approach for Mass Spectrometry-Based Variant Peptide Identification in Proteogenomics without Personalized Genomic Data.

Authors:  Mark V Ivanov; Anna A Lobas; Lev I Levitsky; Sergei A Moshkovskii; Mikhail V Gorshkov
Journal:  J Am Soc Mass Spectrom       Date:  2018-01-03       Impact factor: 3.109

8.  SpectroGene: A Tool for Proteogenomic Annotations Using Top-Down Spectra.

Authors:  Mikhail Kolmogorov; Xiaowen Liu; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2015-12-17       Impact factor: 4.466

9.  Advanced Proteogenomic Analysis Reveals Multiple Peptide Mutations and Complex Immunoglobulin Peptides in Colon Cancer.

Authors:  Sunghee Woo; Seong Won Cha; Stefano Bonissone; Seungjin Na; David L Tabb; Pavel A Pevzner; Vineet Bafna
Journal:  J Proteome Res       Date:  2015-07-21       Impact factor: 4.466

10.  CrossHub: a tool for multi-way analysis of The Cancer Genome Atlas (TCGA) in the context of gene expression regulation mechanisms.

Authors:  George S Krasnov; Alexey A Dmitriev; Nataliya V Melnikova; Andrew R Zaretsky; Tatiana V Nasedkina; Alexander S Zasedatelev; Vera N Senchenko; Anna V Kudryavtseva
Journal:  Nucleic Acids Res       Date:  2016-01-14       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.