Literature DB >> 24142994

An automated proteogenomic method uses mass spectrometry to reveal novel genes in Zea mays.

Natalie E Castellana1, Zhouxin Shen, Yupeng He, Justin W Walley, California Jack Cassidy, Steven P Briggs, Vineet Bafna.   

Abstract

New technologies in genomics and proteomics have influenced the emergence of proteogenomics, a field at the confluence of genomics, transcriptomics, and proteomics. First generation proteogenomic toolkits employ peptide mass spectrometry to identify novel protein coding regions. We extend first generation proteogenomic tools to achieve greater accuracy and enable the analysis of large, complex genomes. We apply our pipeline to Zea mays, which has a genome comparable in size to human. Our pipeline begins with the comparison of mass spectra to a putative translation of the genome. We select novel peptides, those that match a region of the genome that was not previously known to be protein coding, for grouping into refinement events. We present a novel, probabilistic framework for evaluating the accuracy of each event. Our calculated event probability, or eventProb, considers the number of supporting peptides and spectra, and the quality of each supporting peptide-spectrum match. Our pipeline predicts 165 novel protein-coding genes and proposes updated models for 741 additional genes.

Entities:  

Mesh:

Year:  2013        PMID: 24142994      PMCID: PMC3879611          DOI: 10.1074/mcp.M113.031260

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  32 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  Endogenous siRNAs derived from a pair of natural cis-antisense transcripts regulate salt tolerance in Arabidopsis.

Authors:  Omar Borsani; Jianhua Zhu; Paul E Verslues; Ramanjulu Sunkar; Jian-Kang Zhu
Journal:  Cell       Date:  2005-12-29       Impact factor: 41.582

3.  Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes.

Authors:  Nitin Gupta; Jamal Benhamida; Vipul Bhargava; Daniel Goodman; Elisabeth Kain; Ian Kerman; Ngan Nguyen; Noah Ollikainen; Jesse Rodriguez; Jian Wang; Mary S Lipton; Margaret Romine; Vineet Bafna; Richard D Smith; Pavel A Pevzner
Journal:  Genome Res       Date:  2008-04-21       Impact factor: 9.043

4.  Using geneid to identify genes.

Authors:  Enrique Blanco; Genís Parra; Roderic Guigó
Journal:  Curr Protoc Bioinformatics       Date:  2007-06

Review 5.  Building and searching tandem mass spectral libraries for peptide identification.

Authors:  Henry Lam
Journal:  Mol Cell Proteomics       Date:  2011-09-06       Impact factor: 5.911

6.  Discovery and revision of Arabidopsis genes by proteogenomics.

Authors:  Natalie E Castellana; Samuel H Payne; Zhouxin Shen; Mario Stanke; Vineet Bafna; Steven P Briggs
Journal:  Proc Natl Acad Sci U S A       Date:  2008-12-19       Impact factor: 11.205

7.  Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.

Authors:  Sangtae Kim; Nitin Gupta; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2008-07-03       Impact factor: 4.466

Review 8.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

9.  Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations.

Authors:  Gennifer E Merrihew; Colleen Davis; Brent Ewing; Gary Williams; Lukas Käll; Barbara E Frewen; William Stafford Noble; Phil Green; James H Thomas; Michael J MacCoss
Journal:  Genome Res       Date:  2008-07-24       Impact factor: 9.043

10.  False discovery rates in spectral identification.

Authors:  Kyowon Jeong; Sangtae Kim; Nuno Bandeira
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

View more
  28 in total

1.  Tissue-specific Proteogenomic Analysis of Plutella xylostella Larval Midgut Using a Multialgorithm Pipeline.

Authors:  Xun Zhu; Shangbo Xie; Jean Armengaud; Wen Xie; Zhaojiang Guo; Shi Kang; Qingjun Wu; Shaoli Wang; Jixing Xia; Rongjun He; Youjun Zhang
Journal:  Mol Cell Proteomics       Date:  2016-02-22       Impact factor: 5.911

2.  Proteogenomic strategies for identification of aberrant cancer peptides using large-scale next-generation sequencing data.

Authors:  Sunghee Woo; Seong Won Cha; Seungjin Na; Clark Guest; Tao Liu; Richard D Smith; Karin D Rodland; Samuel Payne; Vineet Bafna
Journal:  Proteomics       Date:  2014-11-17       Impact factor: 3.984

3.  Proteogenomics: emergence and promise.

Authors:  Sam Faulkner; Matthew D Dun; Hubert Hondermarck
Journal:  Cell Mol Life Sci       Date:  2015-01-22       Impact factor: 9.261

4.  Multi-omic data analysis using Galaxy.

Authors:  Jorrit Boekel; John M Chilton; Ira R Cooke; Peter L Horvatovich; Pratik D Jagtap; Lukas Käll; Janne Lehtiö; Pieter Lukasse; Perry D Moerland; Timothy J Griffin
Journal:  Nat Biotechnol       Date:  2015-02       Impact factor: 54.908

5.  Proteogenomics of Gammarus fossarum to document the reproductive system of amphipods.

Authors:  Judith Trapp; Olivier Geffard; Gilles Imbert; Jean-Charles Gaillard; Anne-Hélène Davin; Arnaud Chaumot; Jean Armengaud
Journal:  Mol Cell Proteomics       Date:  2014-10-07       Impact factor: 5.911

6.  Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.

Authors:  Hirehally Basavarajegowda Mahesh; Pratigya Subba; Jayshree Advani; Meghana Deepak Shirke; Ramya Malarini Loganathan; Shankara Lingu Chandana; Siddappa Shilpa; Oishi Chatterjee; Sneha Maria Pinto; Thottethodi Subrahmanya Keshava Prasad; Malali Gowda
Journal:  Plant Physiol       Date:  2018-02-12       Impact factor: 8.340

7.  Full-Length Transcript-Based Proteogenomics of Rice Improves Its Genome and Proteome Annotation.

Authors:  Mo-Xian Chen; Fu-Yuan Zhu; Bei Gao; Kai-Long Ma; Youjun Zhang; Alisdair R Fernie; Xi Chen; Lei Dai; Neng-Hui Ye; Xue Zhang; Yuan Tian; Di Zhang; Shi Xiao; Jianhua Zhang; Ying-Gao Liu
Journal:  Plant Physiol       Date:  2019-12-19       Impact factor: 8.340

8.  Advanced Proteogenomic Analysis Reveals Multiple Peptide Mutations and Complex Immunoglobulin Peptides in Colon Cancer.

Authors:  Sunghee Woo; Seong Won Cha; Stefano Bonissone; Seungjin Na; David L Tabb; Pavel A Pevzner; Vineet Bafna
Journal:  J Proteome Res       Date:  2015-07-21       Impact factor: 4.466

9.  An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer.

Authors:  Kelly V Ruggles; Zuojian Tang; Xuya Wang; Himanshu Grover; Manor Askenazi; Jennifer Teubl; Song Cao; Michael D McLellan; Karl R Clauser; David L Tabb; Philipp Mertins; Robbert Slebos; Petra Erdmann-Gilmore; Shunqiang Li; Harsha P Gunawardena; Ling Xie; Tao Liu; Jian-Ying Zhou; Shisheng Sun; Katherine A Hoadley; Charles M Perou; Xian Chen; Sherri R Davies; Christopher A Maher; Christopher R Kinsinger; Karen D Rodland; Hui Zhang; Zhen Zhang; Li Ding; R Reid Townsend; Henry Rodriguez; Daniel Chan; Richard D Smith; Daniel C Liebler; Steven A Carr; Samuel Payne; Matthew J Ellis; David Fenyő
Journal:  Mol Cell Proteomics       Date:  2015-12-02       Impact factor: 5.911

10.  Computational identification of micro-structural variations and their proteogenomic consequences in cancer.

Authors:  Yen-Yi Lin; Alexander Gawronski; Faraz Hach; Sujun Li; Ibrahim Numanagic; Iman Sarrafi; Swati Mishra; Andrew McPherson; Colin C Collins; Milan Radovich; Haixu Tang; S Cenk Sahinalp
Journal:  Bioinformatics       Date:  2018-05-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.