Literature DB >> 21840390

The discovery of novel protein-coding features in mouse genome based on mass spectrometry data.

Xiao-Bin Xing1, Qing-Run Li, Han Sun, Xing Fu, Fei Zhan, Xiu Huang, Jing Li, Chun-Lei Chen, Yu Shyr, Rong Zeng, Yi-Xue Li, Lu Xie.   

Abstract

Identifying protein-coding genes in eukaryotic genomes remains a challenge in post-genome era due to the complex gene models. We applied a proteogenomics strategy to detect un-annotated protein-coding regions in mouse genome. High-accuracy tandem mass spectrometry (MS/MS) data from diverse mouse samples were generated by LTQ-Orbitrap mass spectrometer in house. Two searchable diagnostic proteomic datasets were constructed, one with all possible encoding exon junctions, and the other with all putative encoding exons, for the discovery of novel exon splicing events and novel uninterrupted protein-coding regions. Altogether 29,586 unique peptides were identified. Aligning backwards to the mouse genome, the translation of 4471 annotated genes was validated by the known peptides; and 172 genic events were defined in mouse genome by the novel peptides. The approach in the current work can provide substantial evidences for eukaryote genome annotation in encoding genes. Copyright Â
© 2011 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21840390      PMCID: PMC5757624          DOI: 10.1016/j.ygeno.2011.07.005

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  35 in total

1.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

2.  Mass spectrometry-based prokaryote gene annotation.

Authors:  Yoko Ishino; Hitomi Okada; Masahiko Ikeuchi; Hisaaki Taniguchi
Journal:  Proteomics       Date:  2007-11       Impact factor: 3.984

3.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

4.  Discovery and revision of Arabidopsis genes by proteogenomics.

Authors:  Natalie E Castellana; Samuel H Payne; Zhouxin Shen; Mario Stanke; Vineet Bafna; Steven P Briggs
Journal:  Proc Natl Acad Sci U S A       Date:  2008-12-19       Impact factor: 11.205

5.  Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations.

Authors:  Gennifer E Merrihew; Colleen Davis; Brent Ewing; Gary Williams; Lukas Käll; Barbara E Frewen; William Stafford Noble; Phil Green; James H Thomas; Michael J MacCoss
Journal:  Genome Res       Date:  2008-07-24       Impact factor: 9.043

6.  Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and "resurrected" pseudogenes in the mouse genome.

Authors:  Markus Brosch; Gary I Saunders; Adam Frankish; Mark O Collins; Lu Yu; James Wright; Ruth Verstraten; David J Adams; Jennifer Harrow; Jyoti S Choudhary; Tim Hubbard
Journal:  Genome Res       Date:  2011-04-01       Impact factor: 9.043

7.  Novel peptide identification from tandem mass spectra using ESTs and sequence database compression.

Authors:  Nathan J Edwards
Journal:  Mol Syst Biol       Date:  2007-04-17       Impact factor: 11.429

8.  AUGUSTUS: ab initio prediction of alternative transcripts.

Authors:  Mario Stanke; Oliver Keller; Irfan Gunduz; Alec Hayes; Stephan Waack; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

9.  BASys: a web server for automated bacterial genome annotation.

Authors:  Gary H Van Domselaar; Paul Stothard; Savita Shrivastava; Joseph A Cruz; AnChi Guo; Xiaoli Dong; Paul Lu; Duane Szafron; Russ Greiner; David S Wishart
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  Ensembl's 10th year.

Authors:  Paul Flicek; Bronwen L Aken; Benoit Ballester; Kathryn Beal; Eugene Bragin; Simon Brent; Yuan Chen; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Julio Fernandez-Banet; Leo Gordon; Stefan Gräf; Syed Haider; Martin Hammond; Kerstin Howe; Andrew Jenkinson; Nathan Johnson; Andreas Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Felix Kokocinski; Gautier Koscielny; Eugene Kulesha; Daniel Lawson; Ian Longden; Tim Massingham; William McLaren; Karine Megy; Bert Overduin; Bethan Pritchard; Daniel Rios; Magali Ruffier; Michael Schuster; Guy Slater; Damian Smedley; Giulietta Spudich; Y Amy Tang; Stephen Trevanion; Albert Vilella; Jan Vogel; Simon White; Steven P Wilder; Amonida Zadissa; Ewan Birney; Fiona Cunningham; Ian Dunham; Richard Durbin; Xosé M Fernández-Suarez; Javier Herrero; Tim J P Hubbard; Anne Parker; Glenn Proctor; James Smith; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

View more
  8 in total

1.  Identification of gene fusions from human lung cancer mass spectrometry data.

Authors:  Han Sun; Xiaobin Xing; Jing Li; Fengli Zhou; Yunqin Chen; Ying He; Wei Li; Guangwu Wei; Xiao Chang; Jia Jia; Yixue Li; Lu Xie
Journal:  BMC Genomics       Date:  2013-12-09       Impact factor: 3.969

2.  Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Brian L Frey; Lloyd M Smith
Journal:  Mol Cell Proteomics       Date:  2013-04-29       Impact factor: 5.911

3.  Onco-proteogenomics: cancer proteomics joins forces with genomics.

Authors:  Javier A Alfaro; Ankit Sinha; Thomas Kislinger; Paul C Boutros
Journal:  Nat Methods       Date:  2014-11       Impact factor: 28.547

4.  An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer.

Authors:  Kelly V Ruggles; Zuojian Tang; Xuya Wang; Himanshu Grover; Manor Askenazi; Jennifer Teubl; Song Cao; Michael D McLellan; Karl R Clauser; David L Tabb; Philipp Mertins; Robbert Slebos; Petra Erdmann-Gilmore; Shunqiang Li; Harsha P Gunawardena; Ling Xie; Tao Liu; Jian-Ying Zhou; Shisheng Sun; Katherine A Hoadley; Charles M Perou; Xian Chen; Sherri R Davies; Christopher A Maher; Christopher R Kinsinger; Karen D Rodland; Hui Zhang; Zhen Zhang; Li Ding; R Reid Townsend; Henry Rodriguez; Daniel Chan; Richard D Smith; Daniel C Liebler; Steven A Carr; Samuel Payne; Matthew J Ellis; David Fenyő
Journal:  Mol Cell Proteomics       Date:  2015-12-02       Impact factor: 5.911

Review 5.  Integrating genomic, transcriptomic, and interactome data to improve Peptide and protein identification in shotgun proteomics.

Authors:  Xiaojing Wang; Bing Zhang
Journal:  J Proteome Res       Date:  2014-05-12       Impact factor: 4.466

6.  Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

Authors:  Yafeng Zhu; Pär G Engström; Christian Tellgren-Roth; Charles D Baudo; John C Kennell; Sheng Sun; R Blake Billmyre; Markus S Schröder; Anna Andersson; Tina Holm; Benjamin Sigurgeirsson; Guangxi Wu; Sundar Ram Sankaranarayanan; Rahul Siddharthan; Kaustuv Sanyal; Joakim Lundeberg; Björn Nystedt; Teun Boekhout; Thomas L Dawson; Joseph Heitman; Annika Scheynius; Janne Lehtiö
Journal:  Nucleic Acids Res       Date:  2017-03-17       Impact factor: 16.971

7.  Impact of Alternative Splicing on the Human Proteome.

Authors:  Yansheng Liu; Mar Gonzàlez-Porta; Sergio Santos; Alvis Brazma; John C Marioni; Ruedi Aebersold; Ashok R Venkitaraman; Vihandha O Wickramasinghe
Journal:  Cell Rep       Date:  2017-08-01       Impact factor: 9.423

8.  ProGeo-neo: a customized proteogenomic workflow for neoantigen prediction and selection.

Authors:  Yuyu Li; Guangzhi Wang; Xiaoxiu Tan; Jian Ouyang; Menghuan Zhang; Xiaofeng Song; Qi Liu; Qibin Leng; Lanming Chen; Lu Xie
Journal:  BMC Med Genomics       Date:  2020-04-03       Impact factor: 3.063

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.