
Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and "resurrected" pseudogenes in the mouse genome.

Markus Brosch, Gary I Saunders, Adam Frankish, Mark O Collins, Lu Yu, James Wright, Ruth Verstraten, David J Adams, Jennifer Harrow, Jyoti S Choudhary, Tim Hubbard.

Abstract

Recent advances in proteomic mass spectrometry (MS) offer the chance to marry high-throughput peptide sequencing to transcript models, allowing the validation, refinement, and identification of new protein-coding loci. We present a novel pipeline that integrates highly sensitive and statistically robust peptide spectrum matching with genome-wide protein-coding predictions to perform, for the first time, large-scale gene validation and discovery in the mouse genome. By searching more than 10 million spectra, we validated 32%, 17%, and 7% of all protein-coding genes, exons, and splice boundaries, respectively. Moreover, we present strong evidence for multiple alternatively spliced translations from 53 genes and have uncovered 10 entirely novel protein-coding genes that are not covered by any mouse annotation data source. One of these novel protein-coding genes is a fusion that spans the Ins2 and Igf2 loci and produces a transcript encoding both the insulin II and the insulin-like growth factor 2-derived peptides. We also report nine processed pseudogenes that have unique peptide hits, demonstrating, for the first time, that they are not just transcribed but also translated and are therefore resurrected into new coding loci. This work not only highlights an important use of MS data in genome annotation but also provides unique insights into gene structure and propagation in the mouse genome. All of these data have since been used to improve the publicly available mouse annotation in both the Vega and Ensembl genome browsers (http://vega.sanger.ac.uk).

Year:  2011        PMID: 21460061      PMCID: PMC3083093          DOI: 10.1101/gr.114272.110

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  52 in total

1.  HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics.

Authors:  Rui M M Branca; Lukas M Orre; Henrik J Johansson; Viktor Granholm; Mikael Huss; Åsa Pérez-Bercoff; Jenny Forshed; Lukas Käll; Janne Lehtiö
Journal:  Nat Methods       Date:  2013-11-17       Impact factor: 28.547

2.  GAPP: A Proteogenomic Software for Genome Annotation and Global Profiling of Post-translational Modifications in Prokaryotes.

Authors:  Jia Zhang; Ming-Kun Yang; Honghui Zeng; Feng Ge
Journal:  Mol Cell Proteomics       Date:  2016-09-14       Impact factor: 5.911

3.  The discovery of novel protein-coding features in mouse genome based on mass spectrometry data.

Authors:  Xiao-Bin Xing; Qing-Run Li; Han Sun; Xing Fu; Fei Zhan; Xiu Huang; Jing Li; Chun-Lei Chen; Yu Shyr; Rong Zeng; Yi-Xue Li; Lu Xie
Journal:  Genomics       Date:  2011-08-04       Impact factor: 5.736

4.  From pseudogenes to proteins.

Authors:  Nicole Rusk
Journal:  Nat Methods       Date:  2011-06       Impact factor: 28.547

Review 5.  Next-generation proteomics: towards an integrative view of proteome dynamics.

Authors:  A F Maarten Altelaar; Javier Munoz; Albert J R Heck
Journal:  Nat Rev Genet       Date:  2012-12-04       Impact factor: 53.242

Review 6.  Protein analysis by shotgun/bottom-up proteomics.

Authors:  Yaoyang Zhang; Bryan R Fonslow; Bing Shan; Moon-Chang Baek; John R Yates
Journal:  Chem Rev       Date:  2013-02-26       Impact factor: 60.622

7.  Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Brian L Frey; Lloyd M Smith
Journal:  Mol Cell Proteomics       Date:  2013-04-29       Impact factor: 5.911

8.  Pseudogenes: Four Decades of Discovery.

Authors:  Leonardo Salmena
Journal:  Methods Mol Biol       Date:  2021

Review 9.  Progress and Challenges in Ocean Metaproteomics and Proposed Best Practices for Data Sharing.

Authors:  Mak A Saito; Erin M Bertrand; Megan E Duffy; David A Gaylord; Noelle A Held; William Judson Hervey; Robert L Hettich; Pratik D Jagtap; Michael G Janech; Danie B Kinkade; Dagmar H Leary; Matthew R McIlvin; Eli K Moore; Robert M Morris; Benjamin A Neely; Brook L Nunn; Jaclyn K Saunders; Adam I Shepherd; Nicholas I Symmonds; David A Walsh
Journal:  J Proteome Res       Date:  2019-03-12       Impact factor: 4.466

Review 10.  Genotype to phenotype via network analysis.

Authors:  Hannah Carter; Matan Hofree; Trey Ideker
Journal:  Curr Opin Genet Dev       Date:  2013-11-14       Impact factor: 5.578
