| Literature DB >> 25510491 |
Jeroen Crappé1, Elvis Ndah2, Alexander Koch1, Sandra Steyaert1, Daria Gawron3, Sarah De Keulenaer1, Ellen De Meester1, Tim De Meyer1, Wim Van Criekinge1, Petra Van Damme3, Gerben Menschaert4.
Abstract
An increasing amount of studies integrate mRNA sequencing data into MS-based proteomics to complement the translation product search space. However, several factors, including extensive regulation of mRNA translation and the need for three- or six-frame-translation, impede the use of mRNA-seq data for the construction of a protein sequence search database. With that in mind, we developed the PROTEOFORMER tool that automatically processes data of the recently developed ribosome profiling method (sequencing of ribosome-protected mRNA fragments), resulting in genome-wide visualization of ribosome occupancy. Our tool also includes a translation initiation site calling algorithm allowing the delineation of the open reading frames (ORFs) of all translation products. A complete protein synthesis-based sequence database can thus be compiled for mass spectrometry-based identification. This approach increases the overall protein identification rates with 3% and 11% (improved and new identifications) for human and mouse, respectively, and enables proteome-wide detection of 5'-extended proteoforms, upstream ORF translation and near-cognate translation start sites. The PROTEOFORMER tool is available as a stand-alone pipeline and has been implemented in the galaxy framework for ease of use.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25510491 PMCID: PMC4357689 DOI: 10.1093/nar/gku1283
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971