Literature DB >> 21357752

RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data.

Stefan Washietl1, Sven Findeiss, Stephan A Müller, Stefan Kalkhof, Martin von Bergen, Ivo L Hofacker, Peter F Stadler, Nick Goldman.   

Abstract

With the availability of genome-wide transcription data and massive comparative sequencing, the discrimination of coding from noncoding RNAs and the assessment of coding potential in evolutionarily conserved regions arose as a core analysis task. Here we present RNAcode, a program to detect coding regions in multiple sequence alignments that is optimized for emerging applications not covered by current protein gene-finding software. Our algorithm combines information from nucleotide substitution and gap patterns in a unified framework and also deals with real-life issues such as alignment and sequencing errors. It uses an explicit statistical model with no machine learning component and can therefore be applied "out of the box," without any training, to data from all domains of life. We describe the RNAcode method and apply it in combination with mass spectrometry experiments to predict and confirm seven novel short peptides in Escherichia coli and to analyze the coding potential of RNAs previously annotated as "noncoding." RNAcode is open source software and available for all major platforms at http://wash.github.com/rnacode.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21357752      PMCID: PMC3062170          DOI: 10.1261/rna.2536111

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


  66 in total

1.  Metatranscriptomics reveals unique microbial small RNAs in the ocean's water column.

Authors:  Yanmei Shi; Gene W Tyson; Edward F DeLong
Journal:  Nature       Date:  2009-05-14       Impact factor: 49.962

2.  RNAz 2.0: improved noncoding RNA detection.

Authors:  Andreas R Gruber; Sven Findeiß; Stefan Washietl; Ivo L Hofacker; Peter F Stadler
Journal:  Pac Symp Biocomput       Date:  2010

3.  CRITICA: coding region identification tool invoking comparative analysis.

Authors:  J H Badger; G J Olsen
Journal:  Mol Biol Evol       Date:  1999-04       Impact factor: 16.240

4.  The transcription unit architecture of the Escherichia coli genome.

Authors:  Byung-Kwan Cho; Karsten Zengler; Yu Qiu; Young Seoub Park; Eric M Knight; Christian L Barrett; Yuan Gao; Bernhard Ø Palsson
Journal:  Nat Biotechnol       Date:  2009-11-01       Impact factor: 54.908

5.  Small membrane proteins found by comparative genomics and ribosome binding site models.

Authors:  Matthew R Hemm; Brian J Paul; Thomas D Schneider; Gisela Storz; Kenneth E Rudd
Journal:  Mol Microbiol       Date:  2008-12       Impact factor: 3.501

6.  Detection and identification of low-mass peptides and proteins from solvent suspensions of Escherichia coli by high performance liquid chromatography fractionation and matrix-assisted laser desorption/ionization mass spectrometry.

Authors:  Y Dai; L Li; D C Roser; S R Long
Journal:  Rapid Commun Mass Spectrom       Date:  1999       Impact factor: 2.419

7.  Identification of candidate structured RNAs in the marine organism 'Candidatus Pelagibacter ubique'.

Authors:  Michelle M Meyer; Tyler D Ames; Daniel P Smith; Zasha Weinberg; Michael S Schwalbach; Stephen J Giovannoni; Ronald R Breaker
Journal:  BMC Genomics       Date:  2009-06-16       Impact factor: 3.969

8.  The Universal Protein Resource (UniProt) in 2010.

Authors: 
Journal:  Nucleic Acids Res       Date:  2009-10-20       Impact factor: 16.971

9.  nGASP--the nematode genome annotation assessment project.

Authors:  Avril Coghlan; Tristan J Fiedler; Sheldon J McKay; Paul Flicek; Todd W Harris; Darin Blasiar; Lincoln D Stein
Journal:  BMC Bioinformatics       Date:  2008-12-19       Impact factor: 3.169

10.  The UCSC Genome Browser Database: update 2009.

Authors:  R M Kuhn; D Karolchik; A S Zweig; T Wang; K E Smith; K R Rosenbloom; B Rhead; B J Raney; A Pohl; M Pheasant; L Meyer; F Hsu; A S Hinrichs; R A Harte; B Giardine; P Fujita; M Diekhans; T Dreszer; H Clawson; G P Barber; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2008-11-07       Impact factor: 16.971

View more
  86 in total

Review 1.  Long non-coding RNAs and cancer: a new frontier of translational research?

Authors:  R Spizzo; M I Almeida; A Colombatti; G A Calin
Journal:  Oncogene       Date:  2012-01-23       Impact factor: 9.867

2.  Hidden treasures in unspliced EST data.

Authors:  J Engelhardt; P F Stadler
Journal:  Theory Biosci       Date:  2012-04-08       Impact factor: 1.919

3.  The Escherichia coli CydX protein is a member of the CydAB cytochrome bd oxidase complex and is required for cytochrome bd oxidase activity.

Authors:  Caitlin E VanOrsdel; Shantanu Bhatt; Rondine J Allen; Evan P Brenner; Jessica J Hobson; Aqsa Jamil; Brittany M Haynes; Allyson M Genson; Matthew R Hemm
Journal:  J Bacteriol       Date:  2013-06-07       Impact factor: 3.490

Review 4.  Noncoding RNA and colorectal cancer: its epigenetic role.

Authors:  Yoshiaki Kita; Keiichi Yonemori; Yusaku Osako; Kenji Baba; Shinichiro Mori; Kosei Maemura; Shoji Natsugoe
Journal:  J Hum Genet       Date:  2016-06-09       Impact factor: 3.172

5.  Large-Scale Analyses of Human Microbiomes Reveal Thousands of Small, Novel Genes.

Authors:  Hila Sberro; Brayon J Fremin; Soumaya Zlitni; Fredrik Edfors; Nicholas Greenfield; Michael P Snyder; Georgios A Pavlopoulos; Nikos C Kyrpides; Ami S Bhatt
Journal:  Cell       Date:  2019-08-08       Impact factor: 41.582

6.  Coding sequence density estimation via topological pressure.

Authors:  David Koslicki; Daniel J Thompson
Journal:  J Math Biol       Date:  2014-01-22       Impact factor: 2.259

7.  An atlas of human long non-coding RNAs with accurate 5' ends.

Authors:  Chung-Chau Hon; Jordan A Ramilowski; Jayson Harshbarger; Nicolas Bertin; Owen J L Rackham; Julian Gough; Elena Denisenko; Sebastian Schmeier; Thomas M Poulsen; Jessica Severin; Marina Lizio; Hideya Kawaji; Takeya Kasukawa; Masayoshi Itoh; A Maxwell Burroughs; Shohei Noma; Sarah Djebali; Tanvir Alam; Yulia A Medvedeva; Alison C Testa; Leonard Lipovich; Chi-Wai Yip; Imad Abugessaisa; Mickaël Mendez; Akira Hasegawa; Dave Tang; Timo Lassmann; Peter Heutink; Magda Babina; Christine A Wells; Soichi Kojima; Yukio Nakamura; Harukazu Suzuki; Carsten O Daub; Michiel J L de Hoon; Erik Arner; Yoshihide Hayashizaki; Piero Carninci; Alistair R R Forrest
Journal:  Nature       Date:  2017-03-01       Impact factor: 49.962

8.  CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features.

Authors:  Yu-Jian Kang; De-Chang Yang; Lei Kong; Mei Hou; Yu-Qi Meng; Liping Wei; Ge Gao
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

Review 9.  Computational analysis of noncoding RNAs.

Authors:  Stefan Washietl; Sebastian Will; David A Hendrix; Loyal A Goff; John L Rinn; Bonnie Berger; Manolis Kellis
Journal:  Wiley Interdiscip Rev RNA       Date:  2012-09-18       Impact factor: 9.957

10.  Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm.

Authors:  Supatcha Lertampaiporn; Chinae Thammarongtham; Chakarida Nukoolkit; Boonserm Kaewkamnerdpong; Marasri Ruengjitchatchawalya
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.