Literature DB >> 18096747

Comparative genomics beyond sequence-based alignments: RNA structures in the ENCODE regions.

Elfar Torarinsson1, Zizhen Yao, Eric D Wiklund, Jesper B Bramsen, Claus Hansen, Jørgen Kjems, Niels Tommerup, Walter L Ruzzo, Jan Gorodkin.   

Abstract

Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment methods to misalign, or even refuse to align, homologous ncRNAs, consequently obscuring that structural signal. We have used CMfinder, a structure-oriented local alignment tool, to search the ENCODE regions of vertebrate multiple alignments. In agreement with other studies, we find a large number of potential RNA structures in the ENCODE regions. We report 6587 candidate regions with an estimated false-positive rate of 50%. More intriguingly, many of these candidates may be better represented by alignments taking the RNA secondary structure into account than those based on primary sequence alone, often quite dramatically. For example, approximately one-quarter of our predicted motifs show revisions in >50% of their aligned positions. Furthermore, our results are strongly complementary to those discovered by sequence-alignment-based approaches--84% of our candidates are not covered by Washietl et al., increasing the number of ncRNA candidates in the ENCODE region by 32%. In a group of 11 ncRNA candidates that were tested by RT-PCR, 10 were confirmed to be present as RNA transcripts in human tissue, and most show evidence of significant differential expression across tissues. Our results broadly suggest caution in any analysis relying on multiple sequence alignments in less well-conserved regions, clearly support growing appreciation for the biological significance of ncRNAs, and strongly support the argument for considering RNA structure directly in any searches for these elements.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18096747      PMCID: PMC2203622          DOI: 10.1101/gr.6887408

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  47 in total

1.  Mapping of conserved RNA secondary structures predicts thousands of functional noncoding RNAs in the human genome.

Authors:  Stefan Washietl; Ivo L Hofacker; Melanie Lukasser; Alexander Hüttenhofer; Peter F Stadler
Journal:  Nat Biotechnol       Date:  2005-11       Impact factor: 54.908

Review 2.  Non-coding RNAs: new players in eukaryotic biology.

Authors:  Fabrício F Costa
Journal:  Gene       Date:  2005-09-12       Impact factor: 3.688

3.  Multiple structural alignment and clustering of RNA sequences.

Authors:  Elfar Torarinsson; Jakob H Havgaard; Jan Gorodkin
Journal:  Bioinformatics       Date:  2007-02-25       Impact factor: 6.937

4.  Identification and characterization of human non-coding RNAs with tissue-specific expression.

Authors:  Yasnory T F Sasaki; Miho Sano; Takashi Ideue; Taishin Kin; Kiyoshi Asai; Tetsuro Hirose
Journal:  Biochem Biophys Res Commun       Date:  2007-04-17       Impact factor: 3.575

5.  A novel brain-specific box C/D small nucleolar RNA processed from tandemly repeated introns of a noncoding RNA gene in rats.

Authors:  J Cavaillé; P Vitali; E Basyuk; A Hüttenhofer; J P Bachellerie
Journal:  J Biol Chem       Date:  2001-05-09       Impact factor: 5.157

6.  An RNA gene expressed during cortical development evolved rapidly in humans.

Authors:  Katherine S Pollard; Sofie R Salama; Nelle Lambert; Marie-Alexandra Lambot; Sandra Coppens; Jakob S Pedersen; Sol Katzman; Bryan King; Courtney Onodera; Adam Siepel; Andrew D Kern; Colette Dehay; Haller Igel; Manuel Ares; Pierre Vanderhaeghen; David Haussler
Journal:  Nature       Date:  2006-08-16       Impact factor: 49.962

7.  Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering.

Authors:  Sebastian Will; Kristin Reiche; Ivo L Hofacker; Peter F Stadler; Rolf Backofen
Journal:  PLoS Comput Biol       Date:  2007-02-22       Impact factor: 4.475

8.  Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints.

Authors:  Robin D Dowell; Sean R Eddy
Journal:  BMC Bioinformatics       Date:  2006-09-04       Impact factor: 3.169

9.  GENCODE: producing a reference annotation for ENCODE.

Authors:  Jennifer Harrow; France Denoeud; Adam Frankish; Alexandre Reymond; Chao-Kung Chen; Jacqueline Chrast; Julien Lagarde; James G R Gilbert; Roy Storey; David Swarbreck; Colette Rossier; Catherine Ucla; Tim Hubbard; Stylianos E Antonarakis; Roderic Guigo
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

10.  Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign.

Authors:  Arif Ozgun Harmanci; Gaurav Sharma; David H Mathews
Journal:  BMC Bioinformatics       Date:  2007-04-19       Impact factor: 3.169

View more
  54 in total

1.  LocARNA-P: accurate boundary prediction and improved detection of structural RNAs.

Authors:  Sebastian Will; Tejal Joshi; Ivo L Hofacker; Peter F Stadler; Rolf Backofen
Journal:  RNA       Date:  2012-03-26       Impact factor: 4.942

2.  Prediction and characterization of noncoding RNAs in C. elegans by integrating conservation, secondary structure, and high-throughput sequencing and array data.

Authors:  Zhi John Lu; Kevin Y Yip; Guilin Wang; Chong Shou; Ladeana W Hillier; Ekta Khurana; Ashish Agarwal; Raymond Auerbach; Joel Rozowsky; Chao Cheng; Masaomi Kato; David M Miller; Frank Slack; Michael Snyder; Robert H Waterston; Valerie Reinke; Mark B Gerstein
Journal:  Genome Res       Date:  2010-12-22       Impact factor: 9.043

Review 3.  Noncoding RNA in development.

Authors:  Paulo P Amaral; John S Mattick
Journal:  Mamm Genome       Date:  2008-10-07       Impact factor: 2.957

4.  Specific alignment of structured RNA: stochastic grammars and sequence annealing.

Authors:  Robert K Bradley; Lior Pachter; Ian Holmes
Journal:  Bioinformatics       Date:  2008-09-16       Impact factor: 6.937

5.  Improved RNA secondary structure prediction by maximizing expected pair accuracy.

Authors:  Zhi John Lu; Jason W Gloor; David H Mathews
Journal:  RNA       Date:  2009-08-24       Impact factor: 4.942

6.  The tedious task of finding homologous noncoding RNA genes.

Authors:  Peter Menzel; Jan Gorodkin; Peter F Stadler
Journal:  RNA       Date:  2009-10-27       Impact factor: 4.942

7.  Optimizing RNA structures by sequence extensions using RNAcop.

Authors:  Nikolai Hecker; Mikkel Christensen-Dalsgaard; Stefan E Seemann; Jakob H Havgaard; Peter F Stadler; Ivo L Hofacker; Henrik Nielsen; Jan Gorodkin
Journal:  Nucleic Acids Res       Date:  2015-08-17       Impact factor: 16.971

8.  Diagnosing and Treating Nervous System Disorders by Targeting Novel Classes of Non-coding RNAs.

Authors:  Irfan A Qureshi; Mark F Mehler
Journal:  Int Drug Discov       Date:  2011 Jun-Jul

9.  Evolutionary modeling and prediction of non-coding RNAs in Drosophila.

Authors:  Robert K Bradley; Andrew V Uzilov; Mitchell E Skinner; Yuri R Bendaña; Lars Barquist; Ian Holmes
Journal:  PLoS One       Date:  2009-08-11       Impact factor: 3.240

10.  Linking fold, function and phylogeny: a comparative genomics view on protein (domain) evolution.

Authors:  Aartjan J W Te Velthuis; Christoph P Bagowski
Journal:  Curr Genomics       Date:  2008-04       Impact factor: 2.236

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.