Literature DB >> 21177971

Prediction and characterization of noncoding RNAs in C. elegans by integrating conservation, secondary structure, and high-throughput sequencing and array data.

Zhi John Lu1, Kevin Y Yip, Guilin Wang, Chong Shou, Ladeana W Hillier, Ekta Khurana, Ashish Agarwal, Raymond Auerbach, Joel Rozowsky, Chao Cheng, Masaomi Kato, David M Miller, Frank Slack, Michael Snyder, Robert H Waterston, Valerie Reinke, Mark B Gerstein.   

Abstract

We present an integrative machine learning method, incRNA, for whole-genome identification of noncoding RNAs (ncRNAs). It combines a large amount of expression data, RNA secondary-structure stability, and evolutionary conservation at the protein and nucleic-acid level. Using the incRNA model and data from the modENCODE consortium, we are able to separate known C. elegans ncRNAs from coding sequences and other genomic elements with a high level of accuracy (97% AUC on an independent validation set), and find more than 7000 novel ncRNA candidates, among which more than 1000 are located in the intergenic regions of C. elegans genome. Based on the validation set, we estimate that 91% of the approximately 7000 novel ncRNA candidates are true positives. We then analyze 15 novel ncRNA candidates by RT-PCR, detecting the expression for 14. In addition, we characterize the properties of all the novel ncRNA candidates and find that they have distinct expression patterns across developmental stages and tend to use novel RNA structural families. We also find that they are often targeted by specific transcription factors (∼59% of intergenic novel ncRNA candidates). Overall, our study identifies many new potential ncRNAs in C. elegans and provides a method that can be adapted to other organisms.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21177971      PMCID: PMC3032931          DOI: 10.1101/gr.110189.110

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  39 in total

Review 1.  MicroRNAs: genomics, biogenesis, mechanism, and function.

Authors:  David P Bartel
Journal:  Cell       Date:  2004-01-23       Impact factor: 41.582

2.  MSARI: multiple sequence alignments for statistical detection of RNA secondary structure.

Authors:  Alex Coventry; Daniel J Kleitman; Bonnie Berger
Journal:  Proc Natl Acad Sci U S A       Date:  2004-08-10       Impact factor: 11.205

3.  Fast and reliable prediction of noncoding RNAs.

Authors:  Stefan Washietl; Ivo L Hofacker; Peter F Stadler
Journal:  Proc Natl Acad Sci U S A       Date:  2005-01-21       Impact factor: 11.205

4.  Accurate multiplex polony sequencing of an evolved bacterial genome.

Authors:  Jay Shendure; Gregory J Porreca; Nikos B Reppas; Xiaoxia Lin; John P McCutcheon; Abraham M Rosenbaum; Michael D Wang; Kun Zhang; Robi D Mitra; George M Church
Journal:  Science       Date:  2005-08-04       Impact factor: 47.728

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

6.  Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.

Authors:  Mark B Gerstein; Zhi John Lu; Eric L Van Nostrand; Chao Cheng; Bradley I Arshinoff; Tao Liu; Kevin Y Yip; Rebecca Robilotto; Andreas Rechtsteiner; Kohta Ikegami; Pedro Alves; Aurelien Chateigner; Marc Perry; Mitzi Morris; Raymond K Auerbach; Xin Feng; Jing Leng; Anne Vielle; Wei Niu; Kahn Rhrissorrakrai; Ashish Agarwal; Roger P Alexander; Galt Barber; Cathleen M Brdlik; Jennifer Brennan; Jeremy Jean Brouillet; Adrian Carr; Ming-Sin Cheung; Hiram Clawson; Sergio Contrino; Luke O Dannenberg; Abby F Dernburg; Arshad Desai; Lindsay Dick; Andréa C Dosé; Jiang Du; Thea Egelhofer; Sevinc Ercan; Ghia Euskirchen; Brent Ewing; Elise A Feingold; Reto Gassmann; Peter J Good; Phil Green; Francois Gullier; Michelle Gutwein; Mark S Guyer; Lukas Habegger; Ting Han; Jorja G Henikoff; Stefan R Henz; Angie Hinrichs; Heather Holster; Tony Hyman; A Leo Iniguez; Judith Janette; Morten Jensen; Masaomi Kato; W James Kent; Ellen Kephart; Vishal Khivansara; Ekta Khurana; John K Kim; Paulina Kolasinska-Zwierz; Eric C Lai; Isabel Latorre; Amber Leahey; Suzanna Lewis; Paul Lloyd; Lucas Lochovsky; Rebecca F Lowdon; Yaniv Lubling; Rachel Lyne; Michael MacCoss; Sebastian D Mackowiak; Marco Mangone; Sheldon McKay; Desirea Mecenas; Gennifer Merrihew; David M Miller; Andrew Muroyama; John I Murray; Siew-Loon Ooi; Hoang Pham; Taryn Phippen; Elicia A Preston; Nikolaus Rajewsky; Gunnar Rätsch; Heidi Rosenbaum; Joel Rozowsky; Kim Rutherford; Peter Ruzanov; Mihail Sarov; Rajkumar Sasidharan; Andrea Sboner; Paul Scheid; Eran Segal; Hyunjin Shin; Chong Shou; Frank J Slack; Cindie Slightam; Richard Smith; William C Spencer; E O Stinson; Scott Taing; Teruaki Takasaki; Dionne Vafeados; Ksenia Voronina; Guilin Wang; Nicole L Washington; Christina M Whittle; Beijing Wu; Koon-Kiu Yan; Georg Zeller; Zheng Zha; Mei Zhong; Xingliang Zhou; Julie Ahringer; Susan Strome; Kristin C Gunsalus; Gos Micklem; X Shirley Liu; Valerie Reinke; Stuart K Kim; LaDeana W Hillier; Steven Henikoff; Fabio Piano; Michael Snyder; Lincoln Stein; Jason D Lieb; Robert H Waterston
Journal:  Science       Date:  2010-12-22       Impact factor: 47.728

7.  Widespread selection for local RNA secondary structure in coding regions of bacterial genes.

Authors:  Luba Katz; Christopher B Burge
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

8.  ddbRNA: detection of conserved secondary structures in multiple alignments.

Authors:  Diego di Bernardo; Thomas Down; Tim Hubbard
Journal:  Bioinformatics       Date:  2003-09-01       Impact factor: 6.937

9.  The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs.

Authors:  Peter Schattner; Angela N Brooks; Todd M Lowe
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  Noncoding RNA gene detection using comparative sequence analysis.

Authors:  E Rivas; S R Eddy
Journal:  BMC Bioinformatics       Date:  2001-10-10       Impact factor: 3.169

View more
  37 in total

1.  A machine learning approach to identify hydrogenosomal proteins in Trichomonas vaginalis.

Authors:  David Burstein; Sven B Gould; Verena Zimorski; Thorsten Kloesges; Fuat Kiosse; Peter Major; William F Martin; Tal Pupko; Tal Dagan
Journal:  Eukaryot Cell       Date:  2011-12-02

2.  A spatial and temporal map of C. elegans gene expression.

Authors:  W Clay Spencer; Georg Zeller; Joseph D Watson; Stefan R Henz; Kathie L Watkins; Rebecca D McWhirter; Sarah Petersen; Vipin T Sreedharan; Christian Widmer; Jeanyoung Jo; Valerie Reinke; Lisa Petrella; Susan Strome; Stephen E Von Stetina; Menachem Katz; Shai Shaham; Gunnar Rätsch; David M Miller
Journal:  Genome Res       Date:  2010-12-22       Impact factor: 9.043

3.  A differential sequencing-based analysis of the C. elegans noncoding transcriptome.

Authors:  Tengfei Xiao; Yunfei Wang; Huaxia Luo; Lihui Liu; Guifeng Wei; Xiaowei Chen; Yu Sun; Xiaomin Chen; Geir Skogerbø; Runsheng Chen
Journal:  RNA       Date:  2012-02-16       Impact factor: 4.942

4.  Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.

Authors:  Mark B Gerstein; Zhi John Lu; Eric L Van Nostrand; Chao Cheng; Bradley I Arshinoff; Tao Liu; Kevin Y Yip; Rebecca Robilotto; Andreas Rechtsteiner; Kohta Ikegami; Pedro Alves; Aurelien Chateigner; Marc Perry; Mitzi Morris; Raymond K Auerbach; Xin Feng; Jing Leng; Anne Vielle; Wei Niu; Kahn Rhrissorrakrai; Ashish Agarwal; Roger P Alexander; Galt Barber; Cathleen M Brdlik; Jennifer Brennan; Jeremy Jean Brouillet; Adrian Carr; Ming-Sin Cheung; Hiram Clawson; Sergio Contrino; Luke O Dannenberg; Abby F Dernburg; Arshad Desai; Lindsay Dick; Andréa C Dosé; Jiang Du; Thea Egelhofer; Sevinc Ercan; Ghia Euskirchen; Brent Ewing; Elise A Feingold; Reto Gassmann; Peter J Good; Phil Green; Francois Gullier; Michelle Gutwein; Mark S Guyer; Lukas Habegger; Ting Han; Jorja G Henikoff; Stefan R Henz; Angie Hinrichs; Heather Holster; Tony Hyman; A Leo Iniguez; Judith Janette; Morten Jensen; Masaomi Kato; W James Kent; Ellen Kephart; Vishal Khivansara; Ekta Khurana; John K Kim; Paulina Kolasinska-Zwierz; Eric C Lai; Isabel Latorre; Amber Leahey; Suzanna Lewis; Paul Lloyd; Lucas Lochovsky; Rebecca F Lowdon; Yaniv Lubling; Rachel Lyne; Michael MacCoss; Sebastian D Mackowiak; Marco Mangone; Sheldon McKay; Desirea Mecenas; Gennifer Merrihew; David M Miller; Andrew Muroyama; John I Murray; Siew-Loon Ooi; Hoang Pham; Taryn Phippen; Elicia A Preston; Nikolaus Rajewsky; Gunnar Rätsch; Heidi Rosenbaum; Joel Rozowsky; Kim Rutherford; Peter Ruzanov; Mihail Sarov; Rajkumar Sasidharan; Andrea Sboner; Paul Scheid; Eran Segal; Hyunjin Shin; Chong Shou; Frank J Slack; Cindie Slightam; Richard Smith; William C Spencer; E O Stinson; Scott Taing; Teruaki Takasaki; Dionne Vafeados; Ksenia Voronina; Guilin Wang; Nicole L Washington; Christina M Whittle; Beijing Wu; Koon-Kiu Yan; Georg Zeller; Zheng Zha; Mei Zhong; Xingliang Zhou; Julie Ahringer; Susan Strome; Kristin C Gunsalus; Gos Micklem; X Shirley Liu; Valerie Reinke; Stuart K Kim; LaDeana W Hillier; Steven Henikoff; Fabio Piano; Michael Snyder; Lincoln Stein; Jason D Lieb; Robert H Waterston
Journal:  Science       Date:  2010-12-22       Impact factor: 47.728

Review 5.  Seeing elegance in gene regulatory networks of the worm.

Authors:  Eric L Van Nostrand; Stuart K Kim
Journal:  Curr Opin Genet Dev       Date:  2011-09-29       Impact factor: 5.578

Review 6.  Understanding the transcriptome through RNA structure.

Authors:  Yue Wan; Michael Kertesz; Robert C Spitale; Eran Segal; Howard Y Chang
Journal:  Nat Rev Genet       Date:  2011-08-18       Impact factor: 53.242

7.  A common set of distinct features that characterize noncoding RNAs across multiple species.

Authors:  Long Hu; Chao Di; Mingxuan Kai; Yu-Cheng T Yang; Yang Li; Yunjiang Qiu; Xihao Hu; Kevin Y Yip; Michael Q Zhang; Zhi John Lu
Journal:  Nucleic Acids Res       Date:  2014-12-12       Impact factor: 16.971

8.  Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm.

Authors:  Supatcha Lertampaiporn; Chinae Thammarongtham; Chakarida Nukoolkit; Boonserm Kaewkamnerdpong; Marasri Ruengjitchatchawalya
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

9.  Insulin/IGF1 signaling inhibits age-dependent axon regeneration.

Authors:  Alexandra B Byrne; Trent Walradt; Kathryn E Gardner; Austin Hubbert; Valerie Reinke; Marc Hammarlund
Journal:  Neuron       Date:  2014-01-16       Impact factor: 17.173

Review 10.  Long Non-coding RNAs: Mechanisms, Experimental, and Computational Approaches in Identification, Characterization, and Their Biomarker Potential in Cancer.

Authors:  Anshika Chowdhary; Venkata Satagopam; Reinhard Schneider
Journal:  Front Genet       Date:  2021-07-01       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.