Literature DB >> 28053114

FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome.

Valentin Wucher1, Fabrice Legeai2,3, Benoît Hédan1, Guillaume Rizk3, Lætitia Lagoutte1, Tosso Leeb4, Vidhya Jagannathan4, Edouard Cadieu1, Audrey David2, Hannes Lohi5,6, Susanna Cirera7, Merete Fredholm7, Nadine Botherel1, Peter A J Leegwater8, Céline Le Béguec1, Hille Fieten8, Jeremy Johnson9, Jessica Alföldi9, Catherine André1, Kerstin Lindblad-Toh9,10, Christophe Hitte1, Thomas Derrien1.   

Abstract

Whole transcriptome sequencing (RNA-seq) has become a standard for cataloguing and monitoring RNA populations. One of the main bottlenecks, however, is to correctly identify the different classes of RNAs among the plethora of reconstructed transcripts, particularly those that will be translated (mRNAs) from the class of long non-coding RNAs (lncRNAs). Here, we present FEELnc (FlExible Extraction of LncRNAs), an alignment-free program that accurately annotates lncRNAs based on a Random Forest model trained with general features such as multi k-mer frequencies and relaxed open reading frames. Benchmarking versus five state-of-the-art tools shows that FEELnc achieves similar or better classification performance on GENCODE and NONCODE data sets. The program also provides specific modules that enable the user to fine-tune classification accuracy, to formalize the annotation of lncRNA classes and to identify lncRNAs even in the absence of a training set of non-coding RNAs. We used FEELnc on a real data set comprising 20 canine RNA-seq samples produced by the European LUPA consortium to substantially expand the canine genome annotation to include 10 374 novel lncRNAs and 58 640 mRNA transcripts. FEELnc moves beyond conventional coding potential classifiers by providing a standardized and complete solution for annotating lncRNAs and is freely available at https://github.com/tderrien/FEELnc.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28053114      PMCID: PMC5416892          DOI: 10.1093/nar/gkw1306

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  65 in total

1.  Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters.

Authors:  Leighton J Core; Joshua J Waterfall; John T Lis
Journal:  Science       Date:  2008-12-04       Impact factor: 47.728

2.  Using geneid to identify genes.

Authors:  Enrique Blanco; Genís Parra; Roderic Guigó
Journal:  Curr Protoc Bioinformatics       Date:  2007-06

3.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

4.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.

Authors:  Mihaela Pertea; Geo M Pertea; Corina M Antonescu; Tsung-Cheng Chang; Joshua T Mendell; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2015-02-18       Impact factor: 54.908

5.  Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat.

Authors:  Claudia Carrieri; Laura Cimatti; Marta Biagioli; Anne Beugnet; Silvia Zucchelli; Stefania Fedele; Elisa Pesce; Isidre Ferrer; Licio Collavin; Claudio Santoro; Alistair R R Forrest; Piero Carninci; Stefano Biffo; Elia Stupka; Stefano Gustincich
Journal:  Nature       Date:  2012-10-14       Impact factor: 49.962

6.  GENCODE: the reference human genome annotation for The ENCODE Project.

Authors:  Jennifer Harrow; Adam Frankish; Jose M Gonzalez; Electra Tapanari; Mark Diekhans; Felix Kokocinski; Bronwen L Aken; Daniel Barrell; Amonida Zadissa; Stephen Searle; If Barnes; Alexandra Bignell; Veronika Boychenko; Toby Hunt; Mike Kay; Gaurab Mukherjee; Jeena Rajan; Gloria Despacio-Reyes; Gary Saunders; Charles Steward; Rachel Harte; Michael Lin; Cédric Howald; Andrea Tanzer; Thomas Derrien; Jacqueline Chrast; Nathalie Walters; Suganthi Balasubramanian; Baikang Pei; Michael Tress; Jose Manuel Rodriguez; Iakes Ezkurdia; Jeltje van Baren; Michael Brent; David Haussler; Manolis Kellis; Alfonso Valencia; Alexandre Reymond; Mark Gerstein; Roderic Guigó; Tim J Hubbard
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

7.  The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.

Authors:  Thomas Derrien; Rory Johnson; Giovanni Bussotti; Andrea Tanzer; Sarah Djebali; Hagen Tilgner; Gregory Guernec; David Martin; Angelika Merkel; David G Knowles; Julien Lagarde; Lavanya Veeravalli; Xiaoan Ruan; Yijun Ruan; Timo Lassmann; Piero Carninci; James B Brown; Leonard Lipovich; Jose M Gonzalez; Mark Thomas; Carrie A Davis; Ramin Shiekhattar; Thomas R Gingeras; Tim J Hubbard; Cedric Notredame; Jennifer Harrow; Roderic Guigó
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

8.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

9.  Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project.

Authors:  Leif Andersson; Alan L Archibald; Cynthia D Bottema; Rudiger Brauning; Shane C Burgess; Dave W Burt; Eduardo Casas; Hans H Cheng; Laura Clarke; Christine Couldrey; Brian P Dalrymple; Christine G Elsik; Sylvain Foissac; Elisabetta Giuffra; Martien A Groenen; Ben J Hayes; LuSheng S Huang; Hassan Khatib; James W Kijas; Heebal Kim; Joan K Lunney; Fiona M McCarthy; John C McEwan; Stephen Moore; Bindu Nanduri; Cedric Notredame; Yniv Palti; Graham S Plastow; James M Reecy; Gary A Rohrer; Elena Sarropoulou; Carl J Schmidt; Jeffrey Silverstein; Ross L Tellam; Michele Tixier-Boichard; Gwenola Tosser-Klopp; Christopher K Tuggle; Johanna Vilkki; Stephen N White; Shuhong Zhao; Huaijun Zhou
Journal:  Genome Biol       Date:  2015-03-25       Impact factor: 13.583

10.  Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts.

Authors:  Liang Sun; Haitao Luo; Dechao Bu; Guoguang Zhao; Kuntao Yu; Changhai Zhang; Yuanning Liu; Runsheng Chen; Yi Zhao
Journal:  Nucleic Acids Res       Date:  2013-07-27       Impact factor: 16.971

View more
  119 in total

1.  lncSLdb: a resource for long non-coding RNA subcellular localization.

Authors:  Xiao Wen; Lin Gao; Xingli Guo; Xing Li; Xiaotai Huang; Ying Wang; Haifu Xu; Ruijie He; Chenglong Jia; Feixiang Liang
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

Review 2.  Towards a complete map of the human long non-coding RNA transcriptome.

Authors:  Barbara Uszczynska-Ratajczak; Julien Lagarde; Adam Frankish; Roderic Guigó; Rory Johnson
Journal:  Nat Rev Genet       Date:  2018-09       Impact factor: 53.242

3.  The cryptic unstable transcripts are associated with developmentally regulated gene expression in blood-stage Plasmodium falciparum.

Authors:  Shigang Yin; Yanting Fan; Xiaohui He; Guiying Wei; Yuhao Wen; Yuemeng Zhao; Mingli Shi; Jieqiong Wei; Huiling Chen; Jiping Han; Lubin Jiang; Qingfeng Zhang
Journal:  RNA Biol       Date:  2020-02-27       Impact factor: 4.652

4.  lncRNA expression predicts mRNA abundance.

Authors:  Alicja Pacholewska; Myong-Hee Sung
Journal:  Epigenomics       Date:  2019-07-24       Impact factor: 4.778

5.  Bioinformatics Approaches for Functional Prediction of Long Noncoding RNAs.

Authors:  Fayaz Seifuddin; Mehdi Pirooznia
Journal:  Methods Mol Biol       Date:  2021

6.  Control of Cognate Sense mRNA Translation by cis-Natural Antisense RNAs.

Authors:  Jules Deforges; Rodrigo S Reis; Philippe Jacquet; Shaoline Sheppard; Veerendra P Gadekar; Gene Hart-Smith; Andrea Tanzer; Ivo L Hofacker; Christian Iseli; Ioannis Xenarios; Yves Poirier
Journal:  Plant Physiol       Date:  2019-02-13       Impact factor: 8.340

7.  Identification and functional prediction of long non-coding RNAs of rice (Oryza sativa L.) at reproductive stage under salinity stress.

Authors:  Priyanka Jain; Samreen Hussian; Jyoti Nishad; Himanshu Dubey; Deepak Singh Bisht; Tilak Raj Sharma; Tapan Kumar Mondal
Journal:  Mol Biol Rep       Date:  2021-03-19       Impact factor: 2.316

8.  A deep recurrent neural network discovers complex biological rules to decipher RNA protein-coding potential.

Authors:  Steven T Hill; Rachael Kuintzle; Amy Teegarden; Erich Merrill; Padideh Danaee; David A Hendrix
Journal:  Nucleic Acids Res       Date:  2018-09-19       Impact factor: 16.971

9.  CPPred: coding potential prediction based on the global description of RNA sequence.

Authors:  Xiaoxue Tong; Shiyong Liu
Journal:  Nucleic Acids Res       Date:  2019-05-07       Impact factor: 16.971

10.  Premeiotic, 24-Nucleotide Reproductive PhasiRNAs Are Abundant in Anthers of Wheat and Barley But Not Rice and Maize.

Authors:  Sébastien Bélanger; Suresh Pokhrel; Kirk Czymmek; Blake C Meyers
Journal:  Plant Physiol       Date:  2020-09-11       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.