Literature DB >> 33635499

LncMachine: a machine learning algorithm for long noncoding RNA annotation in plants.

H Busra Cagirici1,2, S Galvez3, Taner Z Sen1, Hikmet Budak4.   

Abstract

Following the elucidation of the critical roles they play in numerous important biological processes, long noncoding RNAs (lncRNAs) have gained vast attention in recent years. Manual annotation of lncRNAs is restricted by known gene annotations and is prone to false prediction due to the incompleteness of available data. However, with the advent of high-throughput sequencing technologies, a magnitude of high-quality data has become available for annotation, especially for plant species such as wheat. Here, we compared prediction accuracies of several machine learning algorithms using a 10-fold cross-validation. This study includes a comprehensive feature selection step to refine irrelevant and repeated features. We present a crop-specific, alignment-free coding potential prediction tool, LncMachine, that performs at higher prediction accuracies than the currently available popular tools (CPC2, CPAT, and CNIT) when used with the Random Forest algorithm. Further, LncMachine with Random Forest performed well on human and mouse data, with an average accuracy of 92.67%. LncMachine only requires either a FASTA file or a TAB separated CSV file containing features as input files. LncMachine can deploy several user-provided algorithms in real time and therefore be effortlessly applied to a wide range of studies.

Entities:  

Keywords:  LncRNA; Machine learning; Plants; Random Forest

Year:  2021        PMID: 33635499     DOI: 10.1007/s10142-021-00769-w

Source DB:  PubMed          Journal:  Funct Integr Genomics        ISSN: 1438-793X            Impact factor:   3.410


  18 in total

Review 1.  Long non-coding RNAs and their functions in plants.

Authors:  Julia A Chekanova
Journal:  Curr Opin Plant Biol       Date:  2015-09-03       Impact factor: 7.834

2.  LncRNA-ID: Long non-coding RNA IDentification using balanced random forests.

Authors:  Rujira Achawanantakun; Jiao Chen; Yanni Sun; Yuan Zhang
Journal:  Bioinformatics       Date:  2015-08-26       Impact factor: 6.937

3.  Durum wheat genome highlights past domestication signatures and future improvement targets.

Authors:  Marco Maccaferri; Neil S Harris; Sven O Twardziok; Raj K Pasam; Heidrun Gundlach; Manuel Spannagl; Danara Ormanbekova; Thomas Lux; Verena M Prade; Sara G Milner; Axel Himmelbach; Martin Mascher; Paolo Bagnaresi; Primetta Faccioli; Paolo Cozzi; Massimiliano Lauria; Barbara Lazzari; Alessandra Stella; Andrea Manconi; Matteo Gnocchi; Marco Moscatelli; Raz Avni; Jasline Deek; Sezgi Biyiklioglu; Elisabetta Frascaroli; Simona Corneti; Silvio Salvi; Gabriella Sonnante; Francesca Desiderio; Caterina Marè; Cristina Crosatti; Erica Mica; Hakan Özkan; Benjamin Kilian; Pasquale De Vita; Daniela Marone; Reem Joukhadar; Elisabetta Mazzucotelli; Domenica Nigro; Agata Gadaleta; Shiaoman Chao; Justin D Faris; Arthur T O Melo; Mike Pumphrey; Nicola Pecchioni; Luciano Milanesi; Krystalee Wiebe; Jennifer Ens; Ron P MacLachlan; John M Clarke; Andrew G Sharpe; Chu Shin Koh; Kevin Y H Liang; Gregory J Taylor; Ron Knox; Hikmet Budak; Anna M Mastrangelo; Steven S Xu; Nils Stein; Iago Hale; Assaf Distelfeld; Matthew J Hayden; Roberto Tuberosa; Sean Walkowiak; Klaus F X Mayer; Aldo Ceriotti; Curtis J Pozniak; Luigi Cattivelli
Journal:  Nat Genet       Date:  2019-04-08       Impact factor: 38.330

4.  Fewer genes, more noncoding RNA.

Authors:  Jean-Michel Claverie
Journal:  Science       Date:  2005-09-02       Impact factor: 47.728

5.  RNA-directed DNA methylation is involved in regulating photoperiod-sensitive male sterility in rice.

Authors:  Jihua Ding; Jianqiang Shen; Hailiang Mao; Weibo Xie; Xianghua Li; Qifa Zhang
Journal:  Mol Plant       Date:  2012-09-30       Impact factor: 13.164

6.  Vernalization-mediated epigenetic silencing by a long intronic noncoding RNA.

Authors:  Jae Bok Heo; Sibum Sung
Journal:  Science       Date:  2010-12-02       Impact factor: 47.728

7.  Targeted RNA sequencing reveals the deep complexity of the human transcriptome.

Authors:  Tim R Mercer; Daniel J Gerhardt; Marcel E Dinger; Joanna Crawford; Cole Trapnell; Jeffrey A Jeddeloh; John S Mattick; John L Rinn
Journal:  Nat Biotechnol       Date:  2011-11-13       Impact factor: 54.908

8.  Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis.

Authors:  Jun Liu; Choonkyun Jung; Jun Xu; Huan Wang; Shulin Deng; Lucia Bernad; Catalina Arenas-Huertero; Nam-Hai Chua
Journal:  Plant Cell       Date:  2012-11-06       Impact factor: 11.277

9.  Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species.

Authors:  Hadas Hezroni; David Koppstein; Matthew G Schwartz; Alexandra Avrutin; David P Bartel; Igor Ulitsky
Journal:  Cell Rep       Date:  2015-05-07       Impact factor: 9.423

Review 10.  Long Non-coding RNA in Plants in the Era of Reference Sequences.

Authors:  Hikmet Budak; Sezgi Biyiklioglu Kaya; Halise Busra Cagirici
Journal:  Front Plant Sci       Date:  2020-03-12       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.