Literature DB >> 35136929

Deep learning tools are top performers in long non-coding RNA prediction.

Tea Ammunét1, Ning Wang1, Sofia Khan1, Laura L Elo1,2.   

Abstract

The increasing amount of transcriptomic data has brought to light vast numbers of potential novel RNA transcripts. Accurately distinguishing novel long non-coding RNAs (lncRNAs) from protein-coding messenger RNAs (mRNAs) has challenged bioinformatic tool developers. Most recently, tools implementing deep learning architectures have been developed for this task, with the potential of discovering sequence features and their interactions still not surfaced in current knowledge. We compared the performance of deep learning tools with other predictive tools that are currently used in lncRNA coding potential prediction. A total of 15 tools representing the variety of available methods were investigated. In addition to known annotated transcripts, we also evaluated the use of the tools in actual studies with real-life data. The robustness and scalability of the tools' performance was tested with varying sized test sets and test sets with different proportions of lncRNAs and mRNAs. In addition, the ease-of-use for each tested tool was scored. Deep learning tools were top performers in most metrics and labelled transcripts similarly with each other in the real-life dataset. However, the proportion of lncRNAs and mRNAs in the test sets affected the performance of all tools. Computational resources were utilized differently between the top-ranking tools, thus the nature of the study may affect the decision of choosing one well-performing tool over another. Nonetheless, the results suggest favouring the novel deep learning tools over other tools currently in broad use.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  benchmark; lncRNA; machine learning

Mesh:

Substances:

Year:  2022        PMID: 35136929      PMCID: PMC9123429          DOI: 10.1093/bfgp/elab045

Source DB:  PubMed          Journal:  Brief Funct Genomics        ISSN: 2041-2649            Impact factor:   4.840


  24 in total

Review 1.  Towards a complete map of the human long non-coding RNA transcriptome.

Authors:  Barbara Uszczynska-Ratajczak; Julien Lagarde; Adam Frankish; Roderic Guigó; Rory Johnson
Journal:  Nat Rev Genet       Date:  2018-09       Impact factor: 53.242

2.  Deep learning in bioinformatics: Introduction, application, and perspective in the big data era.

Authors:  Yu Li; Chao Huang; Lizhong Ding; Zhongxiao Li; Yijie Pan; Xin Gao
Journal:  Methods       Date:  2019-04-22       Impact factor: 3.608

3.  A systematic review of computational methods for predicting long noncoding RNAs.

Authors:  Xinran Xu; Shuai Liu; Zhihao Yang; Xiaohan Zhao; Yaozhen Deng; Guangzhan Zhang; Jian Pang; Chengshuai Zhao; Wen Zhang
Journal:  Brief Funct Genomics       Date:  2021-06-09       Impact factor: 4.241

Review 4.  Deep learning in bioinformatics.

Authors:  Seonwoo Min; Byunghan Lee; Sungroh Yoon
Journal:  Brief Bioinform       Date:  2017-09-01       Impact factor: 11.622

5.  Prediction of lncRNAs and their interactions with nucleic acids: benchmarking bioinformatics tools.

Authors:  Ivan V Antonov; Evgeny Mazurov; Mark Borodovsky; Yulia A Medvedeva
Journal:  Brief Bioinform       Date:  2019-03-22       Impact factor: 11.622

6.  Recognition of protein coding regions in DNA sequences.

Authors:  J W Fickett
Journal:  Nucleic Acids Res       Date:  1982-09-11       Impact factor: 16.971

7.  FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome.

Authors:  Valentin Wucher; Fabrice Legeai; Benoît Hédan; Guillaume Rizk; Lætitia Lagoutte; Tosso Leeb; Vidhya Jagannathan; Edouard Cadieu; Audrey David; Hannes Lohi; Susanna Cirera; Merete Fredholm; Nadine Botherel; Peter A J Leegwater; Céline Le Béguec; Hille Fieten; Jeremy Johnson; Jessica Alföldi; Catherine André; Kerstin Lindblad-Toh; Christophe Hitte; Thomas Derrien
Journal:  Nucleic Acids Res       Date:  2017-05-05       Impact factor: 16.971

Review 8.  Long Noncoding RNA Identification: Comparing Machine Learning Based Tools for Long Noncoding Transcripts Discrimination.

Authors:  Siyu Han; Yanchun Liang; Ying Li; Wei Du
Journal:  Biomed Res Int       Date:  2016-11-29       Impact factor: 3.411

Review 9.  Essential guidelines for computational method benchmarking.

Authors:  Lukas M Weber; Wouter Saelens; Robrecht Cannoodt; Charlotte Soneson; Alexander Hapfelmeier; Paul P Gardner; Anne-Laure Boulesteix; Yvan Saeys; Mark D Robinson
Journal:  Genome Biol       Date:  2019-06-20       Impact factor: 13.583

10.  lncRNA_Mdeep: An Alignment-Free Predictor for Distinguishing Long Non-Coding RNAs from Protein-Coding Transcripts by Multimodal Deep Learning.

Authors:  Xiao-Nan Fan; Shao-Wu Zhang; Song-Yao Zhang; Jin-Jie Ni
Journal:  Int J Mol Sci       Date:  2020-07-23       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.