Literature DB >> 33575571

RNAsamba: neural network-based assessment of the protein-coding potential of RNA sequences.

Antonio P Camargo1, Vsevolod Sourkov2, Gonçalo A G Pereira1, Marcelo F Carazzolle1.   

Abstract

The advent of high-throughput sequencing technologies made it possible to obtain large volumes of genetic information, quickly and inexpensively. Thus, many efforts are devoted to unveiling the biological roles of genomic elements, being the distinction between protein-coding and long non-coding RNAs one of the most important tasks. We describe RNAsamba, a tool to predict the coding potential of RNA molecules from sequence information using a neural network-based that models both the whole sequence and the ORF to identify patterns that distinguish coding from non-coding transcripts. We evaluated RNAsamba's classification performance using transcripts coming from humans and several other model organisms and show that it recurrently outperforms other state-of-the-art methods. Our results also show that RNAsamba can identify coding signals in partial-length ORFs and UTR sequences, evidencing that its algorithm is not dependent on complete transcript sequences. Furthermore, RNAsamba can also predict small ORFs, traditionally identified with ribosome profiling experiments. We believe that RNAsamba will enable faster and more accurate biological findings from genomic data of species that are being sequenced for the first time. A user-friendly web interface, the documentation containing instructions for local installation and usage, and the source code of RNAsamba can be found at https://rnasamba.lge.ibi.unicamp.br/.
© The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2020        PMID: 33575571      PMCID: PMC7671399          DOI: 10.1093/nargab/lqz024

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  15 in total

1.  Transcriptomic Profiling and Microsatellite Identification in Cobia (Rachycentron canadum), Using High-Throughput RNA Sequencing.

Authors:  David Aciole Barbosa; Bruno C Araújo; Giovana Souza Branco; Alexandre S Simeone; Alexandre W S Hilsdorf; Daniela L Jabes; Luiz R Nunes; Renata G Moreira; Fabiano B Menegidio
Journal:  Mar Biotechnol (NY)       Date:  2021-12-02       Impact factor: 3.619

2.  Class similarity network for coding and long non-coding RNA classification.

Authors:  Yu Zhang; Yahui Long; Chee Keong Kwoh
Journal:  BMC Bioinformatics       Date:  2021-12-20       Impact factor: 3.169

3.  The genome sequences of the male and female green-veined white, Pieris napi (Linnaeus, 1758).

Authors:  Konrad Lohse; Alex Hayward; Sam Ebdon
Journal:  Wellcome Open Res       Date:  2021-10-26

4.  Surviving in the Brine: A Multi-Omics Approach for Understanding the Physiology of the Halophile Fungus Aspergillus sydowii at Saturated NaCl Concentration.

Authors:  Irina Jiménez-Gómez; Gisell Valdés-Muñoz; Aldo Moreno-Ulloa; Yordanis Pérez-Llano; Tonatiuh Moreno-Perlín; Hortencia Silva-Jiménez; Fernando Barreto-Curiel; María Del Rayo Sánchez-Carbente; Jorge Luis Folch-Mallol; Nina Gunde-Cimerman; Asunción Lago-Lestón; Ramón Alberto Batista-García
Journal:  Front Microbiol       Date:  2022-05-02       Impact factor: 6.064

5.  Computational Analysis Predicts Hundreds of Coding lncRNAs in Zebrafish.

Authors:  Shital Kumar Mishra; Han Wang
Journal:  Biology (Basel)       Date:  2021-04-26

6.  Identification of Long Non-coding RNA Isolated From Naturally Infected Macrophages and Associated With Bovine Johne's Disease in Canadian Holstein Using a Combination of Neural Networks and Logistic Regression.

Authors:  Andrew Marete; Olivier Ariel; Eveline Ibeagha-Awemu; Nathalie Bissonnette
Journal:  Front Vet Sci       Date:  2021-04-22

7.  The genome sequence of the heath fritillary, Melitaea athalia (Rottemburg, 1775).

Authors:  Alex Hayward; Roger Vila; Dominik R Laetsch; Konrad Lohse; Tobias Baril
Journal:  Wellcome Open Res       Date:  2021-11-10

8.  FusionGDB 2.0: fusion gene annotation updates aided by deep learning.

Authors:  Pora Kim; Hua Tan; Jiajia Liu; Haeseung Lee; Hyesoo Jung; Himanshu Kumar; Xiaobo Zhou
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

9.  The genome sequence of the holly blue, Celastrina argiolus (Linnaeus, 1758).

Authors:  Alex Hayward; Charlotte Wright
Journal:  Wellcome Open Res       Date:  2021-12-14

10.  C-Myc-activated long non-coding RNA LINC01050 promotes gastric cancer growth and metastasis by sponging miR-7161-3p to regulate SPZ1 expression.

Authors:  Ziwei Ji; Tianbin Tang; Mengxia Chen; Buyuan Dong; Wenjing Sun; Nan Wu; Hao Chen; Qian Feng; Xingyi Yang; Rong Jin; Lei Jiang
Journal:  J Exp Clin Cancer Res       Date:  2021-11-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.