Literature DB >> 35753698

TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization.

Young-Jun Jeon1, Md Mehedi Hasan2, Hyun Woo Park1, Ki Wook Lee1, Balachandran Manavalan3.   

Abstract

Long noncoding RNAs (lncRNAs) are primarily regulated by their cellular localization, which is responsible for their molecular functions, including cell cycle regulation and genome rearrangements. Accurately identifying the subcellular location of lncRNAs from sequence information is crucial for a better understanding of their biological functions and mechanisms. In contrast to traditional experimental methods, bioinformatics or computational methods can be applied for the annotation of lncRNA subcellular locations in humans more effectively. In the past, several machine learning-based methods have been developed to identify lncRNA subcellular localization, but relevant work for identifying cell-specific localization of human lncRNA remains limited. In this study, we present the first application of the tree-based stacking approach, TACOS, which allows users to identify the subcellular localization of human lncRNA in 10 different cell types. Specifically, we conducted comprehensive evaluations of six tree-based classifiers with 10 different feature descriptors, using a newly constructed balanced training dataset for each cell type. Subsequently, the strengths of the AdaBoost baseline models were integrated via a stacking approach, with an appropriate tree-based classifier for the final prediction. TACOS displayed consistent performance in both the cross-validation and independent assessments compared with the other two approaches employed in this study. The user-friendly online TACOS web server can be accessed at https://balalab-skku.org/TACOS.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Keywords:  bioinformatics; feature extraction; long noncoding RNAs; sequence analysis; stacking strategy; tree-based algorithms

Mesh:

Substances:

Year:  2022        PMID: 35753698      PMCID: PMC9294414          DOI: 10.1093/bib/bbac243

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  50 in total

Review 1.  Towards a complete map of the human long non-coding RNA transcriptome.

Authors:  Barbara Uszczynska-Ratajczak; Julien Lagarde; Adam Frankish; Roderic Guigó; Rory Johnson
Journal:  Nat Rev Genet       Date:  2018-09       Impact factor: 53.242

2.  Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework.

Authors:  Leyi Wei; Wenjia He; Adeel Malik; Ran Su; Lizhen Cui; Balachandran Manavalan
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

3.  The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier.

Authors:  Zhen Cao; Xiaoyong Pan; Yang Yang; Yan Huang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

4.  Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues.

Authors:  Je Hyuk Lee; Evan R Daugharthy; Jonathan Scheiman; Reza Kalhor; Thomas C Ferrante; Richard Terry; Brian M Turczyk; Joyce L Yang; Ho Suk Lee; John Aach; Kun Zhang; George M Church
Journal:  Nat Protoc       Date:  2015-02-12       Impact factor: 13.491

5.  Microbes and complex diseases: from experimental results to computational models.

Authors:  Yan Zhao; Chun-Chun Wang; Xing Chen
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

6.  BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.

Authors:  Bin Liu; Xin Gao; Hanyu Zhang
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

7.  Imaging individual mRNA molecules using multiple singly labeled probes.

Authors:  Arjun Raj; Patrick van den Bogaard; Scott A Rifkin; Alexander van Oudenaarden; Sanjay Tyagi
Journal:  Nat Methods       Date:  2008-09-21       Impact factor: 28.547

8.  NONCODEV5: a comprehensive annotation database for long non-coding RNAs.

Authors:  ShuangSang Fang; LiLi Zhang; JinCheng Guo; YiWei Niu; Yang Wu; Hui Li; LianHe Zhao; XiYuan Li; XueYi Teng; XianHui Sun; Liang Sun; Michael Q Zhang; RunSheng Chen; Yi Zhao
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  Three-dimensional intact-tissue sequencing of single-cell transcriptional states.

Authors:  Xiao Wang; William E Allen; Matthew A Wright; Emily L Sylwestrak; Nikolay Samusik; Sam Vesuna; Kathryn Evans; Cindy Liu; Charu Ramakrishnan; Jia Liu; Garry P Nolan; Felice-Alessio Bava; Karl Deisseroth
Journal:  Science       Date:  2018-06-21       Impact factor: 47.728

10.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.