Discriminating cirRNAs from other lncRNAs using a hierarchical extreme learning machine (H-ELM) algorithm with feature selection.

Lei Chen, Yu-Hang Zhang, Guohua Huang, Xiaoyong Pan, ShaoPeng Wang, Tao Huang, Yu-Dong Cai.

Abstract

As non-coding RNAs, circular RNAs (cirRNAs) and long non-coding RNAs (lncRNAs) have attracted increasing attention. They have been confirmed to participate in many biological processes, including transcriptional regulation, regulation of protein-coding genes, and binding to RNA-associated proteins. Until now, the differences between these two types of non-coding RNAs have not been fully uncovered, and it is still quite difficult to distinguish cirRNAs from other lncRNAs using simple techniques. In this study, we investigated these two types of non-coding RNAs using several computational methods. The purpose was to extract important factors that could distinguish cirRNAs from other lncRNAs and to build an effective classification model. First, we collected cirRNAs, lncRNAs and their representations from a previous study, in which each cirRNA or lncRNA was represented by 188 features derived from its graph representation, sequence and conservation properties. Second, these features were analyzed by the minimum redundancy maximum relevance (mRMR) method. The obtained mRMR feature list, the incremental feature selection method and the hierarchical extreme learning machine algorithm were employed to build an optimal classification model with a sensitivity of 0.703, a specificity of 0.850, an accuracy of 0.789 and a Matthews correlation coefficient of 0.561. Finally, we analyzed the 16 most important features. Among them, features describing the sequences and structures of the RNA molecule ranked highest, implying that they can be potential indicators of differences between cirRNAs and other lncRNAs. Meanwhile, other features related to evolutionary conservation and sequence consecution were also important.
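
The four evaluation measures and the incremental feature selection (IFS) step named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names, the `evaluate` callback (a stand-in for training and validating a classifier such as H-ELM on a feature subset) and all counts used with it are hypothetical, and the formulas are the standard confusion-matrix definitions.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple


def binary_metrics(tp: int, fn: int, tn: int, fp: int) -> Dict[str, float]:
    """Standard measures from confusion-matrix counts (positives = cirRNAs here, by assumption)."""
    sensitivity = tp / (tp + fn)          # fraction of positives recovered
    specificity = tn / (tn + fp)          # fraction of negatives recovered
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Matthews correlation coefficient; defined as 0 when a margin is empty.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"sensitivity": sensitivity, "specificity": specificity,
            "accuracy": accuracy, "mcc": mcc}


def incremental_feature_selection(
    ranked_features: Sequence[str],
    evaluate: Callable[[Sequence[str]], float],
) -> Tuple[List[str], float]:
    """Score the top-k ranked features for k = 1..N and keep the best-scoring prefix.

    `ranked_features` is assumed to come from a ranking method such as mRMR;
    `evaluate` is a hypothetical callback returning a score (e.g. MCC).
    """
    best_k, best_score = 0, float("-inf")
    for k in range(1, len(ranked_features) + 1):
        score = evaluate(ranked_features[:k])
        if score > best_score:            # ties keep the smaller feature set
            best_k, best_score = k, score
    return list(ranked_features[:best_k]), best_score
```

In such a setup, `evaluate` would typically wrap a cross-validated classifier and return the MCC from `binary_metrics`, so that the selected prefix is the feature subset on which the classifier performs best.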

Keywords:  Hierarchical extreme learning machine algorithm; Minimum redundancy maximum relevance; cirRNAs; lncRNAs

Year:  2017        PMID: 28913654     DOI: 10.1007/s00438-017-1372-7

Source DB:  PubMed          Journal:  Mol Genet Genomics        ISSN: 1617-4623            Impact factor:   3.291


