Literature DB >> 29649472

Accurate RNA 5-methylcytosine site prediction based on heuristic physical-chemical properties reduction and classifier ensemble.

Ming Zhang1, Yan Xu2, Lei Li2, Zi Liu3, Xibei Yang2, Dong-Jun Yu4.   

Abstract

RNA 5-methylcytosine (m5C) is an important post-transcriptional modification that plays an indispensable role in biological processes. The accurate identification of m5C sites from primary RNA sequences is especially useful for deeply understanding the mechanisms and functions of m5C. Due to the difficulty and expensive costs of identifying m5C sites with wet-lab techniques, developing fast and accurate machine-learning-based prediction methods is urgently needed. In this study, we proposed a new m5C site predictor, called M5C-HPCR, by introducing a novel heuristic nucleotide physicochemical property reduction (HPCR) algorithm and classifier ensemble. HPCR extracts multiple reducts of physical-chemical properties for encoding discriminative features, while the classifier ensemble is applied to integrate multiple base predictors, each of which is trained based on a separate reduct of the physical-chemical properties obtained from HPCR. Rigorous jackknife tests on two benchmark datasets demonstrate that M5C-HPCR outperforms state-of-the-art m5C site predictors, with the highest values of MCC (0.859) and AUC (0.962). We also implemented the webserver of M5C-HPCR, which is freely available at http://cslab.just.edu.cn:8080/M5C-HPCR/.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Classifier ensemble; Heuristic properties reduction; Pseudo dinucleotide composition; RNA 5-methylcytosine

Mesh:

Substances:

Year:  2018        PMID: 29649472     DOI: 10.1016/j.ab.2018.03.027

Source DB:  PubMed          Journal:  Anal Biochem        ISSN: 0003-2697            Impact factor:   3.365


  11 in total

1.  Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.

Authors:  Md Mehedi Hasan; Sho Tsukiyama; Jae Youl Cho; Hiroyuki Kurata; Md Ashad Alam; Xiaowen Liu; Balachandran Manavalan; Hong-Wen Deng
Journal:  Mol Ther       Date:  2022-05-06       Impact factor: 12.910

2.  Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.

Authors:  Fang Ge; Ying Zhang; Jian Xu; Arif Muhammad; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

3.  DNN-m6A: A Cross-Species Method for Identifying RNA N6-Methyladenosine Sites Based on Deep Neural Network with Multi-Information Fusion.

Authors:  Lu Zhang; Xinyi Qin; Min Liu; Ziwei Xu; Guangzhong Liu
Journal:  Genes (Basel)       Date:  2021-02-28       Impact factor: 4.096

4.  Prediction of m5C Modifications in RNA Sequences by Combining Multiple Sequence Features.

Authors:  Lijun Dou; Xiaoling Li; Hui Ding; Lei Xu; Huaikun Xiang
Journal:  Mol Ther Nucleic Acids       Date:  2020-06-10       Impact factor: 8.886

5.  RNAm5Cfinder: A Web-server for Predicting RNA 5-methylcytosine (m5C) Sites Based on Random Forest.

Authors:  Jianwei Li; Yan Huang; Xiaoyue Yang; Yiran Zhou; Yuan Zhou
Journal:  Sci Rep       Date:  2018-11-23       Impact factor: 4.379

6.  RNAm5CPred: Prediction of RNA 5-Methylcytosine Sites Based on Three Different Kinds of Nucleotide Composition.

Authors:  Ting Fang; Zizheng Zhang; Rui Sun; Lin Zhu; Jingjing He; Bei Huang; Yi Xiong; Xiaolei Zhu
Journal:  Mol Ther Nucleic Acids       Date:  2019-10-18       Impact factor: 8.886

7.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

Review 8.  Epigenetics: Roles and therapeutic implications of non-coding RNA modifications in human cancers.

Authors:  Dawei Rong; Guangshun Sun; Fan Wu; Ye Cheng; Guoqiang Sun; Wei Jiang; Xiao Li; Yi Zhong; Liangliang Wu; Chuanyong Zhang; Weiwei Tang; Xuehao Wang
Journal:  Mol Ther Nucleic Acids       Date:  2021-05-01       Impact factor: 8.886

9.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

10.  BiLSTM-5mC: A Bidirectional Long Short-Term Memory-Based Approach for Predicting 5-Methylcytosine Sites in Genome-Wide DNA Promoters.

Authors:  Xin Cheng; Jun Wang; Qianyue Li; Taigang Liu
Journal:  Molecules       Date:  2021-12-07       Impact factor: 4.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.