Literature DB >> 34850821

NmRF: identification of multispecies RNA 2'-O-methylation modification sites from RNA sequences.

Chunyan Ao1, Quan Zou2,3, Liang Yu1.   

Abstract

2'-O-methylation (Nm) is a post-transcriptional modification of RNA that is catalyzed by 2'-O-methyltransferase and involves replacing the H on the 2'-hydroxyl group with a methyl group. The 2'-O-methylation modification site is detected in a variety of RNA types (miRNA, tRNA, mRNA, etc.), plays an important role in biological processes and is associated with different diseases. There are few functional mechanisms developed at present, and traditional high-throughput experiments are time-consuming and expensive to explore functional mechanisms. For a deeper understanding of relevant biological mechanisms, it is necessary to develop efficient and accurate recognition tools based on machine learning. Based on this, we constructed a predictor called NmRF based on optimal mixed features and random forest classifier to identify 2'-O-methylation modification sites. The predictor can identify modification sites of multiple species at the same time. To obtain a better prediction model, a two-step strategy is adopted; that is, the optimal hybrid feature set is obtained by combining the light gradient boosting algorithm and incremental feature selection strategy. In 10-fold cross-validation, the accuracies of Homo sapiens and Saccharomyces cerevisiae were 89.069 and 93.885%, and the AUC were 0.9498 and 0.9832, respectively. The rigorous 10-fold cross-validation and independent tests confirm that the proposed method is significantly better than existing tools. A user-friendly web server is accessible at http://lab.malab.cn/∼acy/NmRF.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  2’-O-methylation sites; RNA; feature extraction; light gradient boosting; random forest

Mesh:

Substances:

Year:  2022        PMID: 34850821     DOI: 10.1093/bib/bbab480

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  5 in total

1.  Geographic encoding of transcripts enabled high-accuracy and isoform-aware deep learning of RNA methylation.

Authors:  Daiyun Huang; Kunqi Chen; Bowen Song; Zhen Wei; Jionglong Su; Frans Coenen; João Pedro de Magalhães; Daniel J Rigden; Jia Meng
Journal:  Nucleic Acids Res       Date:  2022-10-14       Impact factor: 19.160

2.  IBPred: A sequence-based predictor for identifying ion binding protein in phage.

Authors:  Shi-Shi Yuan; Dong Gao; Xue-Qin Xie; Cai-Yi Ma; Wei Su; Zhao-Yue Zhang; Yan Zheng; Hui Ding
Journal:  Comput Struct Biotechnol J       Date:  2022-08-28       Impact factor: 6.155

3.  iPiDA-LTR: Identifying piwi-interacting RNA-disease associations based on Learning to Rank.

Authors:  Wenxiang Zhang; Jialu Hou; Bin Liu
Journal:  PLoS Comput Biol       Date:  2022-08-15       Impact factor: 4.779

4.  ISTRF: Identification of sucrose transporter using random forest.

Authors:  Dong Chen; Sai Li; Yu Chen
Journal:  Front Genet       Date:  2022-09-12       Impact factor: 4.772

5.  CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

Authors:  Mengting Niu; Quan Zou; Chen Lin
Journal:  PLoS Comput Biol       Date:  2022-01-20       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.