Literature DB >> 33446014

HSM6AP: a high-precision predictor for the Homo sapiens N6-methyladenosine (m^6 A) based on multiple weights and feature stitching.

Jing Li1, Shida He1, Fei Guo1, Quan Zou2.   

Abstract

Recent studies have shown that RNA methylation modification can affect RNA transcription, metabolism, splicing and stability. In addition, RNA methylation modification has been associated with cancer, obesity and other diseases. Based on information about human genome and machine learning, this paper discusses the effect of the fusion sequence and gene-level feature extraction on the accuracy of methylation site recognition. The significant limitation of existing computing tools was exposed by discovered of new features. (1) Most prediction models are based solely on sequence features and use SVM or random forest as classification methods. (2) Limited by the number of samples, the model may not achieve good performance. In order to establish a better prediction model for methylation sites, we must set specific weighting strategies for training samples and find more powerful and informative feature matrices to establish a comprehensive model. In this paper, we present HSM6AP, a high-precision predictor for the Homo sapiens N6-methyladenosine (m6A) based on multiple weights and feature stitching. Compared with existing methods, HSM6AP samples were creatively weighted during training, and a wide range of features were explored. Max-Relevance-Max-Distance (MRMD) is employed for feature selection, and the feature matrix is generated by fusing a single feature. The extreme gradient boosting (XGBoost), an integrated machine learning algorithm based on decision tree, is used for model training and improves model performance through parameter adjustment. Two rigorous independent data sets demonstrated the superiority of HSM6AP in identifying methylation sites. HSM6AP is an advanced predictor that can be directly employed by users (especially non-professional users) to predict methylation sites. Users can access our related tools and data sets at the following website: http://lab.malab.cn/~lijing/HSM6AP.html The codes of our tool can be publicly accessible at https://github.com/lijingtju/HSm6AP.git.

Entities:  

Keywords:  Methylation site; XGBoost; feature stitching; gene-derived features; sequence-derived feature

Mesh:

Substances:

Year:  2021        PMID: 33446014      PMCID: PMC8583144          DOI: 10.1080/15476286.2021.1875180

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


  58 in total

1.  What Contributes to Serotonin-Norepinephrine Reuptake Inhibitors' Dual-Targeting Mechanism? The Key Role of Transmembrane Domain 6 in Human Serotonin and Norepinephrine Transporters Revealed by Molecular Dynamics Simulation.

Authors:  Weiwei Xue; Fengyuan Yang; Panpan Wang; Guoxun Zheng; Yuzong Chen; Xiaojun Yao; Feng Zhu
Journal:  ACS Chem Neurosci       Date:  2018-01-24       Impact factor: 4.418

2.  PDC-SGB: Prediction of effective drug combinations using a stochastic gradient boosting algorithm.

Authors:  Qian Xu; Yi Xiong; Hao Dai; Kotni Meena Kumari; Qin Xu; Hong-Yu Ou; Dong-Qing Wei
Journal:  J Theor Biol       Date:  2017-01-16       Impact factor: 2.691

3.  Bastion6: a bioinformatics approach for accurate prediction of type VI secreted effectors.

Authors:  Jiawei Wang; Bingjiao Yang; André Leier; Tatiana T Marquez-Lago; Morihiro Hayashida; Andrea Rocker; Yanju Zhang; Tatsuya Akutsu; Kuo-Chen Chou; Richard A Strugnell; Jiangning Song; Trevor Lithgow
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

4.  SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features.

Authors:  Yuan Zhou; Pan Zeng; Yan-Hui Li; Ziding Zhang; Qinghua Cui
Journal:  Nucleic Acids Res       Date:  2016-02-20       Impact factor: 16.971

5.  Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.

Authors:  Bin Liu; Fule Liu; Xiaolong Wang; Junjie Chen; Longyun Fang; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2015-05-09       Impact factor: 16.971

6.  Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.

Authors:  Zhen Chen; Xuhan Liu; Fuyi Li; Chen Li; Tatiana Marquez-Lago; André Leier; Tatsuya Akutsu; Geoffrey I Webb; Dakang Xu; Alexander Ian Smith; Lei Li; Kuo-Chen Chou; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

7.  Predicting effective microRNA target sites in mammalian mRNAs.

Authors:  Vikram Agarwal; George W Bell; Jin-Wu Nam; David P Bartel
Journal:  Elife       Date:  2015-08-12       Impact factor: 8.140

8.  iTerm-PseKNC: a sequence-based tool for predicting bacterial transcriptional terminators.

Authors:  Chao-Qin Feng; Zhao-Yue Zhang; Xiao-Juan Zhu; Yan Lin; Wei Chen; Hua Tang; Hao Lin
Journal:  Bioinformatics       Date:  2019-05-01       Impact factor: 6.937

9.  PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method.

Authors:  Yi Xiong; Qiankun Wang; Junchen Yang; Xiaolei Zhu; Dong-Qing Wei
Journal:  Front Microbiol       Date:  2018-10-26       Impact factor: 5.640

10.  Is There Any Sequence Feature in the RNA Pseudouridine Modification Prediction Problem?

Authors:  Lijun Dou; Xiaoling Li; Hui Ding; Lei Xu; Huaikun Xiang
Journal:  Mol Ther Nucleic Acids       Date:  2019-11-21       Impact factor: 8.886

View more
  3 in total

1.  m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features.

Authors:  Zhizhou He; Jing Xu; Haoran Shi; Shuxiang Wu
Journal:  Genes (Basel)       Date:  2022-04-12       Impact factor: 4.141

Review 2.  The functional roles, cross-talk and clinical implications of m6A modification and circRNA in hepatocellular carcinoma.

Authors:  Sha Qin; Yitao Mao; Xue Chen; Juxiong Xiao; Yan Qin; Luqing Zhao
Journal:  Int J Biol Sci       Date:  2021-07-22       Impact factor: 6.580

3.  M6A-BiNP: predicting N6-methyladenosine sites based on bidirectional position-specific propensities of polynucleotides and pointwise joint mutual information.

Authors:  Mingzhao Wang; Juanying Xie; Shengquan Xu
Journal:  RNA Biol       Date:  2021-06-23       Impact factor: 4.652

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.