Literature DB >> 32140819

i6mA-Fuse: improved and robust prediction of DNA 6 mA sites in the Rosaceae genome by fusing multiple feature representation.

Md Mehedi Hasan1,2, Balachandran Manavalan3, Watshara Shoombuatong4, Mst Shamima Khatun1, Hiroyuki Kurata5,6.   

Abstract

DNA N6-methyladenine (6 mA) is one of the most vital epigenetic modifications and involved in controlling the various gene expression levels. With the avalanche of DNA sequences generated in numerous databases, the accurate identification of 6 mA plays an essential role for understanding molecular mechanisms. Because the experimental approaches are time-consuming and costly, it is desirable to develop a computation model for rapidly and accurately identifying 6 mA. To the best of our knowledge, we first proposed a computational model named i6mA-Fuse to predict 6 mA sites from the Rosaceae genomes, especially in Rosa chinensis and Fragaria vesca. We implemented the five encoding schemes, i.e., mononucleotide binary, dinucleotide binary, k-space spectral nucleotide, k-mer, and electron-ion interaction pseudo potential compositions, to build the five, single-encoding random forest (RF) models. The i6mA-Fuse uses a linear regression model to combine the predicted probability scores of the five, single encoding-based RF models. The resultant species-specific i6mA-Fuse achieved remarkably high performances with AUCs of 0.982 and 0.978 and with MCCs of 0.869 and 0.858 on the independent datasets of Rosa chinensis and Fragaria vesca, respectively. In the F. vesca-specific i6mA-Fuse, the MBE and EIIP contributed to 75% and 25% of the total prediction; in the R. chinensis-specific i6mA-Fuse, Kmer, MBE, and EIIP contribute to 15%, 65%, and 20% of the total prediction. To assist high-throughput prediction for DNA 6 mA identification, the i6mA-Fuse is publicly accessible at https://kurata14.bio.kyutech.ac.jp/i6mA-Fuse/.

Entities:  

Keywords:  DNA 6 mA; Feature encoding; Machine learning; Sequence analysis

Mesh:

Substances:

Year:  2020        PMID: 32140819     DOI: 10.1007/s11103-020-00988-y

Source DB:  PubMed          Journal:  Plant Mol Biol        ISSN: 0167-4412            Impact factor:   4.076


  55 in total

1.  N6-methyladenine DNA modification in Drosophila.

Authors:  Guoqiang Zhang; Hua Huang; Di Liu; Ying Cheng; Xiaoling Liu; Wenxin Zhang; Ruichuan Yin; Dapeng Zhang; Peng Zhang; Jianzhao Liu; Chaoyi Li; Baodong Liu; Yuewan Luo; Yuanxiang Zhu; Ning Zhang; Shunmin He; Chuan He; Hailin Wang; Dahua Chen
Journal:  Cell       Date:  2015-04-30       Impact factor: 41.582

2.  Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.

Authors:  Ran Su; Jie Hu; Quan Zou; Balachandran Manavalan; Leyi Wei
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

Review 3.  N6-Methyladenine: A Conserved and Dynamic DNA Mark.

Authors:  Zach Klapholz O'Brown; Eric Lieberman Greer
Journal:  Adv Exp Med Biol       Date:  2016       Impact factor: 2.622

4.  i4mC-ROSE, a bioinformatics tool for the identification of DNA N4-methylcytosine sites in the Rosaceae genome.

Authors:  Md Mehedi Hasan; Balachandran Manavalan; Mst Shamima Khatun; Hiroyuki Kurata
Journal:  Int J Biol Macromol       Date:  2019-12-02       Impact factor: 6.953

5.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

6.  N 6-Hydroxymethyladenine: a hydroxylation derivative of N6-methyladenine in genomic DNA of mammals.

Authors:  Jun Xiong; Tian-Tian Ye; Cheng-Jie Ma; Qing-Yun Cheng; Bi-Feng Yuan; Yu-Qi Feng
Journal:  Nucleic Acids Res       Date:  2019-02-20       Impact factor: 16.971

7.  PreAIP: Computational Prediction of Anti-inflammatory Peptides by Integrating Multiple Complementary Features.

Authors:  Mst Shamima Khatun; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Front Genet       Date:  2019-03-05       Impact factor: 4.599

8.  Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Leyi Wei; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-04-30

9.  iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice.

Authors:  Hao Lv; Fu-Ying Dao; Zheng-Xing Guan; Dan Zhang; Jiu-Xin Tan; Yong Zhang; Wei Chen; Hao Lin
Journal:  Front Genet       Date:  2019-09-10       Impact factor: 4.599

10.  AIPpred: Sequence-Based Prediction of Anti-inflammatory Peptides Using Random Forest.

Authors:  Balachandran Manavalan; Tae H Shin; Myeong O Kim; Gwang Lee
Journal:  Front Pharmacol       Date:  2018-03-27       Impact factor: 5.810

View more
  13 in total

1.  Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Gwang Lee
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

2.  BERT6mA: prediction of DNA N6-methyladenine site using deep learning-based approaches.

Authors:  Sho Tsukiyama; Md Mehedi Hasan; Hong-Wen Deng; Hiroyuki Kurata
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

3.  Deepm5C: A deep-learning-based hybrid framework for identifying human RNA N5-methylcytosine sites using a stacking strategy.

Authors:  Md Mehedi Hasan; Sho Tsukiyama; Jae Youl Cho; Hiroyuki Kurata; Md Ashad Alam; Xiaowen Liu; Balachandran Manavalan; Hong-Wen Deng
Journal:  Mol Ther       Date:  2022-05-06       Impact factor: 12.910

4.  i4mC-Mouse: Improved identification of DNA N4-methylcytosine sites in the mouse genome using multiple encoding schemes.

Authors:  Md Mehedi Hasan; Balachandran Manavalan; Watshara Shoombuatong; Mst Shamima Khatun; Hiroyuki Kurata
Journal:  Comput Struct Biotechnol J       Date:  2020-04-08       Impact factor: 7.271

5.  Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species.

Authors:  Zutan Li; Hangjin Jiang; Lingpeng Kong; Yuanyuan Chen; Kun Lang; Xiaodan Fan; Liangyun Zhang; Cong Pian
Journal:  PLoS Comput Biol       Date:  2021-02-18       Impact factor: 4.475

6.  Feature selection for RNA cleavage efficiency at specific sites using the LASSO regression model in Arabidopsis thaliana.

Authors:  Daishin Ueno; Harunori Kawabe; Shotaro Yamasaki; Taku Demura; Ko Kato
Journal:  BMC Bioinformatics       Date:  2021-07-22       Impact factor: 3.169

7.  Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions.

Authors:  Yixiao Zhai; Yu Chen; Zhixia Teng; Yuming Zhao
Journal:  Front Cell Dev Biol       Date:  2020-10-29

8.  im6A-TS-CNN: Identifying the N6-Methyladenine Site in Multiple Tissues by Using the Convolutional Neural Network.

Authors:  Kewei Liu; Lei Cao; Pufeng Du; Wei Chen
Journal:  Mol Ther Nucleic Acids       Date:  2020-07-31       Impact factor: 8.886

Review 9.  Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction.

Authors:  Mst Shamima Khatun; Watshara Shoombuatong; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Curr Genomics       Date:  2020-09       Impact factor: 2.236

10.  PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features.

Authors:  Andi Nur Nilamyani; Firda Nurul Auliah; Mohammad Ali Moni; Watshara Shoombuatong; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Int J Mol Sci       Date:  2021-03-08       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.