Literature DB >> 31729524

A novel molecular representation with BiGRU neural networks for learning atom.

Xuan Lin1, Zhe Quan1, Zhi-Jie Wang1, Huang Huang2, Xiangxiang Zeng1,3.   

Abstract

Molecular representations play critical roles in researching drug design and properties, and effective methods are beneficial to assisting in the calculation of molecules and solving related problem in drug discovery. In previous years, most of the traditional molecular representations are based on hand-crafted features and rely heavily on biological experimentations, which are often costly and time consuming. However, recent researches achieve promising results using machine learning on various domains. In this article, we present a novel method named Smi2Vec-BiGRU that is designed for learning atoms and solving the single- and multitask binary classification problems in the field of drug discovery, which are the basic and also key problems in this field. Specifically, our approach transforms the molecule data in the SMILES format into a set of sample vectors and then feeds them into the bidirectional gated recurrent unit neural networks for training, which learns low-dimensional vector representations for molecular drug. We conduct extensive experiments on several widely used benchmarks including Tox21, SIDER and ClinTox. The experimental results show that our approach can achieve state-of-the-art performance on these benchmarking datasets, demonstrating the feasibility and competitiveness of our proposed approach.
© The Authors 2019. Published by Oxford University Press on behalf of the Institute of Mathematics and its Applications. All rights reserved.

Entities:  

Keywords:  drug discovery; machine learning; molecular representation; recurrent neural networks

Year:  2020        PMID: 31729524     DOI: 10.1093/bib/bbz125

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  8 in total

1.  HSM6AP: a high-precision predictor for the Homo sapiens N6-methyladenosine (m^6 A) based on multiple weights and feature stitching.

Authors:  Jing Li; Shida He; Fei Guo; Quan Zou
Journal:  RNA Biol       Date:  2021-02-12       Impact factor: 4.652

2.  A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features.

Authors:  Changli Feng; Zhaogui Ma; Deyun Yang; Xin Li; Jun Zhang; Yanjuan Li
Journal:  Front Bioeng Biotechnol       Date:  2020-05-05

3.  A Novel Molecular Representation Learning for Molecular Property Prediction with a Multiple SMILES-Based Augmentation.

Authors:  Chunyan Li; Jihua Feng; Shihu Liu; Junfeng Yao
Journal:  Comput Intell Neurosci       Date:  2022-01-28

4.  Identifying and Classifying Enhancers by Dinucleotide-Based Auto-Cross Covariance and Attention-Based Bi-LSTM.

Authors:  Shulin Zhao; Qingfeng Pan; Quan Zou; Ying Ju; Lei Shi; Xi Su
Journal:  Comput Math Methods Med       Date:  2022-04-05       Impact factor: 2.238

5.  Rational Discovery of Antimicrobial Peptides by Means of Artificial Intelligence.

Authors:  Paola Ruiz Puentes; Maria C Henao; Javier Cifuentes; Carolina Muñoz-Camargo; Luis H Reyes; Juan C Cruz; Pablo Arbeláez
Journal:  Membranes (Basel)       Date:  2022-07-14

6.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

7.  4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism.

Authors:  Rao Zeng; Song Cheng; Minghong Liao
Journal:  Front Cell Dev Biol       Date:  2021-05-10

8.  sgRNA-PSM: Predict sgRNAs On-Target Activity Based on Position-Specific Mismatch.

Authors:  Bin Liu; Zhihua Luo; Juan He
Journal:  Mol Ther Nucleic Acids       Date:  2020-01-31       Impact factor: 8.886

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.