Descriptor Free QSAR Modeling Using Deep Learning With Long Short-Term Memory Neural Networks.

Suman K Chakravarti, Sai Radha Mani Alla.

Abstract

Current practice in building QSAR models usually involves computing a set of descriptors for the training set compounds, applying a descriptor selection algorithm, and finally using a statistical fitting method to build the model. In this study, we explored the prospects of building good-quality, interpretable QSARs for big and diverse datasets without using any pre-calculated descriptors. To achieve this, we used different forms of Long Short-Term Memory (LSTM) neural networks, trained directly on either traditional SMILES codes or a new linear molecular notation developed as part of this work. Three endpoints were modeled: Ames mutagenicity, inhibition of P. falciparum Dd2, and inhibition of Hepatitis C Virus, with training sets ranging from 7,866 to 31,919 compounds. To boost the interpretability of the prediction results, an attention-based machine learning mechanism was used jointly with a bidirectional LSTM to detect structural alerts for the mutagenicity data set. Traditional fragment descriptor-based models were used for comparison. In the external and cross-validation experiments, the overall prediction accuracies of the LSTM models were close to those of the fragment-based models. However, the LSTM models were superior in predicting test chemicals that are dissimilar to the training set compounds, a coveted quality of QSAR models in real-world applications. In summary, it is possible to build QSAR models using LSTMs without pre-computed traditional descriptors, and such models are far from being a "black box." We hope that this study will help bring large, descriptor-less QSARs into mainstream use.
Copyright © 2019 Chakravarti and Alla.
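
As an illustration of what "trained directly on SMILES codes" can mean in practice, the sketch below turns SMILES strings into fixed-length integer sequences, the usual input form for an embedding layer followed by an LSTM. It is a minimal sketch under stated assumptions, not the authors' procedure: the abstract does not describe their tokenization, and the regular expression, the `<pad>`/`<unk>` tokens, and the function names `tokenize`, `build_vocab`, and `encode` are all hypothetical choices made here.

```python
import re

# Assumed tokenization: bracket atoms, Cl, Br and two-digit ring closures
# are kept as single tokens; everything else is split per character.
SMILES_TOKEN = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|.")


def tokenize(smiles):
    """Split a SMILES string into a list of tokens."""
    return SMILES_TOKEN.findall(smiles)


def build_vocab(smiles_list):
    """Map every token seen in the training SMILES to an integer id.

    Id 0 is reserved for padding and id 1 for tokens unseen in training.
    """
    vocab = {"<pad>": 0, "<unk>": 1}
    for token in sorted({t for s in smiles_list for t in tokenize(s)}):
        vocab[token] = len(vocab)
    return vocab


def encode(smiles, vocab, max_len):
    """Encode one SMILES string as exactly max_len integer ids.

    Longer sequences are truncated; shorter ones are right-padded with 0.
    """
    ids = [vocab.get(t, vocab["<unk>"]) for t in tokenize(smiles)][:max_len]
    return ids + [vocab["<pad>"]] * (max_len - len(ids))


if __name__ == "__main__":
    train = ["CCO", "ClCCBr", "c1ccccc1[N+](=O)[O-]"]
    vocab = build_vocab(train)
    for s in train:
        print(s, encode(s, vocab, max_len=16))
```

The resulting id sequences would typically be passed to an embedding layer and a (bidirectional) LSTM in a deep learning framework; that part is omitted here because the abstract gives no architectural details.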

Keywords:  LSTM (long short term memory networks); QSAR (quantitative structure-activity relationships); RNN (recurrent neural network); big data; hepatitis (C) virus; machine learning; malaria; mutagenicity

Year: 2019  PMID: 33733106  PMCID: PMC7861338  DOI: 10.3389/frai.2019.00017

Source DB: PubMed  Journal: Front Artif Intell  ISSN: 2624-8212


