Literature DB >> 31070725

FP2VEC: a new molecular featurizer for learning molecular properties.

Woosung Jeon1, Dongsup Kim1.   

Abstract

MOTIVATION: One of the most successful methods for predicting the properties of chemical compounds is the quantitative structure-activity relationship (QSAR) methods. The prediction accuracy of QSAR models has recently been greatly improved by employing deep learning technology. Especially, newly developed molecular featurizers based on graph convolution operations on molecular graphs significantly outperform the conventional extended connectivity fingerprints (ECFP) feature in both classification and regression tasks, indicating that it is critical to develop more effective new featurizers to fully realize the power of deep learning techniques. Motivated by the fact that there is a clear analogy between chemical compounds and natural languages, this work develops a new molecular featurizer, FP2VEC, which represents a chemical compound as a set of trainable embedding vectors.
RESULTS: To implement and test our new featurizer, we build a QSAR model using a simple convolutional neural network (CNN) architecture that has been successfully used for natural language processing tasks such as sentence classification task. By testing our new method on several benchmark datasets, we demonstrate that the combination of FP2VEC and CNN model can achieve competitive results in many QSAR tasks, especially in classification tasks. We also demonstrate that the FP2VEC model is especially effective for multitask learning.
AVAILABILITY AND IMPLEMENTATION: FP2VEC is available from https://github.com/wsjeon92/FP2VEC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2019        PMID: 31070725     DOI: 10.1093/bioinformatics/btz307

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

Review 1.  Machine Learning in Antibacterial Drug Design.

Authors:  Marko Jukič; Urban Bren
Journal:  Front Pharmacol       Date:  2022-05-03       Impact factor: 5.988

2.  Quantitative Toxicity Prediction via Meta Ensembling of Multitask Deep Learning Models.

Authors:  Abdul Karim; Vahid Riahi; Avinash Mishra; M A Hakim Newton; Abdollah Dehzangi; Thomas Balle; Abdul Sattar
Journal:  ACS Omega       Date:  2021-05-03

3.  Membrane contact probability: An essential and predictive character for the structural and functional studies of membrane proteins.

Authors:  Lei Wang; Jiangguo Zhang; Dali Wang; Chen Song
Journal:  PLoS Comput Biol       Date:  2022-03-30       Impact factor: 4.475

4.  Convolutional neural networks (CNNs): concepts and applications in pharmacogenomics.

Authors:  Joel Markus Vaz; S Balaji
Journal:  Mol Divers       Date:  2021-05-24       Impact factor: 3.364

5.  Screening of antibacterial compounds with novel structure from the FDA approved drugs using machine learning methods.

Authors:  Wen-Xing Li; Xin Tong; Peng-Peng Yang; Yang Zheng; Ji-Hao Liang; Gong-Hua Li; Dahai Liu; Dao-Gang Guan; Shao-Xing Dai
Journal:  Aging (Albany NY)       Date:  2022-02-12       Impact factor: 5.682

Review 6.  On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach.

Authors:  Sangsoo Lim; Sangseon Lee; Yinhua Piao; MinGyu Choi; Dongmin Bang; Jeonghyeon Gu; Sun Kim
Journal:  Comput Struct Biotechnol J       Date:  2022-08-05       Impact factor: 6.155

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.