Literature DB >> 33940598

An effective self-supervised framework for learning expressive molecular global representations to drug discovery.

Pengyong Li1, Jun Wang2, Yixuan Qiao3, Hao Chen4, Yihuan Yu5, Xiaojun Yao6, Peng Gao2, Guotong Xie2, Sen Song7.   

Abstract

How to produce expressive molecular representations is a fundamental challenge in artificial intelligence-driven drug discovery. Graph neural network (GNN) has emerged as a powerful technique for modeling molecular data. However, previous supervised approaches usually suffer from the scarcity of labeled data and poor generalization capability. Here, we propose a novel molecular pre-training graph-based deep learning framework, named MPG, that learns molecular representations from large-scale unlabeled molecules. In MPG, we proposed a powerful GNN for modelling molecular graph named MolGNet, and designed an effective self-supervised strategy for pre-training the model at both the node and graph-level. After pre-training on 11 million unlabeled molecules, we revealed that MolGNet can capture valuable chemical insights to produce interpretable representation. The pre-trained MolGNet can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of drug discovery tasks, including molecular properties prediction, drug-drug interaction and drug-target interaction, on 14 benchmark datasets. The pre-trained MolGNet in MPG has the potential to become an advanced molecular encoder in the drug discovery pipeline. © The authors 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep learning; graph neural network; molecular representation; self-supervised learning

Mesh:

Year:  2021        PMID: 33940598     DOI: 10.1093/bib/bbab109

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

1.  Multi-type feature fusion based on graph neural network for drug-drug interaction prediction.

Authors:  Changxiang He; Yuru Liu; Hao Li; Hui Zhang; Yaping Mao; Xiaofei Qin; Lele Liu; Xuedian Zhang
Journal:  BMC Bioinformatics       Date:  2022-06-10       Impact factor: 3.307

2.  Cross-Adversarial Learning for Molecular Generation in Drug Design.

Authors:  Banghua Wu; Linjie Li; Yue Cui; Kai Zheng
Journal:  Front Pharmacol       Date:  2022-01-21       Impact factor: 5.810

Review 3.  On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach.

Authors:  Sangsoo Lim; Sangseon Lee; Yinhua Piao; MinGyu Choi; Dongmin Bang; Jeonghyeon Gu; Sun Kim
Journal:  Comput Struct Biotechnol J       Date:  2022-08-05       Impact factor: 6.155

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.