Literature DB >> 35571558

MoCL: Data-driven Molecular Fingerprint via Knowledge-aware Contrastive Learning from Molecular Graph.

Mengying Sun1, Jing Xing2, Huijun Wang3, Bin Chen2, Jiayu Zhou1.   

Abstract

Recent years have seen a rapid growth of utilizing graph neural networks (GNNs) in the biomedical domain for tackling drug-related problems. However, like any other deep architectures, GNNs are data hungry. While requiring labels in real world is often expensive, pretraining GNNs in an unsupervised manner has been actively explored. Among them, graph contrastive learning, by maximizing the mutual information between paired graph augmentations, has been shown to be effective on various downstream tasks. However, the current graph contrastive learning framework has two limitations. First, the augmentations are designed for general graphs and thus may not be suitable or powerful enough for certain domains. Second, the contrastive scheme only learns representations that are invariant to local perturbations and thus does not consider the global structure of the dataset, which may also be useful for downstream tasks. In this paper, we study graph contrastive learning designed specifically for the biomedical domain, where molecular graphs are present. We propose a novel framework called MoCL, which utilizes domain knowledge at both local- and global-level to assist representation learning. The local-level domain knowledge guides the augmentation process such that variation is introduced without changing graph semantics. The global-level knowledge encodes the similarity information between graphs in the entire dataset and helps to learn representations with richer semantics. The entire model is learned through a double contrast objective. We evaluate MoCL on various molecular datasets under both linear and semi-supervised settings and results show that MoCL achieves state-of-the-art performance.

Entities:  

Keywords:  Contrastive Learning; Domain knowledge; Molecular Graph

Year:  2021        PMID: 35571558      PMCID: PMC9105980          DOI: 10.1145/3447548.3467186

Source DB:  PubMed          Journal:  KDD        ISSN: 2154-817X


  13 in total

1.  Extended-connectivity fingerprints.

Authors:  David Rogers; Mathew Hahn
Journal:  J Chem Inf Model       Date:  2010-05-24       Impact factor: 4.956

2.  Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks.

Authors:  Alexey Dosovitskiy; Philipp Fischer; Jost Tobias Springenberg; Martin Riedmiller; Thomas Brox
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-10-29       Impact factor: 6.226

3.  ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology.

Authors:  Ann M Richard; Richard S Judson; Keith A Houck; Christopher M Grulke; Patra Volarath; Inthirany Thillainadarajah; Chihae Yang; James Rathman; Matthew T Martin; John F Wambaugh; Thomas B Knudsen; Jayaram Kancherla; Kamel Mansouri; Grace Patlewicz; Antony J Williams; Stephen B Little; Kevin M Crofton; Russell S Thomas
Journal:  Chem Res Toxicol       Date:  2016-07-20       Impact factor: 3.739

4.  A mathematical model for the determination of total area under glucose tolerance and other metabolic curves.

Authors:  M M Tai
Journal:  Diabetes Care       Date:  1994-02       Impact factor: 19.112

5.  Computational Modeling of β-Secretase 1 (BACE-1) Inhibitors Using Ligand Based Approaches.

Authors:  Govindan Subramanian; Bharath Ramsundar; Vijay Pande; Rajiah Aldrin Denny
Journal:  J Chem Inf Model       Date:  2016-10-10       Impact factor: 4.956

6.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?

Authors:  Dávid Bajusz; Anita Rácz; Károly Héberger
Journal:  J Cheminform       Date:  2015-05-20       Impact factor: 5.514

7.  The SIDER database of drugs and side effects.

Authors:  Michael Kuhn; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Nucleic Acids Res       Date:  2015-10-19       Impact factor: 16.971

8.  Prediction of pharmacological activities from chemical structures with graph convolutional neural networks.

Authors:  Miyuki Sakai; Kazuki Nagayasu; Norihiro Shibui; Chihiro Andoh; Kaito Takayama; Hisashi Shirakawa; Shuji Kaneko
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

9.  SWEETLEAD: an in silico database of approved drugs, regulated chemicals, and herbal isolates for computer-aided drug discovery.

Authors:  Paul A Novick; Oscar F Ortiz; Jared Poelman; Amir Y Abdulhay; Vijay S Pande
Journal:  PLoS One       Date:  2013-11-01       Impact factor: 3.240

10.  A Deep Learning Approach to Antibiotic Discovery.

Authors:  Jonathan M Stokes; Kevin Yang; Kyle Swanson; Wengong Jin; Andres Cubillos-Ruiz; Nina M Donghia; Craig R MacNair; Shawn French; Lindsey A Carfrae; Zohar Bloom-Ackermann; Victoria M Tran; Anush Chiappino-Pepe; Ahmed H Badran; Ian W Andrews; Emma J Chory; George M Church; Eric D Brown; Tommi S Jaakkola; Regina Barzilay; James J Collins
Journal:  Cell       Date:  2020-02-20       Impact factor: 41.582

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.