Literature DB >> 35509686

FAME: Fragment-based Conditional Molecular Generation for Phenotypic Drug Discovery.

Thai-Hoang Pham1, Lei Xie2, Ping Zhang3.   

Abstract

De novo molecular design is a key challenge in drug discovery due to the complexity of chemical space. With the availability of molecular datasets and advances in machine learning, many deep generative models are proposed for generating novel molecules with desired properties. However, most of the existing models focus only on molecular distribution learning and target-based molecular design, thereby hindering their potentials in real-world applications. In drug discovery, phenotypic molecular design has advantages over target-based molecular design, especially in first-in-class drug discovery. In this work, we propose the first deep graph generative model (FAME) targeting phenotypic molecular design, in particular gene expression-based molecular design. FAME leverages a conditional variational autoencoder framework to learn the conditional distribution generating molecules from gene expression profiles. However, this distribution is difficult to learn due to the complexity of the molecular space and the noisy phenomenon in gene expression data. To tackle these issues, a gene expression denoising (GED) model that employs contrastive objective function is first proposed to reduce noise from gene expression data. FAME is then designed to treat molecules as the sequences of fragments and learn to generate these fragments in autoregressive manner. By leveraging this fragment-based generation strategy and the denoised gene expression profiles, FAME can generate novel molecules with a high validity rate and desired biological activity. The experimental results show that FAME outperforms existing methods including both SMILES-based and graph-based deep generative models for phenotypic molecular design. Furthermore, the effective mechanism for reducing noise in gene expression data proposed in our study can be applied to omics data modeling in general for facilitating phenotypic drug discovery.

Entities:  

Keywords:  conditional generation; contrastive learning; fragment; gene expression; variational autoencoder

Year:  2022        PMID: 35509686      PMCID: PMC9061137          DOI: 10.1137/1.9781611977172.81

Source DB:  PubMed          Journal:  Proc SIAM Int Conf Data Min


  15 in total

Review 1.  Opportunities and challenges in phenotypic drug discovery: an industry perspective.

Authors:  John G Moffat; Fabien Vincent; Jonathan A Lee; Jörg Eder; Marco Prunotto
Journal:  Nat Rev Drug Discov       Date:  2017-07-07       Impact factor: 84.694

2.  Fréchet ChemNet Distance: A Metric for Generative Models for Molecules in Drug Discovery.

Authors:  Kristina Preuer; Philipp Renz; Thomas Unterthiner; Sepp Hochreiter; Günter Klambauer
Journal:  J Chem Inf Model       Date:  2018-08-28       Impact factor: 4.956

3.  Phenotypic vs. target-based drug discovery for first-in-class medicines.

Authors:  D C Swinney
Journal:  Clin Pharmacol Ther       Date:  2013-04       Impact factor: 6.875

4.  A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles.

Authors:  Aravind Subramanian; Rajiv Narayan; Steven M Corsello; David D Peck; Ted E Natoli; Xiaodong Lu; Joshua Gould; John F Davis; Andrew A Tubelli; Jacob K Asiedu; David L Lahr; Jodi E Hirschman; Zihan Liu; Melanie Donahue; Bina Julian; Mariya Khan; David Wadden; Ian C Smith; Daniel Lam; Arthur Liberzon; Courtney Toder; Mukta Bagul; Marek Orzechowski; Oana M Enache; Federica Piccioni; Sarah A Johnson; Nicholas J Lyons; Alice H Berger; Alykhan F Shamji; Angela N Brooks; Anita Vrcic; Corey Flynn; Jacqueline Rosains; David Y Takeda; Roger Hu; Desiree Davison; Justin Lamb; Kristin Ardlie; Larson Hogstrom; Peyton Greenside; Nathanael S Gray; Paul A Clemons; Serena Silver; Xiaoyun Wu; Wen-Ning Zhao; Willis Read-Button; Xiaohua Wu; Stephen J Haggarty; Lucienne V Ronco; Jesse S Boehm; Stuart L Schreiber; John G Doench; Joshua A Bittker; David E Root; Bang Wong; Todd R Golub
Journal:  Cell       Date:  2017-11-30       Impact factor: 41.582

5.  Multi-objective de novo drug design with conditional graph generative model.

Authors:  Yibo Li; Liangren Zhang; Zhenming Liu
Journal:  J Cheminform       Date:  2018-07-24       Impact factor: 5.514

6.  A dataset of images and morphological profiles of 30 000 small-molecule treatments using the Cell Painting assay.

Authors:  Mark-Anthony Bray; Sigrun M Gustafsdottir; Mohammad H Rohban; Shantanu Singh; Vebjorn Ljosa; Katherine L Sokolnicki; Joshua A Bittker; Nicole E Bodycombe; Vlado Dancík; Thomas P Hasaka; Cindy S Hon; Melissa M Kemp; Kejie Li; Deepika Walpita; Mathias J Wawer; Todd R Golub; Stuart L Schreiber; Paul A Clemons; Alykhan F Shamji; Anne E Carpenter
Journal:  Gigascience       Date:  2017-12-01       Impact factor: 6.524

7.  ExCAPE-DB: an integrated large scale dataset facilitating Big Data analysis in chemogenomics.

Authors:  Jiangming Sun; Nina Jeliazkova; Vladimir Chupakin; Jose-Felipe Golib-Dzib; Ola Engkvist; Lars Carlsson; Jörg Wegner; Hugo Ceulemans; Ivan Georgiev; Vedrin Jeliazkov; Nikolay Kochev; Thomas J Ashby; Hongming Chen
Journal:  J Cheminform       Date:  2017-03-07       Impact factor: 5.514

8.  Deep reinforcement learning for de novo drug design.

Authors:  Mariya Popova; Olexandr Isayev; Alexander Tropsha
Journal:  Sci Adv       Date:  2018-07-25       Impact factor: 14.136

9.  Molecular Generation for Desired Transcriptome Changes With Adversarial Autoencoders.

Authors:  Rim Shayakhmetov; Maksim Kuznetsov; Alexander Zhebrak; Artur Kadurin; Sergey Nikolenko; Alexander Aliper; Daniil Polykovskiy
Journal:  Front Pharmacol       Date:  2020-04-17       Impact factor: 5.810

10.  De novo generation of hit-like molecules from gene expression signatures using artificial intelligence.

Authors:  Oscar Méndez-Lucio; Benoit Baillif; Djork-Arné Clevert; David Rouquié; Joerg Wichard
Journal:  Nat Commun       Date:  2020-01-03       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.