| Literature DB >> 28029644 |
Artur Kadurin1,2,3,4, Alexander Aliper2, Andrey Kazennov2,5, Polina Mamoshina2,6, Quentin Vanhaelen2, Kuzma Khrabrov1, Alex Zhavoronkov2,7,5.
Abstract
Recent advances in deep learning and specifically in generative adversarial networks have demonstrated surprising results in generating new images and videos upon request even using natural language as input. In this paper we present the first application of generative adversarial autoencoders (AAE) for generating novel molecular fingerprints with a defined set of parameters. We developed a 7-layer AAE architecture with the latent middle layer serving as a discriminator. As an input and output the AAE uses a vector of binary fingerprints and concentration of the molecule. In the latent layer we also introduced a neuron responsible for growth inhibition percentage, which when negative indicates the reduction in the number of tumor cells after the treatment. To train the AAE we used the NCI-60 cell line assay data for 6252 compounds profiled on MCF-7 cell line. The output of the AAE was used to screen 72 million compounds in PubChem and select candidate molecules with potential anti-cancer properties. This approach is a proof of concept of an artificially-intelligent drug discovery engine, where AAEs are used to generate new molecular fingerprints with the desired molecular properties.Entities:
Keywords: adversarial autoencoder; artificial intelligence; deep learning; drug discovery; generative adversarian networks
Mesh:
Substances:
Year: 2017 PMID: 28029644 PMCID: PMC5355231 DOI: 10.18632/oncotarget.14073
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Architecture of Adversarial Autoencoder (AAE) used in this study
Encoder consists of two consequent layers L1 and L2 with 128 and 64 neurons, respectively. In turn, decoder consists of layers L'1 and L'2 comprising 64 and 128 neurons. Latent layer consists of 5 neurons one of which is Growth Inhibition percentage (GI) and the other 4 are discriminated with normal distribution.
Figure 2Mapping generated molecules to chemical space of Pubchem
Pubchem compounds are depicted in green, training set is shown in blue and mapped predictions in red.