| Literature DB >> 33390943 |
Daniil Polykovskiy1, Alexander Zhebrak1, Benjamin Sanchez-Lengeling2, Sergey Golovanov3, Oktai Tatanov3, Stanislav Belyaev3, Rauf Kurbanov3, Aleksey Artamonov3, Vladimir Aladinskiy1, Mark Veselov1, Artur Kadurin1, Simon Johansson4, Hongming Chen4, Sergey Nikolenko1,3,5, Alán Aspuru-Guzik6,7,8,9, Alex Zhavoronkov1.
Abstract
Generative models are becoming a tool of choice for exploring the molecular space. These models learn on a large training dataset and produce novel molecular structures with similar properties. Generated structures can be utilized for virtual screening or training semi-supervized predictive models in the downstream tasks. While there are plenty of generative models, it is unclear how to compare and rank them. In this work, we introduce a benchmarking platform called Molecular Sets (MOSES) to standardize training and comparison of molecular generative models. MOSES provides training and testing datasets, and a set of metrics to evaluate the quality and diversity of generated structures. We have implemented and compared several molecular generation models and suggest to use our results as reference points for further advancements in generative chemistry research. The platform and source code are available at https://github.com/molecularsets/moses.Entities:
Keywords: benchmark; deep learning; distribution learning; drug discovery; generative models
Year: 2020 PMID: 33390943 PMCID: PMC7775580 DOI: 10.3389/fphar.2020.565644
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810