| Literature DB >> 33618772 |
Nicolas Bosc1, Eloy Felix2, Ricardo Arcila2, David Mendez2, Martin R Saunders3, Darren V S Green3, Jason Ochoada4, Anang A Shelat4, Eric J Martin5, Preeti Iyer6, Ola Engkvist6, Andreas Verras7, James Duffy8, Jeremy Burrows8, J Mark F Gardner9, Andrew R Leach10.
Abstract
Malaria is a disease affecting hundreds of millions of people across the world, mainly in developing countries and especially in sub-Saharan Africa. It is the cause of hundreds of thousands of deaths each year and there is an ever-present need to identify and develop effective new therapies to tackle the disease and overcome increasing drug resistance. Here, we extend a previous study in which a number of partners collaborated to develop a consensus in silico model that can be used to identify novel molecules that may have antimalarial properties. The performance of machine learning methods generally improves with the number of data points available for training. One practical challenge in building large training sets is that the data are often proprietary and cannot be straightforwardly integrated. Here, this was addressed by sharing QSAR models, each built on a private data set. We describe the development of an open-source software platform for creating such models, a comprehensive evaluation of methods to create a single consensus model and a web platform called MAIP available at https://www.ebi.ac.uk/chembl/maip/ . MAIP is freely available for the wider community to make large-scale predictions of potential malaria inhibiting compounds. This project also highlights some of the practical challenges in reproducing published computational methods and the opportunities that open-source software can offer to the community.Entities:
Keywords: Antimalarial drug discovery; Classification modelling; Data fusion; Machine learning; Malaria; Naïve Bayes; Open‐source software; QSAR
Year: 2021 PMID: 33618772 PMCID: PMC7898753 DOI: 10.1186/s13321-021-00487-2
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514