| Literature DB >> 29467659 |
Rodolfo S Simões1, Vinicius G Maltarollo2, Patricia R Oliveira1, Kathia M Honorio1,3.
Abstract
Medicinal chemistry projects involve some steps aiming to develop a new drug, such as the analysis of biological targets related to a given disease, the discovery and the development of drug candidates for these targets, performing parallel biological tests to validate the drug effectiveness and side effects. Approaches as quantitative study of activity-structure relationships (QSAR) involve the construction of predictive models that relate a set of descriptors of a chemical compound series and its biological activities with respect to one or more targets in the human body. Datasets used to perform QSAR analyses are generally characterized by a small number of samples and this makes them more complex to build accurate predictive models. In this context, transfer and multi-task learning techniques are very suitable since they take information from other QSAR models to the same biological target, reducing efforts and costs for generating new chemical compounds. Therefore, this review will present the main features of transfer and multi-task learning studies, as well as some applications and its potentiality in drug design projects.Entities:
Keywords: QSAR; drug design; machine learning; medicinal chemistry; multi-task learning; transfer learning
Year: 2018 PMID: 29467659 PMCID: PMC5807924 DOI: 10.3389/fphar.2018.00074
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Examples of potential applications of transfer learning methods in drug design.
| Transfer learning approach | Concept | Application in drug design |
|---|---|---|
| Instance-based | Uses the same ML technique for modeling but apply some changes to the parameters of the target model. | Source and target datasets have the same endpoint (e.g., same molecular target) but the training data can be colleted at different experimental conditions. |
| Feature representation | Based on some mathematical transformations of data. | Source and target datasets have different but related endpoints, e.g., same classes of molecular target (kinases, nuclear receptors, proteases, etc.). |
| Parameters | It is assumed that both datasets share some properties. | Source and target datasets have the same or related endpoints. |
| Relational knowledge transfer | Based on technique for mapping the data in the target domain. | The endpoints of the source and target datasets are different but the domains (the independent variable in QSAR models) are related; e.g., cellular permeability and log P. |