Literature DB >> 18637673

Evolving interpretable structure-activity relationship models. 2. Using multiobjective optimization to derive multiple models.

Kristian Birchall1, Valerie J Gillet, Gavin Harper, Stephen D Pickett.   

Abstract

A multiobjective evolutionary algorithm (MOEA) is described for evolving multiple structure-activity relationships (SARs). The SARs are encoded in easy-to-interpret reduced graph queries which describe features that are preferentially present in active compounds compared to inactives. The MOEA addresses a limitation associated with many machine learning methods; that is, the inherent tradeoff that exists in recall and precision which is usually handled by combining the two objectives into a single measure with a consequent loss of control. By simultaneously optimizing recall and precision, the MOEA generates a family of SARs that lie on the precision-recall (PR) curve. The user is then able to select a query with an appropriate balance in the two objectives: for example, a low recall-high precision query may be preferred when establishing the SAR, whereas a high recall-low precision query may be more appropriate in a virtual screening context. Each query on the PR curve aims at capturing the structure-activity information into a single representation, and each can be considered as an alternative (equally valid) solution. We then investigate combining individual queries into teams with the aim of capturing multiple SARs that may exist in a data set, for example, as is commonly seen in high-throughput screening data sets. Team formation is carried out iteratively as a postprocessing step following the evolution of the individual queries. The inclusion of uniqueness as a third objective within the MOEA provides an effective way of ensuring the queries are complementary in the active compounds they describe. Substantial improvements in both recall and precision are seen for some data sets. Furthermore, the resulting queries provide more detailed structure-activity information than is present in a single query.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18637673     DOI: 10.1021/ci800051h

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  2 in total

1.  Consensus QSAR model for identifying novel H5N1 inhibitors.

Authors:  Nitin Sharma; Chun Wei Yap
Journal:  Mol Divers       Date:  2012-07-21       Impact factor: 2.943

2.  QSAR workbench: automating QSAR modeling to drive compound design.

Authors:  Richard Cox; Darren V S Green; Christopher N Luscombe; Noj Malcolm; Stephen D Pickett
Journal:  J Comput Aided Mol Des       Date:  2013-04-25       Impact factor: 3.686

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.