Literature DB >> 26956430

Accurate and interpretable nanoSAR models from genetic programming-based decision tree construction approaches.

Ceyda Oksel1, David A Winkler2,3,4,5, Cai Y Ma1, Terry Wilkins1, Xue Z Wang1.   

Abstract

The number of engineered nanomaterials (ENMs) being exploited commercially is growing rapidly, due to the novel properties they exhibit. Clearly, it is important to understand and minimize any risks to health or the environment posed by the presence of ENMs. Data-driven models that decode the relationships between the biological activities of ENMs and their physicochemical characteristics provide an attractive means of maximizing the value of scarce and expensive experimental data. Although such structure-activity relationship (SAR) methods have become very useful tools for modelling nanotoxicity endpoints (nanoSAR), they have limited robustness and predictivity and, most importantly, interpretation of the models they generate is often very difficult. New computational modelling tools or new ways of using existing tools are required to model the relatively sparse and sometimes lower quality data on the biological effects of ENMs. The most commonly used SAR modelling methods work best with large datasets, are not particularly good at feature selection, can be relatively opaque to interpretation, and may not account for nonlinearity in the structure-property relationships. To overcome these limitations, we describe the application of a novel algorithm, a genetic programming-based decision tree construction tool (GPTree) to nanoSAR modelling. We demonstrate the use of GPTree in the construction of accurate and interpretable nanoSAR models by applying it to four diverse literature datasets. We describe the algorithm and compare model results across the four studies. We show that GPTree generates models with accuracies equivalent to or superior to those of prior modelling studies on the same datasets. GPTree is a robust, automatic method for generation of accurate nanoSAR models with important advantages that it works with small datasets, automatically selects descriptors, and provides significantly improved interpretability of models.

Keywords:  Decision trees; QSAR; genetic-programming; nanoSAR; nanotoxicology

Mesh:

Year:  2016        PMID: 26956430     DOI: 10.3109/17435390.2016.1161857

Source DB:  PubMed          Journal:  Nanotoxicology        ISSN: 1743-5390            Impact factor:   5.913


  3 in total

1.  Fermented mulberry leaf meal as fishmeal replacer in the formulation of feed for carp Labeo rohita and catfish Heteropneustes fossilis-optimization by mathematical programming.

Authors:  Saheli Ali; Subrata Saha; Anilava Kaviraj
Journal:  Trop Anim Health Prod       Date:  2019-10-04       Impact factor: 1.559

Review 2.  Practices and Trends of Machine Learning Application in Nanotoxicology.

Authors:  Irini Furxhi; Finbarr Murphy; Martin Mullins; Athanasios Arvanitis; Craig A Poland
Journal:  Nanomaterials (Basel)       Date:  2020-01-08       Impact factor: 5.076

Review 3.  Understanding Nanoparticle Toxicity to Direct a Safe-by-Design Approach in Cancer Nanomedicine.

Authors:  Jossana A Damasco; Saisree Ravi; Joy D Perez; Daniel E Hagaman; Marites P Melancon
Journal:  Nanomaterials (Basel)       Date:  2020-11-02       Impact factor: 5.076

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.