Literature DB >> 31026752

QSAR modelling study of the bioconcentration factor and toxicity of organic compounds to aquatic organisms using machine learning and ensemble methods.

Haixin Ai1, Xuewei Wu2, Li Zhang1, Mengyuan Qi2, Ying Zhao2, Qi Zhao3, Jian Zhao2, Hongsheng Liu4.   

Abstract

Bioconcentration factors and median lethal concentrations (LC50s) are important when assessing risks posed by organic pollutants to aquatic ecosystems. Various quantitative structure-activity relationship models have been developed to predict bioconcentration factors and classify acute toxicity. In the study, we developed a regression model using Recursive Feature Elimination (RFE) method combined with the Support Vector Machine (SVM) algorithm. We calculated 2D molecular descriptors from a dataset containing 450 diverse chemicals in our regression model. Then we built three ensemble models using three machine learning algorithms and calculated 12 molecular fingerprints from a dataset containing 400 diverse chemicals in our classification models. In the regression model, the R2 and Rpred2 for the regression model were 0.860 and 0.757, respectively. Other parameters indicated that the regression model made good predictions and could efficiently predict a new set of compounds following standards set by Golbraikh, Tropsha, and Roy. In the classification models, the ensemble-SVM classification model gave an overall accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve of 92.2, 95.1, 86.0, and 0.965, respectively, in a five-fold cross-validation and of 87.3, 92.6, 76.0, and 0.940, respectively, in an external validation. These parameters indicated that our ensemble-SVM model was more stable and gave more accurate predictions than previous models. The model could therefore be used to effectively predict aquatic toxicity and assess risks posed to aquatic ecosystems. We identified several structures most relevant to acute aquatic toxicity through predictions made by the two types of models, and this information may be important to aquatic toxicology experiments and aquatic system management.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Acute aquatic toxicity; Aquatic toxicology; Assessing risks; Bioconcentration factors; Ensemble-SVM

Mesh:

Substances:

Year:  2019        PMID: 31026752     DOI: 10.1016/j.ecoenv.2019.04.035

Source DB:  PubMed          Journal:  Ecotoxicol Environ Saf        ISSN: 0147-6513            Impact factor:   6.291


  3 in total

1.  Modeling and insights into molecular basis of low molecular weight respiratory sensitizers.

Authors:  Xueyan Cui; Rui Yang; Siwen Li; Juan Liu; Qiuyun Wu; Xiao Li
Journal:  Mol Divers       Date:  2020-03-12       Impact factor: 2.943

2.  In silico prediction models for thyroid peroxidase inhibitors and their application to synthetic flavors.

Authors:  Mihyun Seo; Changwon Lim; Hoonjeong Kwon
Journal:  Food Sci Biotechnol       Date:  2022-03-12       Impact factor: 2.391

3.  Incremental Learning in Modelling Process Analysis Technology (PAT)-An Important Tool in the Measuring and Control Circuit on the Way to the Smart Factory.

Authors:  Shivani Choudhary; Deborah Herdt; Erik Spoor; José Fernando García Molina; Marcel Nachtmann; Matthias Rädle
Journal:  Sensors (Basel)       Date:  2021-05-01       Impact factor: 3.576

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.