Literature DB >> 32289603

Development of models predicting biodegradation rate rating with multiple linear regression and support vector machine algorithms.

Weihao Tang1, Yanying Li1, Yang Yu2, Zhongyu Wang1, Tong Xu1, Jingwen Chen1, Jun Lin2, Xuehua Li3.   

Abstract

Biodegradation is a significant process for removing organic chemicals from water, soil and sediment environments, and therefore biodegradability is critical to evaluate the environmental persistence of organic chemicals. In this study, based on a dataset with 171 compounds, four quantitative structure-activity relationship (QSAR) models were developed for predicting primary and ultimate biodegradation rate rating with multiple linear regression (MLR) and support vector machine (SVM) algorithms. Two MLR models were built with a dataset with carbon atom number ≤9, and two SVM models were built with a dataset with carbon atom number >9. In the MLR models, nArX (number of X on aromatic ring) is the most important descriptor governing primary and ultimate biodegradation of organic chemicals. For the SVM models, determination coefficient (R2) values, cross-validated coefficients (Q2LOO) and external validation coefficient (Q2ext) values are over 0.9, indicating the SVM models have satisfactory goodness-of-fit, robustness and external predictive abilities. The applicability domains of these models were visualized by the Williams plot. The developed models can be used as effective tools to predict biodegradability of organic chemicals.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Biodegradability; Molecular structure descriptors; Multiple linear regression; Quantitative structure–activity relationship; Support vector machine

Mesh:

Substances:

Year:  2020        PMID: 32289603     DOI: 10.1016/j.chemosphere.2020.126666

Source DB:  PubMed          Journal:  Chemosphere        ISSN: 0045-6535            Impact factor:   7.086


  2 in total

1.  Classification of Biodegradable Substances Using Balanced Random Trees and Boosted C5.0 Decision Trees.

Authors:  Alaa M Elsayad; Ahmed M Nassef; Mujahed Al-Dhaifallah; Khaled A Elsayad
Journal:  Int J Environ Res Public Health       Date:  2020-12-13       Impact factor: 3.390

2.  A novel reliability-based regression model to analyze and forecast the severity of COVID-19 patients.

Authors:  Negar Bakhtiarvand; Mehdi Khashei; Mehdi Mahnam; Somayeh Hajiahmadi
Journal:  BMC Med Inform Decis Mak       Date:  2022-05-05       Impact factor: 3.298

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.