Literature DB >> 33148423

Quantitative structure-property relationships for the calculation of the soil adsorption coefficient using machine learning algorithms with calculated chemical properties from open-source software.

Yoshiyuki Kobayashi1, Kenichi Yoshida2.   

Abstract

The soil adsorption coefficient (Koc) is an environmental fate parameter that is essential for environmental risk assessment. However, obtaining Koc requires a significant amount of time and enormous expenditure. Thus, it is necessary to efficiently estimate Koc in the early stages of a chemical's development. In this study, a quantitative structure-property relationship (QSPR) model was developed using calculated physicochemical properties and molecular descriptors with the OPEn structure-activity/property Relationship App (OPERA) and Mordred software using the largest available Koc dataset. Specifically, we compared the accuracies of the model using the light gradient boosted machine (LightGBM), a gradient boosting decision tree (GBDT) algorithm, with those of previous models. The experimental results suggested the potential to develop a QSPR model that will produce highly accurate Koc values using molecular descriptors and physicochemical properties. Unlike previous studies, the use of a combination of LightGBM, OPERA and Mordred enables the prediction of Koc for many chemicals with high accuracy. In this study, OPERA was used to calculate the physicochemical properties, and Mordred was used to calculate molecular descriptors. The wide range of chemicals covered by OPERA and Mordred enables the analysis of a diverse range of chemical compounds. We also report a method to tune the LightBGM program. The use of fast-processing software, such as LightGBM, enables parameter tuning of a method required to obtain best performance. Our research represents one of the few studies in the field of environmental chemistry to use LightGBM. Using physicochemical properties as well as molecular descriptors, we could develop highly accurate Koc prediction models when compared to prior studies. In addition, our QSPR models may be useful for preliminary environmental risk assessment without incurring significant costs during the early chemical developmental stage.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Environmental risk assessment; Gradient boosting decision trees; K(oc); Mordred; OPEn structure–activity relationship app; QSPR

Year:  2020        PMID: 33148423     DOI: 10.1016/j.envres.2020.110363

Source DB:  PubMed          Journal:  Environ Res        ISSN: 0013-9351            Impact factor:   6.498


  3 in total

1.  Development and Evaluation of a Machine Learning Prediction Model for Small-for-Gestational-Age Births in Women Exposed to Radiation before Pregnancy.

Authors:  Xi Bai; Zhibo Zhou; Yunyun Luo; Hongbo Yang; Huijuan Zhu; Shi Chen; Hui Pan
Journal:  J Pers Med       Date:  2022-03-31

2.  A model for predicting fall risks of hospitalized elderly in Taiwan-A machine learning approach based on both electronic health records and comprehensive geriatric assessment.

Authors:  Wei-Min Chu; Endah Kristiani; Yu-Chieh Wang; Yen-Ru Lin; Shih-Yi Lin; Wei-Cheng Chan; Chao-Tung Yang; Yu-Tse Tsan
Journal:  Front Med (Lausanne)       Date:  2022-08-09

3.  Predictive models for small-for-gestational-age births in women exposed to pesticides before pregnancy based on multiple machine learning algorithms.

Authors:  Xi Bai; Zhibo Zhou; Mingliang Su; Yansheng Li; Liuqing Yang; Kejia Liu; Hongbo Yang; Huijuan Zhu; Shi Chen; Hui Pan
Journal:  Front Public Health       Date:  2022-08-08
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.