Literature DB >> 18937041

Classification of bioaccumulative and non-bioaccumulative chemicals using statistical learning approaches.

Xiuli Sun1, Yan Li, Xianjie Liu, Jun Ding, Yonghua Wang, Hui Shen, Yaqing Chang.   

Abstract

The present work aimed at developing in silico models allowing for a reliable prediction of bioaccumulative compounds and non-bioaccumulative compounds based on the definition of Bioconcentration Factor (BCF) using a diverse data set of 238 organic molecules. The partial least squares analysis (PLS), C4.5, support vector machine (SVM), and random forest (RF) algorithms were applied, and their performance classifying these compounds in terms of their quantitative structure-activity relationships (QSAR) was evaluated and verified with 5-fold cross-validation and an independent evaluation data set. The obtained results show that the overall prediction accuracies (Q) of the optimal PLS, C4.5, SVM and RF models are 84.5-87.7% for the internal cross-validation, with prediction accuracy (CO) of 86.3-91.1% in the external test sets, and C4.5 is slightly better than the three other methods which presents a Q of 87.7%, and a CO of 91.1% for the test sets. All these results prove the reliabilities of the in silico models, which should be valuable for the environmental risk assessment of the substances.

Mesh:

Substances:

Year:  2008        PMID: 18937041     DOI: 10.1007/s11030-008-9092-x

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  14 in total

1.  Estimation of bioconcentration factors of nonionic organic compounds in fish by molecular connectivity indices and polarity correction factors.

Authors:  X Lu; S Tao; H Hu; R W Dawson
Journal:  Chemosphere       Date:  2000-11       Impact factor: 7.086

2.  Partial least squares modeling and genetic algorithm optimization in quantitative structure-activity relationships.

Authors:  K Hasegawa; K Funatsu
Journal:  SAR QSAR Environ Res       Date:  2000       Impact factor: 3.000

3.  Non-linear modeling of bioconcentration using partition coefficients for narcotic chemicals.

Authors:  S D Dimitrov; O G Mekenyan; J D Walker
Journal:  SAR QSAR Environ Res       Date:  2002-03       Impact factor: 3.000

4.  Random forest: a classification and regression tool for compound classification and QSAR modeling.

Authors:  Vladimir Svetnik; Andy Liaw; Christopher Tong; J Christopher Culberson; Robert P Sheridan; Bradley P Feuston
Journal:  J Chem Inf Comput Sci       Date:  2003 Nov-Dec

5.  Prediction of protein retention times in anion-exchange chromatography systems using support vector regression.

Authors:  Minghu Song; Curt M Breneman; Jinbo Bi; N Sukumar; Kristin P Bennett; Steven Cramer; Nihal Tugcu
Journal:  J Chem Inf Comput Sci       Date:  2002 Nov-Dec

6.  Fragment generation and support vector machines for inducing SARs.

Authors:  S Kramer; E Frank; C Helma
Journal:  SAR QSAR Environ Res       Date:  2002-07       Impact factor: 3.000

7.  Correlation of bioconcentration factors.

Authors:  D Mackay
Journal:  Environ Sci Technol       Date:  1982-05-01       Impact factor: 9.028

Review 8.  Effects of polychlorinated biphenyls on the nervous system.

Authors:  O Faroon; D Jones; C de Rosa
Journal:  Toxicol Ind Health       Date:  2000-09       Impact factor: 2.273

9.  Tumor classification by partial least squares using microarray gene expression data.

Authors:  Danh V Nguyen; David M Rocke
Journal:  Bioinformatics       Date:  2002-01       Impact factor: 6.937

10.  Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data.

Authors:  Baolin Wu; Tom Abbott; David Fishman; Walter McMurray; Gil Mor; Kathryn Stone; David Ward; Kenneth Williams; Hongyu Zhao
Journal:  Bioinformatics       Date:  2003-09-01       Impact factor: 6.937

View more
  6 in total

1.  A classification study of human β₃-adrenergic receptor agonists using BCUT descriptors.

Authors:  Ming Hao; Yan Li; Yonghua Wang; Shuwei Zhang
Journal:  Mol Divers       Date:  2011-05-31       Impact factor: 2.943

2.  Models for anti-tumor activity of bisphosphonates using refined topochemical descriptors.

Authors:  Rakesh K Goyal; G Singh; A K Madan
Journal:  Naturwissenschaften       Date:  2011-09-04

3.  A classification study of respiratory Syncytial Virus (RSV) inhibitors by variable selection with random forest.

Authors:  Ming Hao; Yan Li; Yonghua Wang; Shuwei Zhang
Journal:  Int J Mol Sci       Date:  2011-02-21       Impact factor: 5.923

4.  A systematic prediction of multiple drug-target interactions from chemical, genomic, and pharmacological data.

Authors:  Hua Yu; Jianxin Chen; Xue Xu; Yan Li; Huihui Zhao; Yupeng Fang; Xiuxiu Li; Wei Zhou; Wei Wang; Yonghua Wang
Journal:  PLoS One       Date:  2012-05-30       Impact factor: 3.240

5.  Prediction of bioconcentration factors in fish and invertebrates using machine learning.

Authors:  Thomas H Miller; Matteo D Gallidabino; James I MacRae; Stewart F Owen; Nicolas R Bury; Leon P Barron
Journal:  Sci Total Environ       Date:  2018-08-10       Impact factor: 7.963

6.  A systems-pharmacology analysis of herbal medicines used in health improvement treatment: predicting potential new drugs and targets.

Authors:  Jianling Liu; Mengjie Pei; Chunli Zheng; Yan Li; Yonghua Wang; Aiping Lu; Ling Yang
Journal:  Evid Based Complement Alternat Med       Date:  2013-11-28       Impact factor: 2.629

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.