Literature DB >> 24464801

Partial least square and k-nearest neighbor algorithms for improved 3D quantitative spectral data-activity relationship consensus modeling of acute toxicity.

Iva B Stoyanova-Slavova1, Svetoslav H Slavov, Bruce Pearce, Dan A Buzatu, Richard D Beger, Jon G Wilkes.   

Abstract

A diverse set of 154 chemicals that included US Food and Drug Administration-regulated compounds tested for their aquatic toxicity in Daphnia magna were modeled by a 3-dimensional quantitative spectral data-activity relationship (3D-QSDAR). Two distinct algorithms, partial least squares (PLS) and Tanimoto similarity-based k-nearest neighbors (KNN), were used to process bin occupancy descriptor matrices obtained after tessellation of the 3D-QSDAR space into regularly sized bins. The performance of models utilizing bins ranging in size from 2 ppm × 2 ppm × 0.5 Å to 20 ppm × 20 ppm × 2.5 Å was explored. Rigorous quality-control criteria were imposed: 1) 100 randomized 20% hold-out test sets were generated and the average R(2) test of the respective models was used as a measure of their performance, and 2) a Y-scrambling procedure was used to identify chance correlations. A consensus between the best-performing composite PLS model using 0.5 Å × 14 ppm × 14 ppm bins and 10 latent variables (average R(2) test  = 0.770) and the best composite KNN model using 0.5 Å × 8 ppm × 8 ppm and 2 neighbors (average R(2) test  = 0.801) offered an improvement of about 7.5% (R(2) test consensus  = 0.845). Projection of the most frequently occurring bins on the standard coordinate space indicated that the presence of a primary or secondary amino group-substituted aromatic systems-would result in an increased toxic effect in Daphnia. The presence of a second aromatic ring with highly electronegative substituents 5 Å to 7 Å apart from the first ring would lead to a further increase in toxicity.
© 2014 SETAC.

Entities:  

Keywords:  3D quantitative spectral data-activity relationship (QSDAR); Aquatic toxicology; Computational toxicology; Consensus modeling; Multivariate statistic

Mesh:

Substances:

Year:  2014        PMID: 24464801     DOI: 10.1002/etc.2534

Source DB:  PubMed          Journal:  Environ Toxicol Chem        ISSN: 0730-7268            Impact factor:   3.742


  3 in total

1.  MOLECULAR MODELLING, 3D-QSAR, AND DRUG DOCKING STUDIES ON THE ROLE OF NATURAL ANTICOAGULANT COMPOUNDS IN ANTITHROMBOTIC THERAPY.

Authors:  Prathusha Kakarla; Amith R Devireddy; Madhuri A Inupakutika; Upender R Cheeti; Jared T Floyd; Mukherjee M Mun; Raelyn N Vigil; Russell P Hunter; Manuel F Varela
Journal:  Int J Pharm Sci Res       Date:  2014

2.  Prior Knowledge for Predictive Modeling: The Case of Acute Aquatic Toxicity.

Authors:  Gulnara Shavalieva; Stavros Papadokonstantakis; Gregory Peters
Journal:  J Chem Inf Model       Date:  2022-08-23       Impact factor: 6.162

3.  Alignment-independent technique for 3D QSAR analysis.

Authors:  Jon G Wilkes; Iva B Stoyanova-Slavova; Dan A Buzatu
Journal:  J Comput Aided Mol Des       Date:  2016-03-30       Impact factor: 3.686

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.