| Literature DB >> 27863338 |
Alla P Toropova1, Andrey A Toropov2, Maria Raskova3, Ivan Raska3.
Abstract
By optimization of so-called correlation weights of attributes of simplified molecular input-line entry system (SMILES) quantitative structure - activity relationships (QSAR) for toxicity towards Pimephales promelas are established. A new SMILES attribute has been utilized in this work. This attribute is a molecular descriptor, which reflects (i) presence of different kinds of bonds (double, triple, and stereo chemical bonds); (ii) presence of nitrogen, oxygen, sulphur, and phosphorus atoms; and (iii) presence of fluorine, chlorine, bromine, and iodine atoms. The statistical characteristics of the best model are the following: n=226, r2=0.7630, RMSE=0.654 (training set); n=114, r2=0.7024, RMSE=0.766 (calibration set); n=226, r2=0.6292, RMSE=0.870 (validation set). A new criterion to select a preferable split into the training and validation sets are suggested and discussed. Copyright ÂEntities:
Keywords: CORAL software; Monte carlo method; Pimephales promelas; QSAR; Toxicity
Mesh:
Substances:
Year: 2016 PMID: 27863338 DOI: 10.1016/j.etap.2016.11.010
Source DB: PubMed Journal: Environ Toxicol Pharmacol ISSN: 1382-6689 Impact factor: 4.860