Literature DB >> 12807355

Predicting the genotoxicity of thiophene derivatives from molecular structure.

Philip D Mosier1, Peter C Jurs, Laura L Custer, Stephen K Durham, Greg M Pearl.   

Abstract

We report several binary classification models that directly link the genetic toxicity of a series of 140 thiophene derivatives with information derived from the compounds' molecular structure. Genetic toxicity was measured using an SOS Chromotest. IMAX (maximal SOS induction factor) values were recorded for each of the 140 compounds both in the presence and in the absence of S9 rat liver homogenate. Compounds were classified as genotoxic if IMAX >or= 1.5 in either test or nongenotoxic if IMAX < 1.5 for both tests. The molecular structures were represented by numerical descriptors that encoded the topological, geometric, electronic, and polar surface area properties of the thiophene derivatives. The classification models used were linear discriminant analysis (LDA), k-nearest neighbor classification (k-NN), and the probabilistic neural network (PNN). These were used in conjunction with either a genetic algorithm or a generalized simulated annealing to find optimal subsets of descriptors for each classifier. The quality of the resulting models was determined by the number of misclassified compounds, with preference given to models that produced fewer false negative classifications. Model sizes ranged from seven descriptors for LDA to three descriptors for k-NN and PNN. Very good classification results were obtained with all three classifiers. Classification rates for the LDA, k-NN, and PNN models were 80, 85, and 85%, respectively, for the prediction set compounds. Additionally, a consensus model was generated that incorporated all three of the basic model types. This consensus model correctly predicted the genotoxicity of 95% of the prediction set compounds.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12807355     DOI: 10.1021/tx020104i

Source DB:  PubMed          Journal:  Chem Res Toxicol        ISSN: 0893-228X            Impact factor:   3.739


  2 in total

Review 1.  Bioinformatics opportunities for identification and study of medicinal plants.

Authors:  Vivekanand Sharma; Indra Neil Sarkar
Journal:  Brief Bioinform       Date:  2012-05-15       Impact factor: 11.622

2.  Prediction of skin sensitization with a particle swarm optimized support vector machine.

Authors:  Hua Yuan; Jianping Huang; Chenzhong Cao
Journal:  Int J Mol Sci       Date:  2009-07-17       Impact factor: 6.208

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.