Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Additive SMILES-based carcinogenicity models: Probabilistic principles in the search for robust predictions.

Literature DB >> 19742127

Additive SMILES-based carcinogenicity models: Probabilistic principles in the search for robust predictions.

Andrey A Toropov^1,2, Alla P Toropova^1,2, Emilio Benfenati².

Abstract

Optimal descriptors calculated with the simplified molecular input line entry system (SMILES) have been utilized in modeling of carcinogenicity as continuous values (logTD(50)). These descriptors can be calculated using correlation weights of SMILES attributes calculated by the Monte Carlo method. A considerable subset of these attributes includes rare attributes. The use of these rare attributes can lead to overtraining. One can avoid the influence of the rare attributes if their correlation weights are fixed to zero. A function, limS, has been defined to identify rare attributes. The limS defines the minimum number of occurrences in the set of structures of the training (subtraining) set, to accept attributes as usable. If an attribute is present less than limS, it is considered "rare", and thus not used. Two systems of building up models were examined: 1. classic training-test system; 2. balance of correlations for the subtraining and calibration sets (together, they are the original training set: the function of the calibration set is imitation of a preliminary test set). Three random splits into subtraining, calibration, and test sets were analysed. Comparison of above mentioned systems has shown that balance of correlations gives more robust prediction of the carcinogenicity for all three splits (split 1: r(test) (2)=0.7514, s(test)=0.684; split 2: r(test) (2)=0.7998, s(test)=0.600; split 3: r(test) (2)=0.7192, s(test)=0.728).

Entities: Chemical

Keywords: QSAR; SMILES; applicability domain; balance of correlations; carcinogenicity; optimal descriptor

Mesh：

Substances：

Year: 2009 PMID： 19742127 PMCID： PMC2738914 DOI： 10.3390/ijms10073106

Source DB: PubMed Journal: Int J Mol Sci ISSN： 1422-0067 Impact factor: 6.208

10 in total

1. QSAR model for predicting pesticide aquatic toxicity.

Authors: Paolo Mazzatorta; Martin Smiesko; Elena Lo Piparo; Emilio Benfenati
Journal: J Chem Inf Model Date: 2005 Nov-Dec Impact factor: 4.956

2. LINGO, an efficient holographic text based method to calculate biophysical properties and intermolecular similarities.

Authors: David Vidal; Michael Thormann; Miquel Pons
Journal: J Chem Inf Model Date: 2005 Mar-Apr Impact factor: 4.956

Review 3. Structure-activity relationship studies of chemical mutagens and carcinogens: mechanistic investigations and prediction approaches.

Authors: Romualdo Benigni
Journal: Chem Rev Date: 2005-05 Impact factor: 60.622

4. Optimisation of correlation weights of SMILES invariants for modelling oral quail toxicity.

Authors: Andrey A Toropov; Emilio Benfenati
Journal: Eur J Med Chem Date: 2006-12-15 Impact factor: 6.514

Review 5. The expanding role of predictive toxicology: an update on the (Q)SAR models for mutagens and carcinogens.

Authors: Romualdo Benigni; Tatiana I Netzeva; Emilio Benfenati; Cecilia Bossa; Rainer Franke; Christoph Helma; Etje Hulzebos; Carol Marchant; Ann Richard; Yin-Tak Woo; Chihae Yang
Journal: J Environ Sci Health C Environ Carcinog Ecotoxicol Rev Date: 2007 Jan-Mar Impact factor: 3.781

6. QSAR modeling of carcinogenic risk using discriminant analysis and topological molecular descriptors.

Authors: Joseph F Contrera; Philip Maclaughlin; Lowell H Hall; Lemont B Kier
Journal: Curr Drug Discov Technol Date: 2005-06

7. QSAR modelling of carcinogenicity by balance of correlations.

Authors: A A Toropov; A P Toropova; E Benfenati; A Manganaro
Journal: Mol Divers Date: 2009-02-04 Impact factor: 2.943

Review 8. Predictive models for carcinogenicity and mutagenicity: frameworks, state-of-the-art, and perspectives.

Authors: E Benfenati; R Benigni; D M Demarini; C Helma; D Kirkland; T M Martin; P Mazzatorta; G Ouédraogo-Arras; A M Richard; B Schilter; W G E J Schoonen; R D Snyder; C Yang
Journal: J Environ Sci Health C Environ Carcinog Ecotoxicol Rev Date: 2009-04 Impact factor: 3.781

9. Additive SMILES-based optimal descriptors in QSAR modelling bee toxicity: Using rare SMILES attributes to define the applicability domain.

Authors: A A Toropov; E Benfenati
Journal: Bioorg Med Chem Date: 2008-03-23 Impact factor: 3.641

10. QSAR modeling of acute toxicity by balance of correlations.

Authors: Andrey A Toropov; Bakhtiyor F Rasulev; Jerzy Leszczynski
Journal: Bioorg Med Chem Date: 2008-04-26 Impact factor: 3.641

10 in total

3 in total

1. CORAL: QSPR models for solubility of [C60] and [C70] fullerene derivatives.

Authors: Alla P Toropova; Andrey A Toropov; Emilio Benfenati; Giuseppina Gini; Danuta Leszczynska; Jerzy Leszczynski
Journal: Mol Divers Date: 2010-03-27 Impact factor: 2.943

2. Prediction of carcinogenicity for diverse chemicals based on substructure grouping and SVM modeling.

Authors: Kazutoshi Tanabe; Bono Lučić; Dragan Amić; Takio Kurita; Mikio Kaihara; Natsuo Onodera; Takahiro Suzuki
Journal: Mol Divers Date: 2010-02-26 Impact factor: 2.943

3. Large-scale structure-activity relationship study of hepatitis C virus NS5B polymerase inhibition using SMILES-based descriptors.

Authors: Apilak Worachartcheewan; Virapong Prachayasittikul; Alla P Toropova; Andrey A Toropov; Chanin Nantasenamat
Journal: Mol Divers Date: 2015-11 Impact factor: 2.943

3 in total