Literature DB >> 20186479

Prediction of carcinogenicity for diverse chemicals based on substructure grouping and SVM modeling.

Kazutoshi Tanabe1, Bono Lučić, Dragan Amić, Takio Kurita, Mikio Kaihara, Natsuo Onodera, Takahiro Suzuki.   

Abstract

The Carcinogenicity Reliability Database (CRDB) was constructed by collecting experimental carcinogenicity data on about 1,500 chemicals from six sources, including IARC, and NTP databases, and then by ranking their reliabilities into six unified categories. A wide variety of 911 organic chemicals were selected from the database for QSAR modeling, and 1,504 kinds of different molecular descriptors were calculated, based on their 3D molecular structures as modeled by the Dragon software. Positive (carcinogenic) and negative (non-carcinogenic) chemicals containing various substructures were counted using atom and functional group count descriptors, and the statistical significance of ratios of positives to negatives was tested for those substructures. Very few were judged to be strongly related to carcinogenicity, among substructures known to be responsible for carcinogens as revealed from biomedical studies. In order to develop QSAR models for the prediction of the carcinogenicities of a wide variety of chemicals with a satisfactory performance level, the relationship between the carcinogenicity data with improved reliability and a subset of significant descriptors selected from 1,504 Dragon descriptors was analyzed with a support vector machine (SVM) method: the classification function (SVC) for weighted data in LIBSVM program was used to classify chemicals into two carcinogenic categories (positive or negative), where weights were set depending on the reliabilities of the carcinogenicity data. The quality and stability of the models presented were tested by performing a dual cross-validation procedure. A single SVM model as the first step was developed for all the 911 chemicals using 250 selected descriptors, achieving an overall accuracy level, i.e., positive and negative correct estimate, of about 70%. In order to improve the accuracy of the final model, the 911 chemicals were classified into 20 mutually overlapping subgroups according to contained substructures, a specific SVM model was optimized for each subgroup, and the predicted carcinogenicities of the 911 chemicals were determined by the majorities of the outputs of the corresponding SVM models. The model developed on the basis of grouping of chemicals into 20 substructures predicts the carcinogenicities of a wide variety of chemicals with a satisfactory overall accuracy of approximately 80%.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20186479     DOI: 10.1007/s11030-010-9232-y

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  37 in total

Review 1.  Quantitative structure-activity relationships for predicting mutagenicity and carcinogenicity.

Authors:  Grace Patlewicz; Rosemary Rodford; John D Walker
Journal:  Environ Toxicol Chem       Date:  2003-08       Impact factor: 3.742

2.  Prediction of the rodent carcinogenicity of 60 pesticides by the DEREKfW expert system.

Authors:  Pierre Crettaz; Romualdo Benigni
Journal:  J Chem Inf Model       Date:  2005 Nov-Dec       Impact factor: 4.956

Review 3.  Structure-activity relationship studies of chemical mutagens and carcinogens: mechanistic investigations and prediction approaches.

Authors:  Romualdo Benigni
Journal:  Chem Rev       Date:  2005-05       Impact factor: 60.622

4.  Virtual screening of molecular databases using a support vector machine.

Authors:  Robert N Jorissen; Michael K Gilson
Journal:  J Chem Inf Model       Date:  2005 May-Jun       Impact factor: 4.956

5.  QSAR modelling of carcinogenicity by balance of correlations.

Authors:  A A Toropov; A P Toropova; E Benfenati; A Manganaro
Journal:  Mol Divers       Date:  2009-02-04       Impact factor: 2.943

Review 6.  Predictive models for carcinogenicity and mutagenicity: frameworks, state-of-the-art, and perspectives.

Authors:  E Benfenati; R Benigni; D M Demarini; C Helma; D Kirkland; T M Martin; P Mazzatorta; G Ouédraogo-Arras; A M Richard; B Schilter; W G E J Schoonen; R D Snyder; C Yang
Journal:  J Environ Sci Health C Environ Carcinog Ecotoxicol Rev       Date:  2009-04       Impact factor: 3.781

7.  Feature selection for the imbalanced QSAR problems by using easyensemble.

Authors:  Tian-Yu Liu; Guo-Zheng Li; Jack Y Yang; Mary Qu Yang
Journal:  Int J Comput Biol Drug Des       Date:  2008

8.  A study of structure-carcinogenicity relationship for 86 compounds from NTP data base using topological indices as descriptors.

Authors:  M Vracko
Journal:  SAR QSAR Environ Res       Date:  2000       Impact factor: 3.000

Review 9.  Improving prediction of chemical carcinogenicity by considering multiple mechanisms and applying toxicogenomic approaches.

Authors:  Kathryn Z Guyton; Amy D Kyle; Jiri Aubrecht; Vincent J Cogliano; David A Eastmond; Marc Jackson; Nagalakshmi Keshava; Martha S Sandy; Babasaheb Sonawane; Luoping Zhang; Michael D Waters; Martyn T Smith
Journal:  Mutat Res       Date:  2008-11-01       Impact factor: 2.433

10.  Development of quantitative structure-activity relationship (QSAR) models to predict the carcinogenic potency of chemicals I. Alternative toxicity measures as an estimator of carcinogenic potency.

Authors:  Raghuraman Venkatapathy; Ching Yi Wang; Robert Mark Bruce; Chandrika Moudgal
Journal:  Toxicol Appl Pharmacol       Date:  2008-10-15       Impact factor: 4.219

View more
  4 in total

1.  A clinical risk stratification tool for predicting treatment resistance in major depressive disorder.

Authors:  Roy H Perlis
Journal:  Biol Psychiatry       Date:  2013-02-04       Impact factor: 13.382

2.  High-Dimensional descriptor selection and computational QSAR modeling for antitumor activity of ARC-111 analogues Based on Support Vector Regression (SVR).

Authors:  Wei Zhou; Zhijun Dai; Yuan Chen; Haiyan Wang; Zheming Yuan
Journal:  Int J Mol Sci       Date:  2012-01-20       Impact factor: 6.208

3.  CarcinoPred-EL: Novel models for predicting the carcinogenicity of chemicals using molecular fingerprints and ensemble learning methods.

Authors:  Li Zhang; Haixin Ai; Wen Chen; Zimo Yin; Huan Hu; Junfeng Zhu; Jian Zhao; Qi Zhao; Hongsheng Liu
Journal:  Sci Rep       Date:  2017-05-18       Impact factor: 4.379

4.  Which is a more accurate predictor in colorectal survival analysis? Nine data mining algorithms vs. the TNM staging system.

Authors:  Peng Gao; Xin Zhou; Zhen-ning Wang; Yong-xi Song; Lin-lin Tong; Ying-ying Xu; Zhen-yu Yue; Hui-mian Xu
Journal:  PLoS One       Date:  2012-07-25       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.