Literature DB >> 22224501

In silico toxicity prediction by support vector machine and SMILES representation-based string kernel.

D-S Cao1, J-C Zhao, Y-N Yang, C-X Zhao, J Yan, S Liu, Q-N Hu, Q-S Xu, Y-Z Liang.   

Abstract

There is a great need to assess the harmful effects or toxicities of chemicals to which man is exposed. In the present paper, the simplified molecular input line entry specification (SMILES) representation-based string kernel, together with the state-of-the-art support vector machine (SVM) algorithm, were used to classify the toxicity of chemicals from the US Environmental Protection Agency Distributed Structure-Searchable Toxicity (DSSTox) database network. In this method, the molecular structure can be directly encoded by a series of SMILES substrings that represent the presence of some chemical elements and different kinds of chemical bonds (double, triple and stereochemistry) in the molecules. Thus, SMILES string kernel can accurately and directly measure the similarities of molecules by a series of local information hidden in the molecules. Two model validation approaches, five-fold cross-validation and independent validation set, were used for assessing the predictive capability of our developed models. The results obtained indicate that SVM based on the SMILES string kernel can be regarded as a very promising and alternative modelling approach for potential toxicity prediction of chemicals.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22224501     DOI: 10.1080/1062936X.2011.645874

Source DB:  PubMed          Journal:  SAR QSAR Environ Res        ISSN: 1026-776X            Impact factor:   3.000


  9 in total

1.  Revealing new therapeutic opportunities through drug target prediction: a class imbalance-tolerant machine learning approach.

Authors:  Siqi Liang; Haiyuan Yu
Journal:  Bioinformatics       Date:  2020-08-15       Impact factor: 6.937

2.  Drug-Drug Interaction Discovery: Kernel Learning from Heterogeneous Similarities.

Authors:  Devendra Singh Dhami; Gautam Kunapuli; Mayukh Das; David Page; Sriraam Natarajan
Journal:  Smart Health (Amst)       Date:  2018-07-07

3.  Prediction of developmental chemical toxicity based on gene networks of human embryonic stem cells.

Authors:  Junko Yamane; Sachiyo Aburatani; Satoshi Imanishi; Hiromi Akanuma; Reiko Nagano; Tsuyoshi Kato; Hideko Sone; Seiichiroh Ohsako; Wataru Fujibuchi
Journal:  Nucleic Acids Res       Date:  2016-05-20       Impact factor: 16.971

4.  A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction.

Authors:  Hakime Öztürk; Elif Ozkirimli; Arzucan Özgür
Journal:  BMC Bioinformatics       Date:  2016-03-18       Impact factor: 3.169

5.  A novel methodology on distributed representations of proteins using their interacting ligands.

Authors:  Hakime Öztürk; Elif Ozkirimli; Arzucan Özgür
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

6.  BigSMILES: A Structurally-Based Line Notation for Describing Macromolecules.

Authors:  Tzyy-Shyang Lin; Connor W Coley; Hidenobu Mochigase; Haley K Beech; Wencong Wang; Zi Wang; Eliot Woods; Stephen L Craig; Jeremiah A Johnson; Julia A Kalow; Klavs F Jensen; Bradley D Olsen
Journal:  ACS Cent Sci       Date:  2019-09-12       Impact factor: 14.553

7.  Descriptor Free QSAR Modeling Using Deep Learning With Long Short-Term Memory Neural Networks.

Authors:  Suman K Chakravarti; Sai Radha Mani Alla
Journal:  Front Artif Intell       Date:  2019-09-06

8.  Polygrammar: Grammar for Digital Polymer Representation and Generation.

Authors:  Minghao Guo; Wan Shou; Liane Makatura; Timothy Erps; Michael Foshey; Wojciech Matusik
Journal:  Adv Sci (Weinh)       Date:  2022-06-09       Impact factor: 17.521

9.  The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees.

Authors:  Erik Lampa; Lars Lind; P Monica Lind; Anna Bornefalk-Hermansson
Journal:  Environ Health       Date:  2014-07-04       Impact factor: 5.984

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.