
Combinatorial QSAR of ambergris fragrance compounds.

Assia Kovatcheva, Alexander Golbraikh, Scott Oloff, Yun-De Xiao, Weifan Zheng, Peter Wolschann, Gerhard Buchbauer, Alexander Tropsha.

Abstract

A combinatorial quantitative structure-activity relationship (Combi-QSAR) approach has been developed and applied to a data set of 98 ambergris fragrance compounds with complex stereochemistry. The Combi-QSAR approach explores all possible combinations of different independent descriptor collections and various individual correlation methods to obtain statistically significant models with high internal (for the training set) and external (for the test set) accuracy. Seven different descriptor collections were generated with commercially available MOE, CoMFA, CoMMA, Dragon, VolSurf, and MolconnZ programs; we also included chirality topological descriptors recently developed in our laboratory (Golbraikh, A.; Bonchev, D.; Tropsha, A. J. Chem. Inf. Comput. Sci. 2001, 41, 147-158). CoMMA descriptors were used in combination with MOE descriptors. MolconnZ descriptors were used in combination with chirality descriptors. Each descriptor collection was combined individually with four correlation methods, including k-nearest neighbors (kNN) classification, Support Vector Machines (SVM), decision trees, and binary QSAR, giving rise to 28 different types of QSAR models. Multiple diverse and representative training and test sets were generated by dividing the original data set in two. Each model with high values of leave-one-out cross-validated correct classification rate for the training set was subjected to extensive internal and external validation to avoid overfitting and achieve reliable predictive power. Two validation techniques were employed: randomization of the target property (in this case, odor intensity), also known as the Y-randomization test, and assessment of external prediction accuracy using test sets. We demonstrate that not every combination of data modeling technique and descriptor collection yields a validated and predictive QSAR model.
kNN classification in combination with CoMFA descriptors was found to be the best QSAR approach overall, since predictive models with correct classification rates for both training and test sets of 0.7 and higher were obtained for all divisions of the ambergris data set into the training and test sets. Many predictive QSAR models were also found using the kNN classification method in combination with other descriptor collections. The combinatorial QSAR affords automation, computational efficiency, and a higher probability of identifying significant QSAR models for experimental data sets than the traditional approaches that rely on a single QSAR method.
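The workflow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes placeholder names for the seven descriptor collections (the abstract does not enumerate them), a toy two-class data set in place of the ambergris compounds, and a plain Euclidean kNN classifier written from scratch. All function names (`model_types`, `knn_predict`, `loo_ccr`, `y_randomization`) are hypothetical. The sketch enumerates the 7 x 4 = 28 model types, computes a leave-one-out correct classification rate (CCR), and runs a Y-randomization test by shuffling the class labels.

```python
import itertools
import random

# Placeholder labels (assumption): the abstract names the descriptor programs
# but does not list the seven final collections, so generic names are used.
DESCRIPTOR_COLLECTIONS = [f"collection_{i}" for i in range(1, 8)]
METHODS = ["kNN", "SVM", "decision tree", "binary QSAR"]


def model_types():
    """Every (descriptor collection, method) pair: 7 x 4 = 28 model types."""
    return list(itertools.product(DESCRIPTOR_COLLECTIONS, METHODS))


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to the query."""
    ranked = sorted(
        range(len(train_x)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_x[i], query)),
    )
    votes = [train_y[i] for i in ranked[:k]]
    return max(sorted(set(votes)), key=votes.count)


def loo_ccr(x, y, k=3):
    """Leave-one-out cross-validated correct classification rate."""
    hits = 0
    for i in range(len(x)):
        rest_x = x[:i] + x[i + 1:]
        rest_y = y[:i] + y[i + 1:]
        hits += knn_predict(rest_x, rest_y, x[i], k) == y[i]
    return hits / len(x)


def y_randomization(x, y, n_trials=20, k=3, seed=0):
    """LOO CCR values obtained after shuffling the class labels."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_trials):
        shuffled = y[:]
        rng.shuffle(shuffled)
        scores.append(loo_ccr(x, shuffled, k))
    return scores


# Toy data (assumption): two well-separated clusters standing in for
# descriptor vectors of two activity classes.
X = [(0.1 * i, 0.1 * i) for i in range(10)] + [(5 + 0.1 * i, 5 + 0.1 * i) for i in range(10)]
Y = [0] * 10 + [1] * 10

real_ccr = loo_ccr(X, Y)
random_ccrs = y_randomization(X, Y)
print("model types:", len(model_types()))
print("LOO CCR with real labels:", real_ccr)
print("mean LOO CCR with shuffled labels:", sum(random_ccrs) / len(random_ccrs))
```

A model is treated as suspect when its CCR on the real labels is not clearly higher than the CCR values obtained with shuffled labels. The abstract's second check, external prediction accuracy on held-out test sets, is not shown here.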

Year:  2004        PMID: 15032539     DOI: 10.1021/ci034203t

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  20 in total

1.  Development and implementation of (Q)SAR modeling within the CHARMMing web-user interface.

Authors:  Iwona E Weidlich; Yuri Pevzner; Benjamin T Miller; Igor V Filippov; H Lee Woodcock; Bernard R Brooks
Journal:  J Comput Chem       Date:  2014-11-03       Impact factor: 3.376

2.  Automated QSPR through Competitive Workflow.

Authors:  J Cartmell; S Enoch; D Krstajic; D E Leahy
Journal:  J Comput Aided Mol Des       Date:  2006-01-17       Impact factor: 3.686

3.  A novel automated lazy learning QSAR (ALL-QSAR) approach: method development, applications, and virtual screening of chemical databases using validated ALL-QSAR models.

Authors:  Shuxing Zhang; Alexander Golbraikh; Scott Oloff; Harold Kohn; Alexander Tropsha
Journal:  J Chem Inf Model       Date:  2006 Sep-Oct       Impact factor: 4.956

4.  kScore: a novel machine learning approach that is not dependent on the data structure of the training set.

Authors:  Scott Oloff; Ingo Muegge
Journal:  J Comput Aided Mol Des       Date:  2007-02-28       Impact factor: 3.686

5.  Differentiation of AmpC beta-lactamase binders vs. decoys using classification kNN QSAR modeling and application of the QSAR classifier to virtual screening.

Authors:  Jui-Hua Hsieh; Xiang S Wang; Denise Teotico; Alexander Golbraikh; Alexander Tropsha
Journal:  J Comput Aided Mol Des       Date:  2008-03-13       Impact factor: 3.686

6.  Hierarchical QSAR technology based on the Simplex representation of molecular structure.

Authors:  V E Kuz'min; A G Artemenko; E N Muratov
Journal:  J Comput Aided Mol Des       Date:  2008-02-06       Impact factor: 3.686

7.  QSAR modeling of the blood-brain barrier permeability for diverse organic compounds.

Authors:  Liying Zhang; Hao Zhu; Tudor I Oprea; Alexander Golbraikh; Alexander Tropsha
Journal:  Pharm Res       Date:  2008-06-14       Impact factor: 4.200

8.  Discovery of novel antimalarial compounds enabled by QSAR-based virtual screening.

Authors:  Liying Zhang; Denis Fourches; Alexander Sedykh; Hao Zhu; Alexander Golbraikh; Sean Ekins; Julie Clark; Michele C Connelly; Martina Sigal; Dena Hodges; Armand Guiguemde; R Kiplin Guy; Alexander Tropsha
Journal:  J Chem Inf Model       Date:  2013-01-23       Impact factor: 4.956

9.  Development of improved models for phosphodiesterase-4 inhibitors with a multi-conformational structure-based QSAR method.

Authors:  Adetokunbo Adekoya; Xialan Dong; Jerry Ebalunode; Weifan Zheng
Journal:  Curr Chem Genomics       Date:  2009-12-31

10.  Discovery of geranylgeranyltransferase-I inhibitors with novel scaffolds by the means of quantitative structure-activity relationship modeling, virtual screening, and experimental validation.

Authors:  Yuri K Peterson; Xiang S Wang; Patrick J Casey; Alexander Tropsha
Journal:  J Med Chem       Date:  2009-07-23       Impact factor: 7.446
