Literature DB >> 19807194

A model-based ensembling approach for developing QSARs.

Qianyi Zhang1, Jacqueline M Hughes-Oliver, Raymond T Ng.   

Abstract

Ensemble methods have become popular for QSAR modeling, but most studies have assumed balanced data, consisting of approximately equal numbers of active and inactive compounds. Cheminformatics data are often far from being balanced. We extend the application of ensemble methods to include cases of imbalance of class membership and to more adequately assess model output. Based on the extension, we propose an ensemble method called MBEnsemble that automatically determines the appropriate tuning parameters to provide reliable predictions and maximize the F-measure. Results from multiple data sets demonstrate that the proposed ensemble technique works well on imbalanced data.

Entities:  

Mesh:

Year:  2009        PMID: 19807194      PMCID: PMC2760036          DOI: 10.1021/ci900080f

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  9 in total

1.  Analysis of a large structure/biological activity data set using recursive partitioning.

Authors:  A Rusinko; M W Farmen; C G Lambert; P L Brown; S S Young
Journal:  J Chem Inf Comput Sci       Date:  1999 Nov-Dec

2.  Predicting CNS permeability of drug molecules: comparison of neural network and support vector machine algorithms.

Authors:  Scott Doniger; Thomas Hofmann; Joanne Yeh
Journal:  J Comput Biol       Date:  2002       Impact factor: 1.479

3.  Random forest: a classification and regression tool for compound classification and QSAR modeling.

Authors:  Vladimir Svetnik; Andy Liaw; Christopher Tong; J Christopher Culberson; Robert P Sheridan; Bradley P Feuston
Journal:  J Chem Inf Comput Sci       Date:  2003 Nov-Dec

Review 4.  Neural networks applied to quantitative structure-activity relationship analysis.

Authors:  T Aoyama; Y Suzuki; H Ichikawa
Journal:  J Med Chem       Date:  1990-09       Impact factor: 7.446

5.  A comparison of methods for modeling quantitative structure-activity relationships.

Authors:  Jeffrey J Sutherland; Lee A O'Brien; Donald F Weaver
Journal:  J Med Chem       Date:  2004-10-21       Impact factor: 7.446

6.  PowerMV: a software environment for molecular viewing, descriptor generation, data analysis and hit evaluation.

Authors:  Kejun Liu; Jun Feng; S Stanley Young
Journal:  J Chem Inf Model       Date:  2005 Mar-Apr       Impact factor: 4.956

7.  Contemporary QSAR classifiers compared.

Authors:  Craig L Bruce; James L Melville; Stephen D Pickett; Jonathan D Hirst
Journal:  J Chem Inf Model       Date:  2007 Jan-Feb       Impact factor: 4.956

8.  On the significance of clusters in the graphical display of structure-activity data.

Authors:  J W McFarland; D J Gans
Journal:  J Med Chem       Date:  1986-04       Impact factor: 7.446

9.  QSAR and k-nearest neighbor classification analysis of selective cyclooxygenase-2 inhibitors using topologically-based numerical descriptors.

Authors:  G W Kauffman; P C Jurs
Journal:  J Chem Inf Comput Sci       Date:  2001 Nov-Dec
  9 in total
  4 in total

1.  A novel adaptive ensemble classification framework for ADME prediction.

Authors:  Ming Yang; Jialei Chen; Liwen Xu; Xiufeng Shi; Xin Zhou; Zhijun Xi; Rui An; Xinhong Wang
Journal:  RSC Adv       Date:  2018-03-26       Impact factor: 4.036

2.  A computational pipeline for the development of multi-marker bio-signature panels and ensemble classifiers.

Authors:  Oliver P Günther; Virginia Chen; Gabriela Cohen Freue; Robert F Balshaw; Scott J Tebbutt; Zsuzsanna Hollander; Mandeep Takhar; W Robert McMaster; Bruce M McManus; Paul A Keown; Raymond T Ng
Journal:  BMC Bioinformatics       Date:  2012-12-08       Impact factor: 3.169

3.  Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods.

Authors:  María Jimena Martínez; Ignacio Ponzoni; Mónica F Díaz; Gustavo E Vazquez; Axel J Soto
Journal:  J Cheminform       Date:  2015-08-19       Impact factor: 5.514

4.  A structure-based model for predicting serum albumin binding.

Authors:  Katrina W Lexa; Elena Dolghih; Matthew P Jacobson
Journal:  PLoS One       Date:  2014-04-01       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.