| Literature DB >> 12653517 |
Weida Tong1, Huixiao Hong, Hong Fang, Qian Xie, Roger Perkins.
Abstract
The techniques of combining the results of multiple classification models to produce a single prediction have been investigated for many years. In earlier applications, the multiple models to be combined were developed by altering the training set. The use of these so-called resampling techniques, however, poses the risk of reducing predictivity of the individual models to be combined and/or over fitting the noise in the data, which might result in poorer prediction of the composite model than the individual models. In this paper, we suggest a novel approach, named Decision Forest, that combines multiple Decision Tree models. Each Decision Tree model is developed using a unique set of descriptors. When models of similar predictive quality are combined using the Decision Forest method, quality compared to the individual models is consistently and significantly improved in both training and testing steps. An example will be presented for prediction of binding affinity of 232 chemicals to the estrogen receptor.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12653517 DOI: 10.1021/ci020058s
Source DB: PubMed Journal: J Chem Inf Comput Sci ISSN: 0095-2338