Literature DB >> 23039214

Introducing uncertainty in predictive modeling--friend or foe?

Ulf Norinder1, Henrik Boström.   

Abstract

Uncertainty was introduced to chemical descriptors of 16 publicly available data sets to various degrees and in various ways in order to investigate the effect on the predictive performance of the state-of-the-art method decision tree ensembles. A number of strategies to handle uncertainty in decision tree ensembles were evaluated. The main conclusion of the study is that uncertainty to a large extent may be introduced in chemical descriptors without impairing the predictive performance of ensembles and without the predictive performance being significantly reduced from a practical point of view. The investigation further showed that even when distributions of uncertain values were provided, the ensembles method could generate equally effective models from single-point samples from these distributions. Hence, there seems to be no advantage in using more elaborate methods for handling uncertainty in chemical descriptors when using decision tree ensembles as a modeling method for the considered types of introduced uncertainty.

Mesh:

Substances:

Year:  2012        PMID: 23039214     DOI: 10.1021/ci3003446

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  1 in total

1.  Representing descriptors derived from multiple conformations as uncertain features for machine learning.

Authors:  Ulf Norinder; Henrik Boström
Journal:  J Mol Model       Date:  2013-03-12       Impact factor: 1.810

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.