
Interpretability With Accurate Small Models.

Abhishek Ghose, Balaraman Ravindran.

Abstract

Models often need to be constrained to a certain size for them to be considered interpretable. For example, a decision tree of depth 5 is much easier to understand than one of depth 50. Limiting model size, however, often reduces accuracy. We suggest a practical technique that minimizes this trade-off between interpretability and classification accuracy. This enables an arbitrary learning algorithm to produce highly accurate small-sized models. Our technique identifies the training data distribution to learn from that leads to the highest accuracy for a model of a given size. We represent the training distribution as a combination of sampling schemes. Each scheme is defined by a parameterized probability mass function applied to the segmentation produced by a decision tree. An Infinite Mixture Model with Beta components is used to represent a combination of such schemes. The mixture model parameters are learned using Bayesian Optimization. Under simplistic assumptions, we would need to optimize for O(d) variables for a distribution over a d-dimensional input space, which is cumbersome for most real-world data. However, we show that our technique significantly reduces this number to a fixed set of eight variables at the cost of relatively cheap preprocessing. The proposed technique is flexible: it is model-agnostic, i.e., it may be applied to the learning algorithm for any model family, and it admits a general notion of model size. We demonstrate its effectiveness using multiple real-world datasets to construct decision trees, linear probability models and gradient boosted models with different sizes. We observe significant improvements in the F1-score in most instances, exceeding an improvement of 100% in some cases.
Copyright © 2020 Ghose and Ravindran.
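The core idea in the abstract — reweight the training distribution over segments of the input space using a Beta mixture, and search the mixture parameters to maximize the validation F1-score of a size-limited model — can be illustrated with a minimal, self-contained sketch. This is not the authors' implementation: plain random search stands in for Bayesian Optimization, a two-component Beta mixture stands in for the Infinite Mixture Model, and the dataset, tree depths, and search ranges are all illustrative choices.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=1500, n_features=10, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.33, random_state=0)

# A deep "segmentation" tree partitions the input space into leaves,
# as in the paper; each training point is assigned to one leaf.
seg_tree = DecisionTreeClassifier(max_depth=10, random_state=0).fit(X_tr, y_tr)
leaf_ids = seg_tree.apply(X_tr)
# Map each leaf to a coordinate strictly inside (0, 1) so a Beta density
# can assign it a sampling weight.
uniq = np.unique(leaf_ids)
coord = {leaf: (i + 1) / (len(uniq) + 1) for i, leaf in enumerate(uniq)}
xc = np.array([coord[l] for l in leaf_ids])

def beta_weight(x, a, b):
    # Unnormalised Beta(a, b) density; the normaliser cancels when we
    # renormalise the mixture weights into a sampling distribution.
    return x ** (a - 1) * (1 - x) ** (b - 1)

def sample_and_score(a1, b1, a2, b2, pi):
    # Mixture of two Beta components over leaf coordinates -> pmf over points.
    w = pi * beta_weight(xc, a1, b1) + (1 - pi) * beta_weight(xc, a2, b2)
    p = w / w.sum()
    idx = rng.choice(len(xc), size=len(xc), p=p, replace=True)
    # Train the size-constrained model on the reweighted sample.
    small = DecisionTreeClassifier(max_depth=5, random_state=0)
    small.fit(X_tr[idx], y_tr[idx])
    return f1_score(y_val, small.predict(X_val))

# Baseline: the same small model trained on the original distribution.
baseline = f1_score(
    y_val,
    DecisionTreeClassifier(max_depth=5, random_state=0)
    .fit(X_tr, y_tr).predict(X_val),
)

# Random search over mixture parameters (a stand-in for Bayesian Optimization).
best = (baseline, None)
for _ in range(30):
    params = tuple(rng.uniform(0.5, 5.0, size=4)) + (rng.uniform(),)
    score = sample_and_score(*params)
    if score > best[0]:
        best = (score, params)

print(f"baseline F1={baseline:.3f}, best searched F1={best[0]:.3f}")
```

The search keeps the model size fixed (here, `max_depth=5`) and varies only the distribution the model learns from, which is what lets the approach remain model-agnostic.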

Keywords:  Bayesian optimization; ML; density estimation; infinite mixture models; interpretable machine learning

Year:  2020        PMID: 33733123      PMCID: PMC7861231          DOI: 10.3389/frai.2020.00003

Source DB:  PubMed          Journal:  Front Artif Intell        ISSN: 2624-8212


References: 9 in total

1.  Completely derandomized self-adaptation in evolution strategies.

Authors:  N Hansen; A Ostermeier
Journal:  Evol Comput       Date:  2001       Impact factor: 3.277

2.  Rotation forest: A new classifier ensemble method.

Authors:  Juan J Rodríguez; Ludmila I Kuncheva; Carlos J Alonso
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2006-10       Impact factor: 6.226

3.  A comparison of methods for multiclass support vector machines.

Authors:  Chih-Wei Hsu; Chih-Jen Lin
Journal:  IEEE Trans Neural Netw       Date:  2002

4.  Optimization by simulated annealing.

Authors:  S Kirkpatrick; C D Gelatt; M P Vecchi
Journal:  Science       Date:  1983-05-13       Impact factor: 47.728

5.  Learning interactions via hierarchical group-lasso regularization.

Authors:  Michael Lim; Trevor Hastie
Journal:  J Comput Graph Stat       Date:  2015-09-16       Impact factor: 2.302

6.  Searching for exotic particles in high-energy physics with deep learning.

Authors:  P Baldi; P Sadowski; D Whiteson
Journal:  Nat Commun       Date:  2014-07-02       Impact factor: 14.919

7.  Interpretable Decision Sets: A Joint Framework for Description and Prediction.

Authors:  Himabindu Lakkaraju; Stephen H Bach; Jure Leskovec
Journal:  KDD       Date:  2016-08

8.  Distributed Newton Methods for Deep Neural Networks.

Authors:  Chien-Chih Wang; Kent Loong Tan; Chun-Ting Chen; Yu-Hsiang Lin; S Sathiya Keerthi; Dhruv Mahajan; S Sundararajan; Chih-Jen Lin
Journal:  Neural Comput       Date:  2018-04-13       Impact factor: 2.026

9.  Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.

Authors:  Andrew V Uzilov; Joshua M Keegan; David H Mathews
Journal:  BMC Bioinformatics       Date:  2006-03-27       Impact factor: 3.169

Cited by: 1 in total

1.  Post-Analysis of Predictive Modeling with an Epidemiological Example.

Authors:  Christina Brester; Ari Voutilainen; Tomi-Pekka Tuomainen; Jussi Kauhanen; Mikko Kolehmainen
Journal:  Healthcare (Basel)       Date:  2021-06-24
