| Literature DB >> 30906917 |
Quentin F Gronau1, Eric-Jan Wagenmakers1.
Abstract
Cross-validation (CV) is increasingly popular as a generic method to adjudicate between mathematical models of cognition and behavior. In order to measure model generalizability, CV quantifies out-of-sample predictive performance, and the CV preference goes to the model that predicted the out-of-sample data best. The advantages of CV include theoretic simplicity and practical feasibility. Despite its prominence, however, the limitations of CV are often underappreciated. Here, we demonstrate the limitations of a particular form of CV-Bayesian leave-one-out cross-validation or LOO-with three concrete examples. In each example, a data set of infinite size is perfectly in line with the predictions of a simple model (i.e., a general law or invariance). Nevertheless, LOO shows bounded and relatively modest support for the simple model. We conclude that CV is not a panacea for model selection.Entities:
Keywords: Bounded support; Consistency; Evidence; Generalizability; Induction; Principle of parsimony
Year: 2018 PMID: 30906917 PMCID: PMC6400414 DOI: 10.1007/s42113-018-0011-7
Source DB: PubMed Journal: Comput Brain Behav ISSN: 2522-0861
Fig. 1Example 1: LOO weights for as a function of the number of confirmatory instances n, evaluated in relation to five different prior specifications for : a ; b ; c ; d ; and e . The dotted horizontal lines indicate the corresponding analytical asymptotic bounds (see text for details). Available at https://tinyurl.com/ya2r4gx8 under CC license https://creativecommons.org/licenses/by/2.0/
Fig. 2Example 2: LOO weights for as a function of the number of observations n, where the number of successes k = n/2, evaluated in relation to five different prior specifications for : a ; b ; c ; d ; and e . The dotted horizontal line indicates the corresponding analytical asymptotic bound. Note that only even sample sizes are displayed (see text for details). Available at https://tinyurl.com/y8azu4hc under CC license https://creativecommons.org/licenses/by/2.0/
Fig. 3Example 3: LOO weights for as a function of sample size n, for data sets with sample mean equal to zero and sample variance equal to one, evaluated in relation to four different prior specifications for : a ; b ; c ; and d . The dotted horizontal line indicates the corresponding analytical asymptotic bound (see text for details). Available at https://tinyurl.com/y7qhtp3o under CC license https://creativecommons.org/licenses/by/2.0/