Kazunaga Matsuki, Victor Kuperman, Julie A Van Dyke.
Abstract
Studies investigating individual differences in reading ability often involve data sets containing a large number of collinear predictors and a small number of observations. In this paper, we discuss the method of Random Forests and demonstrate its suitability for addressing the statistical concerns raised by such datasets. The method is contrasted with other methods of estimating relative variable importance, especially Dominance Analysis and Multimodel Inference. All methods were applied to a dataset that gauged eye-movements during reading and offline comprehension in the context of multiple ability measures with high collinearity due to their shared verbal core. We demonstrate that the Random Forests method surpasses other methods in its ability to handle model overfitting, and accounts for a comparable or larger amount of variance in reading measures relative to other methods.
Keywords: Random Forests; collinearity; eye-movements; individual differences; reading ability; variable importance
Year: 2016 PMID: 26770056 PMCID: PMC4710485 DOI: 10.1080/10888438.2015.1107073
Source DB: PubMed Journal: Sci Stud Read ISSN: 1088-8438