Literature DB >> 27454257

Statistical learning theory for high dimensional prediction: Application to criterion-keyed scale development.

Benjamin P Chapman1, Alexander Weiss2, Paul R Duberstein1.   

Abstract

Statistical learning theory (SLT) is the statistical formulation of machine learning theory, a body of analytic methods common in "big data" problems. Regression-based SLT algorithms seek to maximize predictive accuracy for some outcome, given a large pool of potential predictors, without overfitting the sample. Research goals in psychology may sometimes call for high dimensional regression. One example is criterion-keyed scale construction, where a scale with maximal predictive validity must be built from a large item pool. Using this as a working example, we first introduce a core principle of SLT methods: minimization of expected prediction error (EPE). Minimizing EPE is fundamentally different than maximizing the within-sample likelihood, and hinges on building a predictive model of sufficient complexity to predict the outcome well, without undue complexity leading to overfitting. We describe how such models are built and refined via cross-validation. We then illustrate how 3 common SLT algorithms-supervised principal components, regularization, and boosting-can be used to construct a criterion-keyed scale predicting all-cause mortality, using a large personality item pool within a population cohort. Each algorithm illustrates a different approach to minimizing EPE. Finally, we consider broader applications of SLT predictive algorithms, both as supportive analytic tools for conventional methods, and as primary analytic tools in discovery phase research. We conclude that despite their differences from the classic null-hypothesis testing approach-or perhaps because of them-SLT methods may hold value as a statistically rigorous approach to exploratory regression. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

Entities:  

Mesh:

Year:  2016        PMID: 27454257      PMCID: PMC5138114          DOI: 10.1037/met0000088

Source DB:  PubMed          Journal:  Psychol Methods        ISSN: 1082-989X


  27 in total

1.  Campbell and Rubin: A primer and comparison of their approaches to causal inference in field settings.

Authors:  William R Shadish
Journal:  Psychol Methods       Date:  2010-03

Review 2.  Simultaneous and selective inference: Current successes and future challenges.

Authors:  Yoav Benjamini
Journal:  Biom J       Date:  2010-11-19       Impact factor: 2.207

Review 3.  Model selection and psychological theory: a discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC).

Authors:  Scott I Vrieze
Journal:  Psychol Methods       Date:  2012-02-06

4.  When effect sizes disagree: the case of r and d.

Authors:  Robert E McGrath; Gregory J Meyer
Journal:  Psychol Methods       Date:  2006-12

5.  An abductive theory of scientific method.

Authors:  Brian D Haig
Journal:  Psychol Methods       Date:  2005-12

Review 6.  Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.

Authors:  F E Harrell; K L Lee; D B Mark
Journal:  Stat Med       Date:  1996-02-28       Impact factor: 2.373

Review 7.  Combating unmeasured confounding in cross-sectional studies: evaluating instrumental-variable and Heckman selection models.

Authors:  Alfred DeMaris
Journal:  Psychol Methods       Date:  2014-08-11

8.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

9.  Integrative data analysis: the simultaneous analysis of multiple data sets.

Authors:  Patrick J Curran; Andrea M Hussong
Journal:  Psychol Methods       Date:  2009-06

10.  A questionnaire-wide association study of personality and mortality: the Vietnam Experience Study.

Authors:  Alexander Weiss; Catharine R Gale; G David Batty; Ian J Deary
Journal:  J Psychosom Res       Date:  2013-04-08       Impact factor: 3.006

View more
  5 in total

1.  Psychometric evaluation of a patient-reported item bank for healthcare engagement.

Authors:  Benjamin D Schalet; Steven P Reise; Donna M Zulman; Eleanor T Lewis; Rachel Kimerling
Journal:  Qual Life Res       Date:  2021-04-09       Impact factor: 4.147

2.  Natural Language Processing and Psychosis: On the Need for Comprehensive Psychometric Evaluation.

Authors:  Alex S Cohen; Zachary Rodriguez; Kiara K Warren; Tovah Cowan; Michael D Masucci; Ole Edvard Granrud; Terje B Holmlund; Chelsea Chandler; Peter W Foltz; Gregory P Strauss
Journal:  Schizophr Bull       Date:  2022-09-01       Impact factor: 7.348

3.  Health risk prediction models incorporating personality data: Motivation, challenges, and illustration.

Authors:  Benjamin P Chapman; Feng Lin; Shumita Roy; Ralph H B Benedict; Jeffrey M Lyness
Journal:  Personal Disord       Date:  2019-01

4.  Genome-wide association and genomic prediction identifies soybean cyst nematode resistance in common bean including a syntenic region to soybean Rhg1 locus.

Authors:  Liwei Wen; Hao-Xun Chang; Patrick J Brown; Leslie L Domier; Glen L Hartman
Journal:  Hortic Res       Date:  2019-01-01       Impact factor: 6.793

5.  Adaptive interventions for optimizing malaria control: an implementation study protocol for a block-cluster randomized, sequential multiple assignment trial.

Authors:  Guofa Zhou; Ming-Chieh Lee; Harrysone E Atieli; John I Githure; Andrew K Githeko; James W Kazura; Guiyun Yan
Journal:  Trials       Date:  2020-07-20       Impact factor: 2.279

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.