Literature DB >> 27918183

Finding structure in data using multivariate tree boosting.

Patrick J Miller1, Gitta H Lubke1, Daniel B McArtor1, C S Bergeman1.   

Abstract

Technology and collaboration enable dramatic increases in the size of psychological and psychiatric data collections, but finding structure in these large data sets with many collected variables is challenging. Decision tree ensembles such as random forests (Strobl, Malley, & Tutz, 2009) are a useful tool for finding structure, but are difficult to interpret with multiple outcome variables which are often of interest in psychology. To find and interpret structure in data sets with multiple outcomes and many predictors (possibly exceeding the sample size), we introduce a multivariate extension to a decision tree ensemble method called gradient boosted regression trees (Friedman, 2001). Our extension, multivariate tree boosting, is a method for nonparametric regression that is useful for identifying important predictors, detecting predictors with nonlinear effects and interactions without specification of such effects, and for identifying predictors that cause 2 or more outcome variables to covary. We provide the R package "mvtboost" to estimate, tune, and interpret the resulting model, which extends the implementation of univariate boosting in the R package "gbm" (Ridgeway, 2015) to continuous, multivariate outcomes. To illustrate the approach, we analyze predictors of psychological well-being (Ryff & Keyes, 1995). Simulations verify that our approach identifies predictors with nonlinear effects and achieves high prediction accuracy, exceeding or matching the performance of (penalized) multivariate multiple regression and multivariate decision trees over a wide range of conditions. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

Entities:  

Mesh:

Year:  2016        PMID: 27918183      PMCID: PMC5142230          DOI: 10.1037/met0000087

Source DB:  PubMed          Journal:  Psychol Methods        ISSN: 1082-989X


  28 in total

1.  A multivariate test of association.

Authors:  Manuel A R Ferreira; Shaun M Purcell
Journal:  Bioinformatics       Date:  2008-11-19       Impact factor: 6.937

2.  A working guide to boosted regression trees.

Authors:  J Elith; J R Leathwick; T Hastie
Journal:  J Anim Ecol       Date:  2008-04-08       Impact factor: 5.091

3.  Measurement of physical health in a general population survey.

Authors:  N B Belloc; L Breslow; J R Hochstim
Journal:  Am J Epidemiol       Date:  1971-05       Impact factor: 4.897

4.  Factors associated with caregiver stability in permanent placements: a classification tree approach.

Authors:  Laura J Proctor; Katherine Van Dusen Randazzo; Alan J Litrownik; Rae R Newton; Inger P Davis; Miguel Villodas
Journal:  Child Abuse Negl       Date:  2011-06-08

5.  A global measure of perceived stress.

Authors:  S Cohen; T Kamarck; R Mermelstein
Journal:  J Health Soc Behav       Date:  1983-12

6.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

7.  Big data: The future of biocuration.

Authors:  Doug Howe; Maria Costanzo; Petra Fey; Takashi Gojobori; Linda Hannick; Winston Hide; David P Hill; Renate Kania; Mary Schaeffer; Susan St Pierre; Simon Twigger; Owen White; Seung Yon Rhee
Journal:  Nature       Date:  2008-09-04       Impact factor: 49.962

8.  Structural equation model trees.

Authors:  Andreas M Brandmaier; Timo von Oertzen; John J McArdle; Ulman Lindenberger
Journal:  Psychol Methods       Date:  2012-09-17

9.  Measures of perceived social support from friends and from family: three validation studies.

Authors:  M E Procidano; K Heller
Journal:  Am J Community Psychol       Date:  1983-02

10.  Bias in random forest variable importance measures: illustrations, sources and a solution.

Authors:  Carolin Strobl; Anne-Laure Boulesteix; Achim Zeileis; Torsten Hothorn
Journal:  BMC Bioinformatics       Date:  2007-01-25       Impact factor: 3.169

View more
  6 in total

1.  Machine Learning Analysis Reveals Novel Neuroimaging and Clinical Signatures of Frailty in HIV.

Authors:  Robert H Paul; Kyu S Cho; Patrick Luckett; Jeremy F Strain; Andrew C Belden; Jacob D Bolzenius; Jaimie Navid; Paola M Garcia-Egan; Sarah A Cooley; Julie K Wisch; Anna H Boerwinkle; Dimitre Tomov; Abel Obosi; Julie A Mannarino; Beau M Ances
Journal:  J Acquir Immune Defic Syndr       Date:  2020-08-01       Impact factor: 3.731

2.  Cognitive Phenotypes of HIV Defined Using a Novel Data-driven Approach.

Authors:  Robert H Paul; Kyu Cho; Andrew Belden; Adam W Carrico; Eileen Martin; Jacob Bolzenius; Patrick Luckett; Sarah A Cooley; Julie Mannarino; Jodi M Gilman; Mariah Miano; Beau M Ances
Journal:  J Neuroimmune Pharmacol       Date:  2022-01-04       Impact factor: 4.147

3.  Predicting How Well Adolescents Get Along with Peers and Teachers: A Machine Learning Approach.

Authors:  Farhan Ali; Rebecca P Ang
Journal:  J Youth Adolesc       Date:  2022-04-04

4.  Individual Differences in CD4/CD8 T-Cell Ratio Trajectories and Associated Risk Profiles Modeled From Acute HIV Infection.

Authors:  Robert Paul; Kyu Cho; Jacob Bolzenius; Carlo Sacdalan; Lishomwa C Ndhlovu; Lydie Trautmann; Shelly Krebs; Somporn Tipsuk; Trevor A Crowell; Duanghathai Suttichom; Donn J Colby; Thomas A Premeaux; Nittaya Phanuphak; Phillip Chan; Eugène Kroon; Sandhya Vasan; Denise Hsu; Adam Carrico; Victor Valcour; Jintanat Ananworanich; Merlin L Robb; Julie A Ake; Somchai Sriplienchan; Serena Spudich
Journal:  Psychosom Med       Date:  2022-07-06       Impact factor: 3.864

5.  Knockoff boosted tree for model-free variable selection.

Authors:  Tao Jiang; Yuanyuan Li; Alison A Motsinger-Reif
Journal:  Bioinformatics       Date:  2021-05-17       Impact factor: 6.937

6.  Ensemble machine learning classification of daily living abilities among older people with HIV.

Authors:  Robert Paul; Torie Tsuei; Kyu Cho; Andrew Belden; Benedetta Milanini; Jacob Bolzenius; Shireen Javandel; Joseph McBride; Lucette Cysique; Samantha Lesinski; Victor Valcour
Journal:  EClinicalMedicine       Date:  2021-05-07
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.