An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.

Carolin Strobl, James Malley, Gerhard Tutz.

Abstract

Recursive partitioning methods have become popular and widely used tools for nonparametric regression and classification in many scientific fields. Random forests in particular, which can deal with large numbers of predictor variables even in the presence of complex interactions, have been applied successfully in genetics, clinical medicine, and bioinformatics within the past few years. High-dimensional problems are common not only in genetics, but also in some areas of psychological research, where only a few subjects can be measured because of time or cost constraints, yet a large amount of data is generated for each subject. Random forests have been shown to achieve high prediction accuracy in such applications and to provide descriptive variable importance measures that reflect the impact of each variable in both main effects and interactions. The aim of this work is to introduce the principles of the standard recursive partitioning methods as well as recent methodological improvements, to illustrate their use for low- and high-dimensional data exploration, and to point out limitations of the methods and potential pitfalls in their practical application. Application of the methods is illustrated with freely available implementations in the R system for statistical computing. (c) 2009 APA, all rights reserved.

Year: 2009    PMID: 19968396    PMCID: PMC2927982    DOI: 10.1037/a0016973

Source DB: PubMed    Journal: Psychol Methods    ISSN: 1082-989X


