Literature DB >> 26706666

Propensity score and proximity matching using random forest.

Peng Zhao1, Xiaogang Su2, Tingting Ge3, Juanjuan Fan4.   

Abstract

In order to derive unbiased inference from observational data, matching methods are often applied to produce balanced treatment and control groups in terms of all background variables. Propensity score has been a key component in this research area. However, propensity score based matching methods in the literature have several limitations, such as model mis-specifications, categorical variables with more than two levels, difficulties in handling missing data, and nonlinear relationships. Random forest, averaging outcomes from many decision trees, is nonparametric in nature, straightforward to use, and capable of solving these issues. More importantly, the precision afforded by random forest (Caruana et al., 2008) may provide us with a more accurate and less model dependent estimate of the propensity score. In addition, the proximity matrix, a by-product of the random forest, may naturally serve as a distance measure between observations that can be used in matching. The proposed random forest based matching methods are applied to data from the National Health and Nutrition Examination Survey (NHANES). Our results show that the proposed methods can produce well balanced treatment and control groups. An illustration is also provided that the methods can effectively deal with missing data in covariates.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Matching; Observational study; Propensity score; Proximity; Random forest

Mesh:

Year:  2015        PMID: 26706666      PMCID: PMC4818178          DOI: 10.1016/j.cct.2015.12.012

Source DB:  PubMed          Journal:  Contemp Clin Trials        ISSN: 1551-7144            Impact factor:   2.226


  11 in total

1.  A comparison of two methods of estimating propensity scores after multiple imputation.

Authors:  Robin Mitra; Jerome P Reiter
Journal:  Stat Methods Med Res       Date:  2012-06-11       Impact factor: 3.021

2.  Matching methods for causal inference: A review and a look forward.

Authors:  Elizabeth A Stuart
Journal:  Stat Sci       Date:  2010-02-01       Impact factor: 2.901

3.  Evaluating uses of data mining techniques in propensity score estimation: a simulation study.

Authors:  Soko Setoguchi; Sebastian Schneeweiss; M Alan Brookhart; Robert J Glynn; E Francis Cook
Journal:  Pharmacoepidemiol Drug Saf       Date:  2008-06       Impact factor: 2.890

4.  Using multiple imputation and propensity scores to test the effect of car seats and seat belt usage on injury severity from trauma registry data.

Authors:  John R Hayes; Jonathan I Groner
Journal:  J Pediatr Surg       Date:  2008-05       Impact factor: 2.545

5.  Comment: Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data.

Authors:  Anastasios A Tsiatis; Marie Davidian
Journal:  Stat Sci       Date:  2007       Impact factor: 2.901

6.  Propensity score methods for bias reduction in the comparison of a treatment to a non-randomized control group.

Authors:  R B D'Agostino
Journal:  Stat Med       Date:  1998-10-15       Impact factor: 2.373

7.  Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression.

Authors:  Daniel Westreich; Justin Lessler; Michele Jonsson Funk
Journal:  J Clin Epidemiol       Date:  2010-08       Impact factor: 6.437

8.  Improving propensity score weighting using machine learning.

Authors:  Brian K Lee; Justin Lessler; Elizabeth A Stuart
Journal:  Stat Med       Date:  2010-02-10       Impact factor: 2.373

9.  Effect of smoking on hemoglobin A1c and body mass index in patients with type 2 diabetes mellitus.

Authors:  Patricia McCulloch; Sheila Lee; Russell Higgins; Karissa McCall; David S Schade
Journal:  J Investig Med       Date:  2002-07       Impact factor: 2.895

Review 10.  Consequences of smoking for body weight, body fat distribution, and insulin resistance.

Authors:  Arnaud Chiolero; David Faeh; Fred Paccaud; Jacques Cornuz
Journal:  Am J Clin Nutr       Date:  2008-04       Impact factor: 7.045

View more
  10 in total

1.  Dual emission laser treatment and argon plasma coagulation in small bowel vascular lesion ablation: a pilot study.

Authors:  Gian Eugenio Tontini; Alessandro Rimondi; Lucia Scaramella; Matilde Topa; Roberto Penagini; Maurizio Vecchi; Luca Elli
Journal:  Lasers Med Sci       Date:  2022-07-13       Impact factor: 2.555

2.  A surrogate model for estimating extreme tower loads on wind turbines based on random forest proximities.

Authors:  Mikkel Slot Nielsen; Victor Rohde
Journal:  J Appl Stat       Date:  2020-09-04       Impact factor: 1.416

3.  Association between cardioplegia and postoperative atrial fibrillation in coronary surgery.

Authors:  Michele Di Mauro; Antonio M Calafiore; Antonino Di Franco; Francesco Nicolini; Francesco Formica; Roberto Scrofani; Carlo Antona; Antonio Messina; Giovanni Troise; Giovanni Mariscalco; Cesare Beghi; Michele De Bonis; Cinzia Trumello; Antonio Miceli; Mattia Glauber; Marco Ranucci; Carlo De Vincentiis; Mario Gaudino; Roberto Lorusso
Journal:  Int J Cardiol       Date:  2020-10-04       Impact factor: 4.164

4.  Should the left gastric artery lymph node be considered as the predictive lymph node for extra-gastric lymph node metastases?

Authors:  Weilin Sun; Jingyu Deng; Wenting He; Jinyuan Liu; Shiwei Guo; Pengfei Gu; Zizhen Wu; Han Liang
Journal:  Ann Transl Med       Date:  2020-06

5.  Extremely low viral reservoir in treated chronically HIV-1-infected individuals.

Authors:  Cristina Gálvez; Victor Urrea; Judith Dalmau; Montse Jimenez; Bonaventura Clotet; Valérie Monceaux; Nicolas Huot; Lorna Leal; Victoria González-Soler; Maria González-Cao; Michaela Müller-Trutwin; Asier Sáez-Cirión; Felipe García; Julià Blanco; Javier Martinez-Picado; Maria Salgado
Journal:  EBioMedicine       Date:  2020-06-21       Impact factor: 8.143

6.  Novel ranking of protective and risk factors for adolescent adiposity in US females.

Authors:  A Narla; D H Rehkopf
Journal:  Obes Sci Pract       Date:  2019-01-16

7.  Propensity score adjustment using machine learning classification algorithms to control selection bias in online surveys.

Authors:  Ramón Ferri-García; María Del Mar Rueda
Journal:  PLoS One       Date:  2020-04-22       Impact factor: 3.240

8.  Altered T-cell subset distribution in the viral reservoir in HIV-1-infected individuals with extremely low proviral DNA (LoViReTs).

Authors:  Cristina Gálvez; Víctor Urrea; Maria Del Carmen Garcia-Guerrero; Sílvia Bernal; Susana Benet; Beatriz Mothe; Lucía Bailón; Judith Dalmau; Andrea Martinez; Aroa Nieto; Lorna Leal; Felipe García; Bonaventura Clotet; Javier Martinez-Picado; Maria Salgado
Journal:  J Intern Med       Date:  2022-03-28       Impact factor: 13.068

9.  Prognostic impact of D2-plus lymphadenectomy and optimal extent of lymphadenectomy in advanced gastric antral carcinoma: Propensity score matching analysis.

Authors:  Weilin Sun; Jingyu Deng; Nannan Zhang; Huifang Liu; Jinyuan Liu; Pengfei Gu; Yingxin Du; Zizhen Wu; Wenting He; Pengliang Wang; Han Liang
Journal:  Chin J Cancer Res       Date:  2020-02       Impact factor: 5.087

10.  Application of a Combined Model with Autoregressive Integrated Moving Average (ARIMA) and Generalized Regression Neural Network (GRNN) in Forecasting Hepatitis Incidence in Heng County, China.

Authors:  Wudi Wei; Junjun Jiang; Hao Liang; Lian Gao; Bingyu Liang; Jiegang Huang; Ning Zang; Yanyan Liao; Jun Yu; Jingzhen Lai; Fengxiang Qin; Jinming Su; Li Ye; Hui Chen
Journal:  PLoS One       Date:  2016-06-03       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.