Literature DB >> 32784265

A machine learning-based approach for estimating and testing associations with multivariate outcomes.

David Benkeser1, Andrew Mertens2, John M Colford2, Alan Hubbard3, Benjamin F Arnold4, Aryeh Stein5, Mark J van der Laan3.   

Abstract

We propose a method for summarizing the strength of association between a set of variables and a multivariate outcome. Classical summary measures are appropriate when linear relationships exist between covariates and outcomes, while our approach provides an alternative that is useful in situations where complex relationships may be present. We utilize machine learning to detect nonlinear relationships and covariate interactions and propose a measure of association that captures these relationships. A hypothesis test about the proposed associative measure can be used to test the strong null hypothesis of no association between a set of variables and a multivariate outcome. Simulations demonstrate that this hypothesis test has greater power than existing methods against alternatives where covariates have nonlinear relationships with outcomes. We additionally propose measures of variable importance for groups of variables, which summarize each groups' association with the outcome. We demonstrate our methodology using data from a birth cohort study on childhood health and nutrition in the Philippines.
© 2020 David Benkeser, et al., published by De Gruyter, Berlin/Boston.

Entities:  

Keywords:  canonical correlation; epidemiology; machine learning; multivariate outcomes; variable importance

Mesh:

Year:  2020        PMID: 32784265     DOI: 10.1515/ijb-2019-0061

Source DB:  PubMed          Journal:  Int J Biostat        ISSN: 1557-4679            Impact factor:   0.968


  3 in total

1.  Testing a global null hypothesis using ensemble machine learning methods.

Authors:  Sunwoo Han; Youyi Fong; Ying Huang
Journal:  Stat Med       Date:  2022-03-07       Impact factor: 2.497

2.  Estimation and Optimization of Composite Outcomes.

Authors:  Daniel J Luckett; Eric B Laber; Siyeon Kim; Michael R Kosorok
Journal:  J Mach Learn Res       Date:  2021-01       Impact factor: 5.177

3.  Predicting future community-level ocular Chlamydia trachomatis infection prevalence using serological, clinical, molecular, and geospatial data.

Authors:  Christine Tedijanto; Solomon Aragie; Zerihun Tadesse; Mahteme Haile; Taye Zeru; Scott D Nash; Dionna M Wittberg; Sarah Gwyn; Diana L Martin; Hugh J W Sturrock; Thomas M Lietman; Jeremy D Keenan; Benjamin F Arnold
Journal:  PLoS Negl Trop Dis       Date:  2022-03-11
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.