Literature DB >> 32380551

Reflection on modern methods: generalized linear models for prognosis and intervention-theory, practice and implications for machine learning.

Kellyn F Arnold1,2, Vinny Davies3, Marc de Kamps1,4, Peter W G Tennant1,2,5, John Mbotwa1,2, Mark S Gilthorpe1,2,5.   

Abstract

Prediction and causal explanation are fundamentally distinct tasks of data analysis. In health applications, this difference can be understood in terms of the difference between prognosis (prediction) and prevention/treatment (causal explanation). Nevertheless, these two concepts are often conflated in practice. We use the framework of generalized linear models (GLMs) to illustrate that predictive and causal queries require distinct processes for their application and subsequent interpretation of results. In particular, we identify five primary ways in which GLMs for prediction differ from GLMs for causal inference: (i) the covariates that should be considered for inclusion in (and possibly exclusion from) the model; (ii) how a suitable set of covariates to include in the model is determined; (iii) which covariates are ultimately selected and what functional form (i.e. parameterization) they take; (iv) how the model is evaluated; and (v) how the model is interpreted. We outline some of the potential consequences of failing to acknowledge and respect these differences, and additionally consider the implications for machine learning (ML) methods. We then conclude with three recommendations that we hope will help ensure that both prediction and causal modelling are used appropriately and to greatest effect in health research.
© The Author(s) 2020. Published by Oxford University Press on behalf of the International Epidemiological Association.

Entities:  

Keywords:  Prediction; artificial intelligence; causal inference; directed acyclic graphs; generalized linear models; machine learning

Year:  2021        PMID: 32380551      PMCID: PMC7825942          DOI: 10.1093/ije/dyaa049

Source DB:  PubMed          Journal:  Int J Epidemiol        ISSN: 0300-5771            Impact factor:   7.196


  29 in total

Review 1.  Machine learning for medical diagnosis: history, state of the art and perspective.

Authors:  I Kononenko
Journal:  Artif Intell Med       Date:  2001-08       Impact factor: 5.326

2.  Robust causal inference using directed acyclic graphs: the R package 'dagitty'.

Authors:  Johannes Textor; Benito van der Zander; Mark S Gilthorpe; Maciej Liskiewicz; George Th Ellison
Journal:  Int J Epidemiol       Date:  2016-12-01       Impact factor: 7.196

3.  The inevitable application of big data to health care.

Authors:  Travis B Murdoch; Allan S Detsky
Journal:  JAMA       Date:  2013-04-03       Impact factor: 56.272

4.  The C-Word: Scientific Euphemisms Do Not Improve Causal Inference From Observational Data.

Authors:  Miguel A Hernán
Journal:  Am J Public Health       Date:  2018-03-22       Impact factor: 9.308

5.  Risk factors for venous thromboembolic events in cancer patients.

Authors:  K Kröger; D Weiland; C Ose; N Neumann; S Weiss; C Hirsch; K Urbanski; S Seeber; M E Scheulen
Journal:  Ann Oncol       Date:  2005-11-09       Impact factor: 32.976

Review 6.  Chemotherapy-induced thrombosis.

Authors:  Tufia C Haddad; Edward W Greeno
Journal:  Thromb Res       Date:  2006-01-04       Impact factor: 3.944

7.  The coming of age of artificial intelligence in medicine.

Authors:  Vimla L Patel; Edward H Shortliffe; Mario Stefanelli; Peter Szolovits; Michael R Berthold; Riccardo Bellazzi; Ameen Abu-Hanna
Journal:  Artif Intell Med       Date:  2008-09-13       Impact factor: 5.326

8.  Association between active commuting and incident cardiovascular disease, cancer, and mortality: prospective cohort study.

Authors:  Carlos A Celis-Morales; Donald M Lyall; Paul Welsh; Jana Anderson; Lewis Steell; Yibing Guo; Reno Maldonado; Daniel F Mackay; Jill P Pell; Naveed Sattar; Jason M R Gill
Journal:  BMJ       Date:  2017-04-19

9.  Principles of confounder selection.

Authors:  Tyler J VanderWeele
Journal:  Eur J Epidemiol       Date:  2019-03-06       Impact factor: 8.082

10.  Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet.

Authors:  Nicholas Bien; Pranav Rajpurkar; Robyn L Ball; Jeremy Irvin; Allison Park; Erik Jones; Michael Bereket; Bhavik N Patel; Kristen W Yeom; Katie Shpanskaya; Safwan Halabi; Evan Zucker; Gary Fanton; Derek F Amanatullah; Christopher F Beaulieu; Geoffrey M Riley; Russell J Stewart; Francis G Blankenberg; David B Larson; Ricky H Jones; Curtis P Langlotz; Andrew Y Ng; Matthew P Lungren
Journal:  PLoS Med       Date:  2018-11-27       Impact factor: 11.069

View more
  7 in total

1.  Is the Way Forward to Step Back? Documenting the Frequency With Which Study Goals Are Misaligned With Study Methods and Interpretations in the Epidemiologic Literature.

Authors:  Katrina L Kezios
Journal:  Epidemiol Rev       Date:  2022-01-14       Impact factor: 4.280

2.  Latent class regression improves the predictive acuity and clinical utility of survival prognostication amongst chronic heart failure patients.

Authors:  John L Mbotwa; Marc de Kamps; Paul D Baxter; George T H Ellison; Mark S Gilthorpe
Journal:  PLoS One       Date:  2021-05-07       Impact factor: 3.240

3.  Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations.

Authors:  Peter W G Tennant; Eleanor J Murray; Kellyn F Arnold; Laurie Berrie; Matthew P Fox; Sarah C Gadd; Wendy J Harrison; Claire Keeble; Lynsie R Ranker; Johannes Textor; Georgia D Tomova; Mark S Gilthorpe; George T H Ellison
Journal:  Int J Epidemiol       Date:  2021-05-17       Impact factor: 7.196

Review 4.  A scoping review of causal methods enabling predictions under hypothetical interventions.

Authors:  Lijing Lin; Matthew Sperrin; David A Jenkins; Glen P Martin; Niels Peek
Journal:  Diagn Progn Res       Date:  2021-02-04

5.  Interplay between polygenic propensity for ageing-related traits and the consumption of fruits and vegetables on future dementia diagnosis.

Authors:  Emma Ruby Francis; Dorina Cadar; Andrew Steptoe; Olesya Ajnakina
Journal:  BMC Psychiatry       Date:  2022-01-30       Impact factor: 3.630

6.  High polygenic predisposition for ADHD and a greater risk of all-cause mortality: a large population-based longitudinal study.

Authors:  Olesya Ajnakina; Diana Shamsutdinova; Theresa Wimberley; Søren Dalsgaard; Andrew Steptoe
Journal:  BMC Med       Date:  2022-02-23       Impact factor: 8.775

7.  Time to reality check the promises of machine learning-powered precision medicine.

Authors:  Jack Wilkinson; Kellyn F Arnold; Eleanor J Murray; Maarten van Smeden; Kareem Carr; Rachel Sippy; Marc de Kamps; Andrew Beam; Stefan Konigorski; Christoph Lippert; Mark S Gilthorpe; Peter W G Tennant
Journal:  Lancet Digit Health       Date:  2020-09-16
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.