Literature DB >> 33220494

An empirical characterization of fair machine learning for clinical risk prediction.

Stephen R Pfohl1, Agata Foryciarz2, Nigam H Shah3.   

Abstract

The use of machine learning to guide clinical decision making has the potential to worsen existing health disparities. Several recent works frame the problem as that of algorithmic fairness, a framework that has attracted considerable attention and criticism. However, the appropriateness of this framework is unclear due to both ethical as well as technical considerations, the latter of which include trade-offs between measures of fairness and model performance that are not well-understood for predictive models of clinical outcomes. To inform the ongoing debate, we conduct an empirical study to characterize the impact of penalizing group fairness violations on an array of measures of model performance and group fairness. We repeat the analysis across multiple observational healthcare databases, clinical outcomes, and sensitive attributes. We find that procedures that penalize differences between the distributions of predictions across groups induce nearly-universal degradation of multiple performance metrics within groups. On examining the secondary impact of these procedures, we observe heterogeneity of the effect of these procedures on measures of fairness in calibration and ranking across experimental conditions. Beyond the reported trade-offs, we emphasize that analyses of algorithmic fairness in healthcare lack the contextual grounding and causal awareness necessary to reason about the mechanisms that lead to health disparities, as well as about the potential of algorithmic fairness methods to counteract those mechanisms. In light of these limitations, we encourage researchers building predictive models for clinical use to step outside the algorithmic fairness frame and engage critically with the broader sociotechnical context surrounding the use of machine learning in healthcare.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Algorithmic fairness; Bias; Clinical risk prediction

Mesh:

Year:  2020        PMID: 33220494      PMCID: PMC7871979          DOI: 10.1016/j.jbi.2020.103621

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  36 in total

1.  Ensuring Fairness in Machine Learning to Advance Health Equity.

Authors:  Alvin Rajkomar; Michaela Hardt; Michael D Howell; Greg Corrado; Marshall H Chin
Journal:  Ann Intern Med       Date:  2018-12-04       Impact factor: 25.391

2.  Medicine. Race and reification in science.

Authors:  Troy Duster
Journal:  Science       Date:  2005-02-18       Impact factor: 47.728

3.  Making Machine Learning Models Clinically Useful.

Authors:  Nigam H Shah; Arnold Milstein; Steven C Bagley PhD
Journal:  JAMA       Date:  2019-10-08       Impact factor: 56.272

4.  Don't ask if artificial intelligence is good or fair, ask how it shifts power.

Authors:  Pratyusha Kalluri
Journal:  Nature       Date:  2020-07       Impact factor: 49.962

5.  Good intentions are not enough: how informatics interventions can worsen inequality.

Authors:  Tiffany C Veinot; Hannah Mitchell; Jessica S Ancker
Journal:  J Am Med Inform Assoc       Date:  2018-08-01       Impact factor: 4.497

6.  A fairer way forward for AI in health care.

Authors:  Linda Nordling
Journal:  Nature       Date:  2019-09       Impact factor: 49.962

7.  Treating health disparities with artificial intelligence.

Authors:  Irene Y Chen; Shalmali Joshi; Marzyeh Ghassemi
Journal:  Nat Med       Date:  2020-01       Impact factor: 53.440

Review 8.  Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare.

Authors:  Davide Cirillo; Silvina Catuara-Solarz; Czuee Morey; Emre Guney; Laia Subirats; Simona Mellino; Annalisa Gigante; Alfonso Valencia; María José Rementeria; Antonella Santuccione Chadha; Nikolaos Mavridis
Journal:  NPJ Digit Med       Date:  2020-06-01

9.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

10.  Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data.

Authors:  Jenna M Reps; Martijn J Schuemie; Marc A Suchard; Patrick B Ryan; Peter R Rijnbeek
Journal:  J Am Med Inform Assoc       Date:  2018-08-01       Impact factor: 4.497

View more
  12 in total

Review 1.  A scoping review of ethics considerations in clinical natural language processing.

Authors:  Oliver J Bear Don't Walk; Harry Reyes Nieva; Sandra Soo-Jin Lee; Noémie Elhadad
Journal:  JAMIA Open       Date:  2022-05-26

Review 2.  Trustworthy Augmented Intelligence in Health Care.

Authors:  Elliott Crigger; Karen Reinbold; Chelsea Hanson; Audiey Kao; Kathleen Blake; Mira Irons
Journal:  J Med Syst       Date:  2022-01-12       Impact factor: 4.460

3.  Use of a community advisory board to build equitable algorithms for participation in clinical trials: a protocol paper for HoPeNET.

Authors:  Nicole Farmer; Foster Osei Baah; Faustine Williams; Erika Ortiz-Chapparo; Valerie M Mitchell; Latifa Jackson; Billy Collins; Lennox Graham; Gwenyth R Wallen; Tiffany M Powell-Wiley; Allan Johnson
Journal:  BMJ Health Care Inform       Date:  2022-02

4.  A comparison of approaches to improve worst-case predictive model performance over patient subpopulations.

Authors:  Stephen R Pfohl; Haoran Zhang; Yizhe Xu; Agata Foryciarz; Marzyeh Ghassemi; Nigam H Shah
Journal:  Sci Rep       Date:  2022-02-28       Impact factor: 4.379

5.  Evaluation of domain generalization and adaptation on improving model robustness to temporal dataset shift in clinical medicine.

Authors:  Lin Lawrence Guo; Stephen R Pfohl; Jason Fries; Alistair E W Johnson; Jose Posada; Catherine Aftandilian; Nigam Shah; Lillian Sung
Journal:  Sci Rep       Date:  2022-02-17       Impact factor: 4.379

6.  Evaluating algorithmic fairness in the presence of clinical guidelines: the case of atherosclerotic cardiovascular disease risk estimation.

Authors:  Agata Foryciarz; Stephen R Pfohl; Birju Patel; Nigam Shah
Journal:  BMJ Health Care Inform       Date:  2022-04

7.  Cross-ethnicity/race generalization failure of behavioral prediction from resting-state functional connectivity.

Authors:  Jingwei Li; Danilo Bzdok; Jianzhong Chen; Angela Tam; Leon Qi Rong Ooi; Avram J Holmes; Tian Ge; Kaustubh R Patil; Mbemba Jabbi; Simon B Eickhoff; B T Thomas Yeo; Sarah Genon
Journal:  Sci Adv       Date:  2022-03-16       Impact factor: 14.957

8.  Addressing fairness in artificial intelligence for medical imaging.

Authors:  Rodrigo Echeveste; Enzo Ferrante; María Agustina Ricci Lara
Journal:  Nat Commun       Date:  2022-08-06       Impact factor: 17.694

9.  Performance and usability testing of an automated tool for detection of peripheral artery disease using electronic health records.

Authors:  I Ghanzouri; S Amal; V Ho; L Safarnejad; J Cabot; C G Brown-Johnson; N Leeper; S Asch; N H Shah; E G Ross
Journal:  Sci Rep       Date:  2022-08-03       Impact factor: 4.996

10.  Ethical Applications of Artificial Intelligence: Evidence From Health Research on Veterans.

Authors:  Christos Makridis; Seth Hurley; Gil Alterovitz; Mary Klote
Journal:  JMIR Med Inform       Date:  2021-06-02
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.