| Literature DB >> 35792990 |
David Bellamy1,2, Miguel A Hernán1,2,3, Andrew Beam4,5,6.
Abstract
With the rising use of machine learning for healthcare applications, practitioners are increasingly confronted with the limitations of prediction models that are trained in one setting but meant to be deployed in several others. One recently identified limitation is so-called shortcut learning, whereby a model learns to associate features with the prediction target that do not maintain their relationship across settings. Famously, the watermark on chest x-rays has been demonstrated to be an instance of a shortcut feature. In this viewpoint, we attempt to give a structural characterization of shortcut features in terms of causal DAGs. This is the first attempt at defining shortcut features in terms of their causal relationship with a model's prediction target.Entities:
Keywords: Causal inference; Machine learning; Prediction models
Mesh:
Year: 2022 PMID: 35792990 PMCID: PMC9256901 DOI: 10.1007/s10654-022-00892-3
Source DB: PubMed Journal: Eur J Epidemiol ISSN: 0393-2990 Impact factor: 12.434
Fig. 1Causal DAG underlying the prediction of COVID-19 status from chest X-rays across two different contexts. The training context (left) includes 4 hospitals with different COVID-19 prevalence and watermarking patterns. The deployment context (right) includes a fifth hospital with no watermark on the X-ray
Fig. 2Causal DAGs in the training (top row) and deployment (bottom row) context. Panel (A) depicts the same causal structure as Fig. 1, panels (B) and (C) depict alternative causal structures for shortcut features
Fig. 3Causal DAG from Wang et al. [25]. The features X1, …, Xm cause the prediction target Y and share a common cause C