A structural characterization of shortcut features for prediction

David Bellamy; Miguel A Hernán; Andrew Beam

doi:10.1007/s10654-022-00892-3

A structural characterization of shortcut features for prediction

Eur J Epidemiol. 2022 Jun;37(6):563-568. doi: 10.1007/s10654-022-00892-3. Epub 2022 Jul 6.

Authors

David Bellamy^{1

2}, Miguel A Hernán^{1

2

3}, Andrew Beam^{4

5

6}

Affiliations

¹ CAUSALab, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
² Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
³ Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
⁴ CAUSALab, Harvard T.H. Chan School of Public Health, Boston, MA, USA. andrew_beam@hms.harvard.edu.
⁵ Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA. andrew_beam@hms.harvard.edu.
⁶ Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. andrew_beam@hms.harvard.edu.

Abstract

With the rising use of machine learning for healthcare applications, practitioners are increasingly confronted with the limitations of prediction models that are trained in one setting but meant to be deployed in several others. One recently identified limitation is so-called shortcut learning, whereby a model learns to associate features with the prediction target that do not maintain their relationship across settings. Famously, the watermark on chest x-rays has been demonstrated to be an instance of a shortcut feature. In this viewpoint, we attempt to give a structural characterization of shortcut features in terms of causal DAGs. This is the first attempt at defining shortcut features in terms of their causal relationship with a model's prediction target.

Keywords: Causal inference; Machine learning; Prediction models.

MeSH terms

Causality
Humans
Machine Learning*

Abstract

MeSH terms

Grants and funding