Generalisability through local validation: overcoming barriers due to data disparity in healthcare

William Greig Mitchell; Edward Christopher Dee; Leo Anthony Celi

doi:10.1186/s12886-021-01992-6

Generalisability through local validation: overcoming barriers due to data disparity in healthcare

BMC Ophthalmol. 2021 May 21;21(1):228. doi: 10.1186/s12886-021-01992-6.

Authors

William Greig Mitchell^{1

2}, Edward Christopher Dee³, Leo Anthony Celi^{4

5

6

7}

Affiliations

¹ Department of Ophthalmology, Massachusetts Eye and Ear Infirmary, Boston, MA, USA.
² Harvard TH Chan School of Public Health, Boston, MA, USA.
³ Harvard Medical School, Boston, MA, USA.
⁴ Harvard TH Chan School of Public Health, Boston, MA, USA. lceli@mit.edu.
⁵ Harvard Medical School, Boston, MA, USA. lceli@mit.edu.
⁶ Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA. lceli@mit.edu.
⁷ Department of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Centre, Boston, MA, USA. lceli@mit.edu.

Abstract

Cho et al. report deep learning model accuracy for tilted myopic disc detection in a South Korean population. Here we explore the importance of generalisability of machine learning (ML) in healthcare, and we emphasise that recurrent underrepresentation of data-poor regions may inadvertently perpetuate global health inequity.Creating meaningful ML systems is contingent on understanding how, when, and why different ML models work in different settings. While we echo the need for the diversification of ML datasets, such a worthy effort would take time and does not obviate uses of presently available datasets if conclusions are validated and re-calibrated for different groups prior to implementation.The importance of external ML model validation on diverse populations should be highlighted where possible - especially for models built with single-centre data.

Keywords: Disparity; Healthcare equity; Machine learning; Ophthalmology.

Publication types

Letter

MeSH terms

Delivery of Health Care
Humans
Machine Learning*
Myopia*