Format

Send to

Choose Destination
J Am Med Inform Assoc. 2019 Dec 1;26(12):1466-1477. doi: 10.1093/jamia/ocz106.

Assessing clinical heterogeneity in sepsis through treatment patterns and machine learning.

Author information

1
Division of Research, Kaiser Permanente, Oakland, California, USA.
2
Department of Epidemiology, University of Washington, Seattle, Washington, USA.
3
Division of Biomedical Informatics Research, Stanford University, Stanford, California, USA.

Abstract

OBJECTIVE:

To use unsupervised topic modeling to evaluate heterogeneity in sepsis treatment patterns contained within granular data of electronic health records.

MATERIALS AND METHODS:

A multicenter, retrospective cohort study of 29 253 hospitalized adult sepsis patients between 2010 and 2013 in Northern California. We applied an unsupervised machine learning method, Latent Dirichlet Allocation, to the orders, medications, and procedures recorded in the electronic health record within the first 24 hours of each patient's hospitalization to uncover empiric treatment topics across the cohort and to develop computable clinical signatures for each patient based on proportions of these topics. We evaluated how these topics correlated with common sepsis treatment and outcome metrics including inpatient mortality, time to first antibiotic, and fluids given within 24 hours.

RESULTS:

Mean age was 70 ± 17 years with hospital mortality of 9.6%. We empirically identified 42 clinically recognizable treatment topics (eg, pneumonia, cellulitis, wound care, shock). Only 43.1% of hospitalizations had a single dominant topic, and a small minority (7.3%) had a single topic comprising at least 80% of their overall clinical signature. Across the entire sepsis cohort, clinical signatures were highly variable.

DISCUSSION:

Heterogeneity in sepsis is a major barrier to improving targeted treatments, yet existing approaches to characterizing clinical heterogeneity are narrowly defined. A machine learning approach captured substantial patient- and population-level heterogeneity in treatment during early sepsis hospitalization.

CONCLUSION:

Using topic modeling based on treatment patterns may enable more precise clinical characterization in sepsis and better understanding of variability in sepsis presentation and outcomes.

KEYWORDS:

infection; latent Dirichlet allocation; machine learning; topic modeling; treatment heterogeneity

PMID:
31314892
DOI:
10.1093/jamia/ocz106

Supplemental Content

Full text links

Icon for Silverchair Information Systems
Loading ...
Support Center