Format

Send to

Choose Destination
PeerJ. 2017 Apr 25;5:e3230. doi: 10.7717/peerj.3230. eCollection 2017.

Contribution of temporal data to predictive performance in 30-day readmission of morbidly obese patients.

Author information

1
Faculty of Health Sciences, University of Maribor, Maribor, Slovenia.
2
Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, Slovenia.
3
Center for Data Analytics and Biomedical Informatics, Temple University, Philadelphia, PA, United States.

Abstract

BACKGROUND:

Reduction of readmissions after discharge represents an important challenge for many hospitals and has attracted the interest of many researchers in the past few years. Most of the studies in this field focus on building cross-sectional predictive models that aim to predict the occurrence of readmission within 30-days based on information from the current hospitalization. The aim of this study is demonstration of predictive performance gain obtained by inclusion of information from historical hospitalization records among morbidly obese patients.

METHODS:

The California Statewide inpatient database was used to build regularized logistic regression models for prediction of readmission in morbidly obese patients (n = 18,881). Temporal features were extracted from historical patient hospitalization records in a one-year timeframe. Five different datasets of patients were prepared based on the number of available hospitalizations per patient. Sample size of the five datasets ranged from 4,787 patients with more than five hospitalizations to 20,521 patients with at least two hospitalization records in one year. A 10-fold cross validation was repeted 100 times to assess the variability of the results. Additionally, random forest and extreme gradient boosting were used to confirm the results.

RESULTS:

Area under the ROC curve increased significantly when including information from up to three historical records on all datasets. The inclusion of more than three historical records was not efficient. Similar results can be observed for Brier score and PPV value. The number of selected predictors corresponded to the complexity of the dataset ranging from an average of 29.50 selected features on the smallest dataset to 184.96 on the largest dataset based on 100 repetitions of 10-fold cross-validation.

DISCUSSION:

The results show positive influence of adding information from historical hospitalization records on predictive performance using all predictive modeling techniques used in this study. We can conclude that it is advantageous to build separate readmission prediction models in subgroups of patients with more hospital admissions by aggregating information from up to three previous hospitalizations.

KEYWORDS:

Hospital readmission; Morbid obesity; Regularized logistic regression; Temporal data

Supplemental Content

Full text links

Icon for PeerJ, Inc. Icon for PubMed Central
Loading ...
Support Center