One model, several results: the paradox of the Hosmer-Lemeshow goodness-of-fit test for the logistic regression model

G Bertolini; R D'Amico; D Nardi; A Tinazzi; G Apolone

One model, several results: the paradox of the Hosmer-Lemeshow goodness-of-fit test for the logistic regression model

J Epidemiol Biostat. 2000;5(4):251-3.

Authors

G Bertolini¹, R D'Amico, D Nardi, A Tinazzi, G Apolone

Affiliation

¹ Clinical Epidemiology Unit, Institute of Research Pharmacology Mario Negri, Ranica, Bergamo, Italy.

PMID: 11055275

Abstract

Background: The Hosmer-Lemeshow test, used extensively to assess the fit of the logistic regression model, is performed by several statistical packages. Recent studies have shown some problems in the use of this test when ties are present. These problems were attributed merely to the test implementation.

Methods: We analysed the order of the observations as an alternative explanation of the problem of ties. Using a data-set of 1393 intensive care unit (ICU) patients we performed the Hosmer-Lemeshow test with all possible subjects dispositions.

Results: We obtained about one million different P values, ranging from 0.01 to 0.95.

Discussion: It is already known that when the Hosmer-Lemeshow goodness-of-fit test is performed with a number of covariate patterns lower than the number of subjects, its result may be inaccurate. We showed that the extent of this problem could be relevant under particular conditions. We also suggest a strategy for estimating the extent of the problem and subsequent interpretation.

MeSH terms

Hospital Mortality
Humans
Intensive Care Units
Italy
Logistic Models*
Predictive Value of Tests
Severity of Illness Index
Statistics as Topic / methods