Send to

Choose Destination
Epidemiol Infect. 2011 Sep;139(9):1296-306. doi: 10.1017/S0950268810002529. Epub 2010 Nov 19.

Classification algorithms to improve the accuracy of identifying patients hospitalized with community-acquired pneumonia using administrative data.

Author information

Biostatistics Unit, Group Health Research Institute, Seattle, WA 98101, USA.


In epidemiological studies of community-acquired pneumonia (CAP) that utilize administrative data, cases are typically defined by the presence of a pneumonia hospital discharge diagnosis code. However, not all such hospitalizations represent true CAP cases. We identified 3991 hospitalizations during 1997-2005 in a managed care organization, and validated them as CAP or not by reviewing medical records. To improve the accuracy of CAP identification, classification algorithms that incorporated additional administrative information associated with the hospitalization were developed using the classification and regression tree analysis. We found that a pneumonia code designated as the primary discharge diagnosis and duration of hospital stay improved the classification of CAP hospitalizations. Compared to the commonly used method that is based on the presence of a primary discharge diagnosis code of pneumonia alone, these algorithms had higher sensitivity (81-98%) and positive predictive values (82-84%) with only modest decreases in specificity (48-82%) and negative predictive values (75-90%).

[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Cambridge University Press
Loading ...
Support Center