IEEE Trans Inf Technol Biomed. 2012 May;16(3):413-23. doi: 10.1109/TITB.2012.2185850. Epub 2012 Jan 27.

Anonymization of longitudinal electronic medical records.

Author information

Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37232, USA. acar.tamersoy@vanderbilt.edu

Abstract

Electronic medical record (EMR) systems have enabled healthcare providers to collect detailed patient information from the primary care domain. At the same time, longitudinal data from EMRs are increasingly combined with biorepositories to generate personalized clinical decision support protocols. Emerging policies encourage investigators to disseminate such data in a deidentified form for reuse and collaboration, but organizations are hesitant to do so because they fear that doing so will jeopardize patient privacy. In particular, there are concerns that residual demographic and clinical features could be exploited for reidentification. Various approaches have been developed to anonymize clinical data, but they neglect temporal information and are thus insufficient for emerging biomedical research paradigms. This paper proposes a novel approach to sharing patient-specific longitudinal data that offers robust privacy guarantees while preserving data utility for many biomedical investigations. Our approach aggregates temporal and diagnostic information using heuristics inspired by sequence alignment and clustering methods. Using several patient cohorts derived from the EMR system of the Vanderbilt University Medical Center, we demonstrate that the proposed approach can generate anonymized data that permit effective biomedical analysis.

PMID: 22287248
PMCID: PMC3779068
DOI: 10.1109/TITB.2012.2185850
[Indexed for MEDLINE]
Free PMC Article
