Format

Send to

Choose Destination
See comment in PubMed Commons below
Cancer Inform. 2007 May 4;3:149-58.

Development of query strategies to identify a histologic lymphoma subtype in a large linked database system.

Author information

1
Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, Atlanta, GA 30322, USA.

Abstract

BACKGROUND:

Large linked databases (LLDB) represent a novel resource for cancer outcomes research. However, accurate means of identifying a patient population of interest within these LLDBs can be challenging. Our research group developed a fully integrated platform that provides a means of combining independent legacy databases into a single cancer-focused LLDB system. We compared the sensitivity and specificity of several SQL-based query strategies for identifying a histologic lymphoma subtype in this LLDB to determine the most accurate legacy data source for identifying a specific cancer patient population.

METHODS:

Query strategies were developed to identify patients with follicular lymphoma from a LLDB of cancer registry data, electronic medical records (EMR), laboratory, administrative, pharmacy, and other clinical data. Queries were performed using common diagnostic codes (ICD-9), cancer registry histology codes (ICD-O), and text searches of EMRs. We reviewed medical records and pathology reports to confirm each diagnosis and calculated the sensitivity and specificity for each query strategy.

RESULTS:

Together the queries identified 1538 potential cases of follicular lymphoma. Review of pathology and other medical reports confirmed 415 cases of follicular lymphoma, 300 pathology-verified and 115 verified from other medical reports. The query using ICD-O codes was highly specific (96%). Queries using text strings varied in sensitivity (range 7-92%) and specificity (range 86-99%). Queries using ICD-9 codes were both less sensitive (34-44%) and specific (35-87%).

CONCLUSIONS:

Queries of linked-cancer databases that include cancer registry data should utilize ICD-O codes or employ structured free-text searches to identify patient populations with a precise histologic diagnosis.

KEYWORDS:

Large linked database; cancer epidemiology; cancer outcomes research; cancer registry

PMID:
19455241
PMCID:
PMC2675837
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for PubMed Central
    Loading ...
    Support Center