Format

Send to

Choose Destination
See comment in PubMed Commons below
J Am Med Inform Assoc. 2003 Jan-Feb;10(1):21-38.

Integrating query of relational and textual data in clinical databases: a case study.

Author information

1
Center for Medical Informatics, Yale University School of Medicine, New Haven, Connecticut 06520, USA.

Abstract

OBJECTIVES:

The authors designed and implemented a clinical data mart composed of an integrated information retrieval (IR) and relational database management system (RDBMS).

DESIGN:

Using commodity software, which supports interactive, attribute-centric text and relational searches, the mart houses 2.8 million documents that span a five-year period and supports basic IR features such as Boolean searches, stemming, and proximity and fuzzy searching.

MEASUREMENTS:

Results are relevance-ranked using either "total documents per patient" or "report type weighting."

RESULTS:

Non-curated medical text has a significant degree of malformation with respect to spelling and punctuation, which creates difficulties for text indexing and searching. Presently, the IR facilities of RDBMS packages lack the features necessary to handle such malformed text adequately.

CONCLUSION:

A robust IR+RDBMS system can be developed, but it requires integrating RDBMSs with third-party IR software. RDBMS vendors need to make their IR offerings more accessible to non-programmers.

PMID:
12509355
PMCID:
PMC150357
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for PubMed Central
    Loading ...
    Support Center