Linguistic approach for identification of medication names and related information in clinical narratives

J Am Med Inform Assoc. 2010 Sep-Oct;17(5):549-54. doi: 10.1136/jamia.2010.004036.

Abstract

Background: Pharmacotherapy is an integral part of any medical care process and plays an important role in the medical history of most patients. Information on medication is crucial for several tasks such as pharmacovigilance, medical decision or biomedical research.

Objectives: Within a narrative text, medication-related information can be buried within other non-relevant data. Specific methods, such as those provided by text mining, must be designed for accessing them, and this is the objective of this study.

Methods: The authors designed a system for analyzing narrative clinical documents to extract from them medication occurrences and medication-related information. The system also attempts to deduce medications not covered by the dictionaries used.

Results: Results provided by the system were evaluated within the framework of the I2B2 NLP challenge held in 2009. The system achieved an F-measure of 0.78 and ranked 7th out of 20 participating teams (the highest F-measure was 0.86). The system provided good results for the annotation and extraction of medication names, their frequency, dosage and mode of administration (F-measure over 0.81), while information on duration and reasons is poorly annotated and extracted (F-measure 0.36 and 0.29, respectively). The performance of the system was stable between the training and test sets.

MeSH terms

  • Drug Therapy
  • Electronic Health Records*
  • Humans
  • Information Storage and Retrieval / methods*
  • Linguistics
  • Natural Language Processing*
  • Pharmaceutical Preparations*
  • Software Design

Substances

  • Pharmaceutical Preparations