Improving Terminology Mapping in Clinical Text with Context-Sensitive Spelling Correction

Stud Health Technol Inform. 2017:235:241-245.

Abstract

The mapping of unstructured clinical text to an ontology facilitates meaningful secondary use of health records but is non-trivial due to lexical variation and the abundance of misspellings in hurriedly produced notes. Here, we apply several spelling correction methods to Swedish medical text and evaluate their impact on SNOMED CT mapping; first in a controlled evaluation using medical literature text with induced errors, followed by a partial evaluation on clinical notes. It is shown that the best-performing method is context-sensitive, taking into account trigram frequencies and utilizing a corpus-based dictionary.

Keywords: clinical text; spelling correction; terminology mapping.

MeSH terms

  • Algorithms
  • Electronic Health Records / organization & administration*
  • Meaningful Use
  • Natural Language Processing*
  • Quality Assurance, Health Care / methods
  • Sweden
  • Systematized Nomenclature of Medicine*
  • Vocabulary, Controlled