Display Settings:

Format

Send to:

Choose Destination
    AMIA Annu Symp Proc. 2008 Nov 6:450-4.

    Use of semantic features to classify patient smoking status.

    Source

    College of Physicians & Surgeons, Columbia University, New York, NY, USA.

    Abstract

    The recent i2b2 NLP Challenge smoking classification task offers a rare chance to compare different natural language processing techniques on actual clinical data. We compare the performance of a classifier which relies on semantic features generated by an unmodified version of MedLEE, a clinical NLP engine, to one using lexical features. We also compare the performance of supervised classifiers to rule-based symbolic classifiers. Our baseline supervised classifier with lexical features yields a microaveraged F-measure of 0.81. Our rule-based classifier using MedLEE semantic features is superior, with an F-measure of 0.83. Our supervised classifier trained with semantic MedLEE features is competitive with the top-performing smoking classifier in the i2b2 NLP Challenge, with microaveraged precision of 0.90, recall of 0.89, and F-measure of 0.89.

    PMID:
    18998969
    [PubMed - indexed for MEDLINE]
    PMCID: PMC2655942
    Free PMC Article

    Images from this publication.See all images (2) Free text

    Figure 2
    Figure 1

      Supplemental Content

      Click here to read

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk