Format

Send to

Choose Destination
See comment in PubMed Commons below
J Biomed Inform. 2011 Oct;44(5):728-37. doi: 10.1016/j.jbi.2011.03.011. Epub 2011 Apr 1.

Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm.

Author information

1
Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA 92093-0728, USA. brchapman@ucsd.edu

Abstract

In this paper we describe an application called peFinder for document-level classification of CT pulmonary angiography reports. peFinder is based on a generalized version of the ConText algorithm, a simple text processing algorithm for identifying features in clinical report documents. peFinder was used to answer questions about the disease state (pulmonary emboli present or absent), the certainty state of the diagnosis (uncertainty present or absent), the temporal state of an identified pulmonary embolus (acute or chronic), and the technical quality state of the exam (diagnostic or not diagnostic). Gold standard answers for each question were determined from the consensus classifications of three human annotators. peFinder results were compared to naive Bayes' classifiers using unigrams and bigrams. The sensitivities (and positive predictive values) for peFinder were 0.98(0.83), 0.86(0.96), 0.94(0.93), and 0.60(0.90) for disease state, quality state, certainty state, and temporal state respectively, compared to 0.68(0.77), 0.67(0.87), 0.62(0.82), and 0.04(0.25) for the naive Bayes' classifier using unigrams, and 0.75(0.79), 0.52(0.69), 0.59(0.84), and 0.04(0.25) for the naive Bayes' classifier using bigrams.

PMID:
21459155
PMCID:
PMC3164892
DOI:
10.1016/j.jbi.2011.03.011
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Elsevier Science Icon for PubMed Central
    Loading ...
    Support Center