Display Settings:

Format

Send to:

Choose Destination
See comment in PubMed Commons below
Pac Symp Biocomput. 2000:517-28.

EDGAR: extraction of drugs, genes and relations from the biomedical literature.

Author information

  • 1Lister Hill Center, National Library of Medicine, Bethesda, MD 20894, USA. tcr@lhc.nlm.nih.gov

Abstract

EDGAR (Extraction of Drugs, Genes and Relations) is a natural language processing system that extracts information about drugs and genes relevant to cancer from the biomedical literature. This automatically extracted information has remarkable potential to facilitate computational analysis in the molecular biology of cancer, and the technology is straightforwardly generalizable to many areas of biomedicine. This paper reports on the mechanisms for automatically generating such assertions and on a simple application, conceptual clustering of documents. The system uses a stochastic part of speech tagger, generates an underspecified syntactic parse and then uses semantic and pragmatic information to construct its assertions. The system builds on two important existing resources: the MEDLINE database of biomedical citations and abstracts and the Unified Medical Language System, which provides syntactic and semantic information about the terms found in biomedical abstracts.

PMID:
10902199
[PubMed - indexed for MEDLINE]
PMCID:
PMC2709525
Free PMC Article

Images from this publication.See all images (3)Free text

Figure 1
Figure 2
Figure 3
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Icon for PubMed Central
    Loading ...
    Write to the Help Desk