Display Settings:

Format

Send to:

Choose Destination
See comment in PubMed Commons below
Bioinformatics. 2012 Aug 15;28(16):2154-61. doi: 10.1093/bioinformatics/bts332. Epub 2012 Jun 17.

BioContext: an integrated text mining system for large-scale extraction and contextualization of biomolecular events.

Author information

  • 1Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK. martin.gerner@gmail.com

Abstract

MOTIVATION:

Although the amount of data in biology is rapidly increasing, critical information for understanding biological events like phosphorylation or gene expression remains locked in the biomedical literature. Most current text mining (TM) approaches to extract information about biological events are focused on either limited-scale studies and/or abstracts, with data extracted lacking context and rarely available to support further research.

RESULTS:

Here we present BioContext, an integrated TM system which extracts, extends and integrates results from a number of tools performing entity recognition, biomolecular event extraction and contextualization. Application of our system to 10.9 million MEDLINE abstracts and 234 000 open-access full-text articles from PubMed Central yielded over 36 million mentions representing 11.4 million distinct events. Event participants included over 290 000 distinct genes/proteins that are mentioned more than 80 million times and linked where possible to Entrez Gene identifiers. Over a third of events contain contextual information such as the anatomical location of the event occurrence or whether the event is reported as negated or speculative.

AVAILABILITY:

The BioContext pipeline is available for download (under the BSD license) at http://www.biocontext.org, along with the extracted data which is also available for online browsing.

PMID:
22711795
[PubMed - indexed for MEDLINE]
PMCID:
PMC3413385
Free PMC Article

Images from this publication.See all images (1)Free text

Fig. 1
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for HighWire Icon for PubMed Central
    Loading ...
    Write to the Help Desk