Named Entity Recognition and Relationship Extraction in Biomedicine

Introduction

Mining useful knowledge from the biomedical literature holds potentials for facilitating literature search, biological database curation and many other scientific tasks. To do that, it is a key step to be able to recognize various types of biological entities (e.g. gene and gene products) as they are the research focus in most biomedical studies. Indeed, our previous investigation revealed that most PubMed users search for publications mentioning those biomedical concepts. For instance, approximately 20% of the PubMed queries containing a gene/protein name.

Goals and Objectives

Our overall goal is to develop automated techniques to identify and annotate various biological entities and concepts (e.g. gene names) in the biomedical literature. Furthermore, we aim to develop state-of-the-art computational technologies for automatically extracting biologically meaningful relationships between those pre-identified entities in free text.

Team Members

Research Highlights

Selected Publications