Skip to main page content Skip to main page content

Principal Investigator

  • Deputy Director for Literature Search, National Center for Biotechnology Information (NCBI)
    Senior Investigator, NLM/NIH


Join us!

We're always interested in possible postdocs & students. If interested, please email me for more information. See our new FY2019 opening here!

In 2020, we're organizing the NLM Curation at Scale workshop (postponed) and text mining SIG at ISMB 2020. Come and join us for these two scientific events!

Recent tools

1. BioC files available for ~3 million full text articles in the PMC Text Mining (TM) Subset. Read our recent publication in Bioinformatics.
2. The source code for DeepSeeNet, a novel deep learning framework for automated AMD diagnosis is now publicly available. Its details are recently published in Ophthalmology.

Recent web-based systems

1. TeamTat: a collaborative text annotation tool. NAR 2020.
2. PubTator Central: Automated concept annotation for full-text articles. NAR 2019.
3. LitSense: Making sense of biomedical literature at sentence level. NAR 2019.

ChestX-ray8 Data Release

ChestX-ray8, one of the largest chest x-ray datasets, is now publicly available. Check out our CVPR paper and the recent NIH press release. This work received 2017 NIH Clinical Center Director's Award.