
Principal Investigator

  • Senior Investigator, NIH/NLM
    Deputy Director for Literature Search, National Center for Biotechnology Information (NCBI)
    Professor of Computer Science (Adjunct), University of Illinois Urbana-Champaign (UIUC)


Join us!

We're always interested in prospective postdocs and students. If interested, please email me for more information.

In 2024, we're organizing the Text Mining COSI at ISMB, the NIH DRKB Network Meeting, and the Workshop on Generative AI and LLMs at PSB. Come and join us for these scientific events!

Call for JAMIA special issue on ChatGPT and LLMs in Biomedicine and Health: details here.

Recent NLP/AI tools

1. MedCPT: foundation models for embedding bio-texts. See details at Bioinformatics.
2. PhenoTagger: a hybrid method for phenotyping with HPO. Read our recent publication in Bioinformatics.
3. The source code for DeepSeeNet, a novel deep learning framework for automated AMD diagnosis, is now publicly available. Its details were recently published in Ophthalmology.

Web-based Lit Search Tools

1. LitSuggest: a system for literature recommendation and curation. NAR 2021.
2. TeamTat: a collaborative text/corpus annotation tool. NAR 2020.
3. PubTator: Automated concept annotation for full-text articles. NAR 2013, 2019.
4. LitSense: Making sense of biomedical literature at sentence level. NAR 2019.
5. LitVar: a semantic literature search engine for genomic variants. NAR 2018.

ChestX-ray14 Data Release

ChestX-ray14, one of the largest chest x-ray datasets, is now publicly available. Check out our CVPR paper and the recent NIH press release. This work received the 2017 NIH Clinical Center Director's Award.