
Principal Investigator

Zhiyong Lu, PhD FACMI



Short Biography

My NIH Biosketch

Dr. Lu is Deputy Director for Literature Search at the National Center for Biotechnology Information (NCBI), where he leads the overall effort to improve literature search and information access in NCBI’s production resources. He is also an NIH Senior Investigator (early tenure) and directs the Text Mining / Natural Language Processing (NLP) Research program at NCBI/NLM, which develops computational methods and software tools for analyzing and making sense of unstructured text data in biomedical literature and clinical notes, with the goal of accelerating discovery and improving health. Before that, Dr. Lu was NIH's first Earl Stadtman Investigator in Computational Biology and Bioinformatics. His recent publications and invited talks focus on the following topics:

  • PubMed Search (e.g. biomedical literature retrieval; author name disambiguation)
  • BioNLP & Text Mining (e.g. named entity recognition and information extraction)
  • eCuration (e.g. computer-assisted biomedical data curation at scale)
  • Machine Learning for Healthcare (e.g. deep learning, EMR mining & medical image analysis)

Dr. Lu is a Fellow of the American College of Medical Informatics (ACMI) and an Associate Editor for Bioinformatics (OUP), Artificial Intelligence in Medicine (Elsevier), BMC Bioinformatics (Springer), and Journal of Healthcare Informatics Research (Springer); he also serves on the Editorial Board of the journal Database (OUP). He is an organizer of BioCreative (a community-wide international challenge on evaluating NLP systems in biomedicine, held since 2004) and has helped organize a number of leading international meetings in his research field (e.g. ISMB text-mining area chair; PSB session chair; general chair for the IEEE Conference on Healthcare Informatics). He has also frequently participated in both US (e.g. NSF/NIH) and international (e.g. MRC/NSERC/The Royal Society) grant reviews and has served as an ad hoc editor or reviewer for major journals and conferences in NLP (ACL), IR (WWW), Bioinformatics (ISMB/PSB/BioCuration), and Health Informatics (AMIA). He has (co-)authored over 300 scientific publications since 2004. According to Google Scholar, he has an h-index of about 70 and over 25,000 citations. He also appears on the Web of Science Global Highly Cited Researchers list.

Invited Keynote Talks (selected)

  • 2022 CAMDA - Annual Int'l Conference on Critical Assessment of Massive Data Analysis
  • 2021 SciNLP - Natural Language Processing for Scientific Text
  • 2020 Human Impacts of AI Symposium AAAS STPF
  • 2019 Harvard Medical School
  • 2018 The 16th annual Rocky Mountain Bioinformatics Conference

Selected Publications