NCBI Datasets BETA

NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. Find and download gene, transcript, protein and genome sequences, annotation and metadata.

What's new

NCBI Insights Oct. 12, 2022

Now Available! Updated NCBI Datasets Command-Line Tools

NLM’s NCBI Datasets announces the release of version 14 of our command-line (CLI) tools, datasets, and …

NCBI Insights Sept. 27, 2022

Coming soon! Changes to NCBI Datasets command-line tool in version 14 (CLIv14.0.0)

In October 2022, NCBI Datasets will release version 14 of our datasets and dataformat command-line tools. …

NCBI Insights July 26, 2022

Announcing the NCBI Datasets SARS-CoV-2 taxonomy page

Need SARS-CoV-2 assembled genome sequences or specific SARS-CoV-2 protein sequences? You can find them on the …


Browse and download genome data using our taxonomy pages. Genome data includes genome, transcript and protein sequences, genome annotation and metadata.

Taxonomy Browser

View taxonomic relationships and find genome data for closely related species using our interactive species browser.


Create a customized Gene table to view and download gene data. Gene data is also available through our command-line tool and API. Gene data includes gene, transcript and protein sequences organized by gene.

SARS-CoV-2 genomes

Download SARS-CoV-2 and other coronavirus genome and protein sequences on the web or through our command-line tool and API. Filter by host and release date.


SARS-CoV-2 proteins

Download specific SARS-CoV-2 protein sequences on the web or through our command-line tool and API.


Command-line tools

Retrieve gene, genome and coronavirus data from the command line. The datasets and dataformat command-line tools are available for Windows, Mac and Linux systems.
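
As a rough illustration of command-line retrieval, the Python sketch below assembles a `datasets download gene symbol` command and can run it when the tool is installed. It assumes the `datasets` executable is on your PATH. The helper names (`build_gene_download_command`, `run_if_installed`), the example gene symbol and the output file name are hypothetical choices for this sketch and are not part of the tool. Check the options against the version you have installed, since the command-line interface changed in version 14.

```python
import shutil
import subprocess


def build_gene_download_command(symbol, taxon="human", filename="gene_data.zip"):
    """Return the argument list for downloading a gene data package by symbol.

    The helper itself is hypothetical; it only assembles the arguments.
    """
    return [
        "datasets", "download", "gene", "symbol", symbol,
        "--taxon", taxon,        # organism the gene symbol belongs to
        "--filename", filename,  # name of the zip archive to write
    ]


def run_if_installed(command):
    """Run the command only if its executable is on PATH.

    Returns True if the command was run, False if the executable is missing.
    """
    if shutil.which(command[0]) is None:
        return False
    subprocess.run(command, check=True)
    return True


if __name__ == "__main__":
    # Print the command instead of running it, so this sketch has no side effects.
    print(" ".join(build_gene_download_command("BRCA1")))
```

Passing the list to `run_if_installed` performs the download over the network and writes the zip archive to the current directory. Building the command as a list rather than a single string avoids shell quoting problems.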
