Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

NCBI Datasets BETA

NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. Find and download gene, transcript, protein and genome sequences, annotation and metadata.

What's new

NCBI Insights Sept. 27, 2022

Coming soon! Changes to NCBI Datasets command-line tool in version 14 (CLIv14.0.0)

In October 2022, NCBI Datasets will release version 14 of our datasets and dataformat command-line tools. …

NCBI Insights July 26, 2022

Announcing the NCBI Datasets SARS-CoV-2 taxonomy page

Need SARS-CoV-2 assembled genome sequences or specific SARS-CoV-2 protein sequences? You can find them on the …

NCBI Insights July 13, 2022

NLM’s all-new NCBI Datasets genome table is now available

We are excited to introduce new and useful updates to the Datasets genome table that let …

More news

Genomes

Browse and download genome data using our taxonomy pages. Genome data includes genome, transcript and protein sequences, genome annotation and metadata.

Taxonomy Browser

View taxonomic relationships and find genome data for closely related species using our interactive species browser.

Genes

Create a customized Gene table to view and download gene data. Gene data is also available through our command-line tool and API. Gene data includes gene, transcript and protein sequences organized by gene.

Get started

Examples

How to

Viruses

SARS-CoV-2 genomes

Download SARS-CoV-2 and other coronavirus genome and protein sequences on the web or through our command-line tool and API. Filter by host and release date.

Genomes

SARS-CoV-2 proteins

Download specific SARS-CoV-2 protein sequences on the web or through our command-line tool and API.

Proteins

How to

Command-line tools

Retrieve gene, genome and coronavirus data from the command-line. The Datasets and Dataformat command-line tools are available for Windows, Mac and Linux systems.

Install tool

How to