Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

NCBI Datasets BETA

NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. Find and download gene, transcript, protein and genome sequences, annotation and metadata.


Browse and download genome data using our taxonomy pages. Genome data includes genome, transcript and protein sequences, genome annotation and metadata.

Taxonomy Browser

View taxonomic relationships and find genome data for closely related species using our interactive species browser.


Create a customized Gene table to view and download gene data. Gene data is also available through our command-line tool and API. Gene data includes gene, transcript and protein sequences organized by gene.

Get started


How to


SARS-CoV-2 genomes

Download SARS-CoV-2 and other coronavirus genome and protein sequences on the web or through our command-line tool and API. Filter by host and release date.


SARS-CoV-2 proteins

Download specific SARS-CoV-2 protein sequences on the web or through our command-line tool and API.


How to

Command-line tools

Retrieve gene, genome and coronavirus data from the command-line. The Datasets and Dataformat command-line tools are available for Windows, Mac and Linux systems.

Install tool

How to

What's new

NCBI Insights July 26, 2022

Announcing the NCBI Datasets SARS-CoV-2 taxonomy page

<p>Need SARS-CoV-2 assembled genome sequences or specific SARS-CoV-2 protein sequences? You can find them on the …

NCBI Insights July 13, 2022

NLM’s all-new NCBI Datasets genome table is now available

<p>We are excited to introduce new and useful updates to the Datasets genome table that let …

NCBI Insights June 29, 2022

Introducing NLM’s new NCBI Datasets genome page!

<p>As part of an ongoing effort to modernize and improve your experience, NLM’s NCBI Datasets is …

More news