NCBI Datasets BETA
NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. Find and download gene, transcript, protein and genome sequences, annotation and metadata.
Genomes
Browse and download genome data using our taxonomy pages. Genome data includes genome, transcript and protein sequences, genome annotation and metadata.
Taxonomy Browser
Genes
Create a customized Gene table to view and download gene data. Gene data is also available through our command-line tool and API. Gene data includes gene, transcript and protein sequences organized by gene.
Get startedExamples
How to

Viruses
SARS-CoV-2 genomes
Download SARS-CoV-2 and other coronavirus genome and protein sequences on the web or through our command-line tool and API. Filter by host and release date.
GenomesSARS-CoV-2 proteins
Download specific SARS-CoV-2 protein sequences on the web or through our command-line tool and API.
ProteinsHow to

Command-line tools
Retrieve gene, genome and coronavirus data from the command-line. The Datasets and Dataformat command-line tools are available for Windows, Mac and Linux systems.
Install toolHow to

What's new
Announcing the NCBI Datasets SARS-CoV-2 taxonomy page
<p>Need SARS-CoV-2 assembled genome sequences or specific SARS-CoV-2 protein sequences? You can find them on the …
NLM’s all-new NCBI Datasets genome table is now available
<p>We are excited to introduce new and useful updates to the Datasets genome table that let …
Introducing NLM’s new NCBI Datasets genome page!
<p>As part of an ongoing effort to modernize and improve your experience, NLM’s NCBI Datasets is …