NCBI Datasets BETA
NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. Find and download gene, transcript, protein and genome sequences, annotation and metadata.
What's new
New Way to View and Download Related Genes
Effective June 2023, the HomoloGene records will redirect to the Datasets Gene Table Do you use …
RefSeq Release 217
RefSeq release 217 is now available online and from the FTP site. You can access RefSeq …
New & Improved NCBI Datasets Genome and Assembly Pages
Legacy pages will be redirected effective June 2023 In June 2023, NCBI’s Assembly and Genome record …
Genomes
Browse and download genome data using our taxonomy pages. Genome data includes genome, transcript and protein sequences, genome annotation and metadata.
Taxonomy Browser
Genes
Create a customized Gene table to view and download gene data. Gene data is also available through our command-line tool and API. Gene data includes gene, transcript and protein sequences organized by gene.
Get startedExamples
How to

Viruses
SARS-CoV-2 genomes
Download SARS-CoV-2 and other coronavirus genome and protein sequences on the web or through our command-line tool and API. Filter by host and release date.
GenomesSARS-CoV-2 proteins
Download specific SARS-CoV-2 protein sequences on the web or through our command-line tool and API.
ProteinsHow to

Command-line tools
Retrieve gene, genome and coronavirus data from the command-line. The Datasets and Dataformat command-line tools are available for Windows, Mac and Linux systems.
Install toolHow to
