Latest news: New BLAST design to be released on April 16, 2007

Databases available for BLAST search


Peptide Sequence Databases


nr
All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
month
All new or revised GenBank CDS translation+PDB+SwissProt+PIR+PRF released in the last 30 days.
swissprot
Last major release of the SWISS-PROT protein sequence database (no updates)
Drosophila genome
Drosophila genome proteins provided by Celera and Berkeley Drosophila Genome Project (BDGP).
yeast
Yeast (Saccharomyces cerevisiae) genomic CDS translations
ecoli
Escherichia coli genomic CDS translations
pdb
Sequences derived from the 3-dimensional structure from Brookhaven Protein Data Bank
kabat [kabatpro]
Kabat's database of sequences of immunological interest
alu
Translations of select Alu repeats from REPBASE, suitable for masking Alu repeats from query sequences. It is available by anonymous FTP from ncbi.nlm.nih.gov (under the /pub/jmc/alu directory). See "Alu alert" by Claverie and Makalowski, Nature vol. 371, page 752 (1994) .

Nucleotide Sequence Databases


nr
All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, or phase 0, 1 or 2 HTGS sequences). No longer "non-redundant".
month
All new or revised GenBank+EMBL+DDBJ+PDB sequences released in the last 30 days.
Drosophila genome
Drosophila genome provided by Celera and Berkeley Drosophila Genome Project (BDGP).
dbest
Database of GenBank+EMBL+DDBJ sequences from EST Divisions
dbsts
Database of GenBank+EMBL+DDBJ sequences from STS Divisions
htgs
Unfinished High Throughput Genomic Sequences: phases 0, 1 and 2 (finished, phase 3 HTG sequences are in nr)
gss
Genome Survey Sequence, includes single-pass genomic data, exon-trapped sequences, and Alu PCR sequences.
yeast
Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences
E. coli
Escherichia coli genomic nucleotide sequences
pdb
Sequences derived from the 3-dimensional structure from Brookhaven Protein Data Bank
kabat [kabatnuc]
Kabat's database of sequences of immunological interest
vector
Vector subset of GenBank(R), NCBI, in ftp://ncbi.nlm.nih.gov/blast/db/
mito
Database of mitochondrial sequences
alu
Select Alu repeats from REPBASE, suitable for masking Alu repeats from query sequences. It is available by anonymous FTP from ncbi.nlm.nih.gov (under the /pub/jmc/alu directory). See "Alu alert" by Claverie and Makalowski, Nature vol. 371, page 752 (1994).
epd
Eukaryotic Promotor Database found on the web at http://www.genome.ad.jp/dbget-bin/www_bfind?epd

Trace Archive Databases


The sequences in these databases are derived from raw sequence trace files. These data are not trimmed for quality or vector sequences. The trace files in the Trace Archive are from a variety of projects and strategies, including Whole Genome Shotgun (WGS), Clone by Clone Strategies, BAC end sequencing, and EST sequencing.

organism-WGS
Whole Genome Shotgun reads
organism-EST
EST reads
organism-other
Any other trace file - mostly BAC traces and BAC end sequences

Disclaimer
Privacy statement
Accessibility
This page is valid XHTML 1.0.