
GenBank Database Divisions

GenBank divisions fall into two general categories, which were described in an article by Ouellette and Boguski, "Database Divisions and Homology Search Files: A Guide for the Perplexed" (Genome Research (1997) 7(10)); the full-text article is available. The "Organismal" divisions hold sequences derived from specific organisms, and the "Functional" divisions hold the different types of sequence data being collected.

Each sequence record exists in only one GenBank division. For example, the HTG division includes unfinished sequences (phases 0, 1, and 2) being generated from several different organisms. When a sequence is updated to phase 3, it is moved into the appropriate organismal division. For instance, human phase 3 (finished) HTG sequences are located in the PRI division.

The GenBank divisions listed here represent the location of the annotated sequence records; for homology search purposes the records are reformatted and stored in the BLAST databases. The database divisions currently available, along with the related BLAST databases, are listed below. An example of a submission (one accession number) that has progressed through phase 1, phase 2, and phase 3 is available (Examples).


Organismal Divisions:
Division  Description                        BLAST      Example
BCT       Bacterial sequences                nr, month
PRI       Primate sequences                  nr, month  Human Phase 3
ROD       Rodent sequences                   nr, month
MAM       Other mammalian sequences          nr, month
VRT       Other vertebrate sequences         nr, month
INV       Invertebrate sequences             nr, month  Drosophila, C. elegans Phase 3
PLN       Plant and Fungal sequences         nr, month  Arabidopsis Phase 3
VRL       Viral sequences                    nr, month
PHG       Phage sequences                    nr, month
RNA       Structural RNA sequences           nr, month
SYN       Synthetic and chimeric sequences   nr, month
UNA       Unannotated sequences              nr, month

Functional Divisions:
Division  Description                        BLAST         Example
EST       Expressed Sequence Tags            dbest, month
STS       Sequence Tagged Sites              dbsts, month
GSS       Genome Survey Sequences            dbgss, month
HTG       High Throughput Genomic sequences  htgs, month   All Organisms: Phase 0, 1, and 2
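
The pairing of divisions with BLAST databases in the two tables above can be expressed as a simple lookup. The following is a minimal Python sketch, not an NCBI tool: the names `ORGANISMAL_DIVISIONS`, `BLAST_DATABASES`, and `blast_databases_for` are hypothetical, and the data is transcribed only from the tables on this page.

```python
# Division codes and BLAST database names are transcribed from the tables
# on this page; all identifiers below are illustrative, not an NCBI API.
ORGANISMAL_DIVISIONS = (
    "BCT", "PRI", "ROD", "MAM", "VRT", "INV",
    "PLN", "VRL", "PHG", "RNA", "SYN", "UNA",
)

# Every organismal division is listed with the same pair of BLAST databases.
BLAST_DATABASES = {code: ("nr", "month") for code in ORGANISMAL_DIVISIONS}

# Each functional division is listed with its own database plus "month".
BLAST_DATABASES.update({
    "EST": ("dbest", "month"),
    "STS": ("dbsts", "month"),
    "GSS": ("dbgss", "month"),
    "HTG": ("htgs", "month"),
})


def blast_databases_for(division):
    """Return the BLAST databases listed for a GenBank division code."""
    try:
        return BLAST_DATABASES[division.upper()]
    except KeyError:
        raise ValueError(f"unknown GenBank division: {division!r}") from None
```

For instance, `blast_databases_for("HTG")` looks up the row for unfinished high throughput genomic sequences, while any organismal code resolves to the `nr` and `month` databases.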


Phase 0 sequences are single- or few-pass reads of a single clone (usually not contigs).
Phase 1 sequences are unfinished, unordered, and contain gaps.
Phase 2 sequences are unfinished, ordered, and can contain one or more gaps.
Phase 3 sequences are high-quality finished sequences that do not contain gaps.
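
The rule that ties phases to divisions (phases 0, 1, and 2 stay in HTG; phase 3 moves to the organismal division) can be sketched as a small function. This is an illustrative Python sketch under stated assumptions: the names `EXAMPLE_ORGANISM_DIVISIONS` and `division_for` are hypothetical, and the organism mapping covers only the phase 3 examples named on this page, not a full taxonomy lookup.

```python
# Organism -> organismal division, limited to the phase 3 examples given on
# this page. A real assignment would depend on full taxonomy, which this
# sketch does not attempt to model.
EXAMPLE_ORGANISM_DIVISIONS = {
    "human": "PRI",
    "Drosophila": "INV",
    "C. elegans": "INV",
    "Arabidopsis": "PLN",
}


def division_for(phase, organism):
    """Return the GenBank division for an HTG record at the given phase."""
    if phase in (0, 1, 2):
        # Unfinished sequences from all organisms are kept in HTG.
        return "HTG"
    if phase == 3:
        # Finished sequences move to the appropriate organismal division.
        try:
            return EXAMPLE_ORGANISM_DIVISIONS[organism]
        except KeyError:
            raise LookupError(
                f"no example division recorded for organism {organism!r}"
            ) from None
    raise ValueError(f"phase must be 0, 1, 2, or 3, got {phase!r}")
```

Because a record exists in only one division at a time, the function returns a single code: a human clone at phase 1 or 2 resolves to HTG, and the same accession at phase 3 resolves to PRI.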


Revised: July 25, 2002.
