NCBI logo
Types of Databases module of the MLA course on Introduction to Molecular Biology Information Resources
Course Home Modules Schedule Exercises Comments Credits
Slide 1 Previous Next Slide List

RefSeq

  • Database of reference sequences


  • Curated


  • Non-redundant; one record for each gene, or each splice variant, from each organism represented


  • A representative GenBank record is used as the source for a RefSeq record


  • Value-added information is added by an expert(s)


  • Each record is intended to present an encapsulation of the current understanding of a gene or protein, similar to a review article


  • Variety of accession number prefixes (NM_ , NP_ , etc.) and status codes (provisional, reviewed, etc.). More about those in later slides.


  • RefSeq database includes genomic DNA, mRNA, and protein sequences, so organizes information according to the model of the central dogma of biology


  • Accessible through Entrez, BLAST, and FTP site
    (RefSeq records are available in various Entrez Databases such as Nucleotide, Protein, Genome, and are also accessible from Entrez Gene records)


  • more about RefSeq

Types of Databases
Slide 1 Previous Next Slide List
Revised 11/01/2007