Eukaryotic Genome Annotation at NCBI

Genome annotation available in NCBI resources can come from several different sources.

Annotation on GenBank genomes

Annotations, if any, on genomic sequence records in GenBank are provided by the group that submitted the genomic sequences to one of the databases in the International Nucleotide Sequence Database Collaboration (INSDC), i.e. DDBJ, ENA or GenBank.

Annotation on RefSeq genomes

Genomic sequences in NCBI's Reference Sequence (RefSeq) collection always have annotation. The annotation on a RefSeq genome can come from one of three different sources, depending on the organism:

  1. the submitter's annotation, copied from the GenBank genomic sequence records
  2. curated annotation provided by a model organism database, for example FlyBase or WormBase
  3. annotation generated at NCBI by running the genome through our Eukaryotic Genome Annotation Pipeline. See details of the process in the Eukaryotic Genome Annotation chapter of the NCBI Handbook.

  - See the NCBI eukaryotic genome annotation policy
  - See all genomes annotated by the NCBI Eukaryotic Genome Annotation Pipeline, with links to available resources for each
  - See eukaryotic genome annotation runs currently in progress

NCBI also provides Prokaryotic Genome Annotation.

Last updated: 2017-11-13T21:07:02Z