NCBI has developed an automatic annotation pipeline that combines ab initio gene prediction algorithms with homology based methods. See more details
here.
Historically RefSeq prokaryotic genomes retained on author submitted annotation. Annotation from different submitters varies in quality resulting in the inconsistent annotation even in closely related genomes. The NCBI Prokaryotic Annotation Pipeline can produce consistent high quality automatic annotation.
The statistics for all NCBI RefSeq Prokaryotic Assemblies:
| Organism Group | Genera | Species | Genomes | Assemblies | Proteins |
|---|
| Bacteria | 4344 | 71826 | - | | 379055321 |
| Archaea | 221 | 1342 | - | | 4841975 |
The statistics for NCBI RefSeq Prokaryotic Assemblies associated with this project:
| Publications | - Goldfarb T et al., "NCBI RefSeq: reference sequence standards through 25 years of curation and annotation.", Nucleic Acids Res, 2025 Jan 6;53(D1):D243-D257
- Li W et al., "RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation.", Nucleic Acids Res, 2021 Jan 8;49(D1):D1020-D1028
|
| Submission | Registration date: 23-Oct-2013 NCBI |
| NCBI Links | |