NCBI Home Genomic Biology Rat Genome Resources Statistics

Rat Gene and RefSeq Statistics

Additional statistics are available from:

Definition of reported columns:
  1. category: the type of object being counted (Genes, RefSeq accessions, PubMed IDs etc)
    • curated RefSeq: the pool of RefSeq records that are available for manual curation.
    • model RefSeq: the pool of RefSeq records that are a product of NCBI's computational genome annotation pipeline and which are not available for manual curation.
  2. total: the count of objects, per status
  3. status: the type of Gene or RefSeq
    • standard: a Gene that is defined based on data from public sources such as GenBank or RGD. Or, a Gene that was predicted by NCBI's computational genome annotation process and was later determined to have sufficiently strong homology support to track as valid.
    • computed: a Gene record that is defined by NCBI's computational genome annotation pipeline.
Statistics Report:
Last updated: April 25, 2008

category                                                         total       status        
---------------------------------------------------------------- ----------- --------
GeneIDs, total                                                   19245   standard
GeneIDs, total                                                   18556   computed
GeneIDs, with official nomenclature                              17356   standard
GeneIDs, with official nomenclature                               8646   computed
GeneIDs, with interim nomenclature                                1715   standard
GeneIDs, with interim nomenclature                                9621   computed
GeneIDs, protein-coding                                          17141   standard
GeneIDs, protein-coding                                          11138   computed
GeneIDs, with associated phenotype                                 926   standard
GeneIDs, associated with sequence                                16993   standard
GeneIDs, with curated RefSeqs                                    15244   standard
GeneIDs, with curated protein-coding RefSeqs                     14300   standard
GeneIDs, with Swiss-Prot accessions                               6761   standard
GeneIDs, with TrEMBL accessions                                   5588   standard
RefSeqs, curated protein-coding                                  14571   standard
RefSeqs, curated ncRNA                                              16   standard
RefSeqs, curated pseudogenes                                       928   standard
PubMed IDs, total                                                35201   standard
PubMed IDs, total                                                   83   computed
GeneIDs, with PubMed IDs                                         13869   standard
GeneIDs, with PubMed IDs                                            62   computed
GeneIDs, with model RefSeqs                                       1820   standard
GeneIDs, with model RefSeqs                                      11489   computed
GeneIDs, with model protein-coding RefSeqs                        1762   standard
GeneIDs, with model protein-coding RefSeqs                        8885   computed
RefSeqs, model non-coding (all assemblies)                         108   standard
RefSeqs, model non-coding (all assemblies)                        3875   computed
RefSeqs, model protein-coding (all assemblies)                    3648   standard
RefSeqs, model protein-coding (all assemblies)                   11743   computed
GeneIDs with model RefSeqs on the reference assembly              1718   standard
GeneIDs, with model RefSeqs on a non-reference assembly           1605   standard
GeneIDs, unique to reference assembly                              792   standard
GeneIDs, unique to reference assembly                             4982   computed
GeneIDs, unique to any alternate assembly                          343   standard
GeneIDs, unique to any alternate assembly                         4231   computed
GeneIDs with sequence, not on any build                            440   standard
GeneIDs, with GeneRIFs                                            4053   standard
GeneIDs, with GeneRIFs                                              41   computed
GeneIDs, in HomoloGene groups                                    14852   standard
GeneIDs, in HomoloGene groups                                     4194   computed
GeneIDs in HomoloGene clusters including human/mouse/rat         12643   standard
GeneIDs in HomoloGene clusters including human/mouse/rat          2187   computed
GeneIDs in HomoloGene clusters including mouse/rat only            803   standard
GeneIDs in HomoloGene clusters including mouse/rat only            513   computed
GeneIDs associated with markers                                  11473   standard
GeneIDs associated with markers                                   3877   computed