NCBI Home Genomic Biology Horse Genome Resources Statistics

Horse Gene and RefSeq Statistics

Additional statistics are available from:

Definition of reported columns:
  1. category: the type of object being counted (Genes, RefSeq accessions, PubMed IDs etc)
    • cuhorseed RefSeq: the pool of RefSeq records that are available for manual cuhorseion.
    • model RefSeq: the pool of RefSeq records that are a product of NCBI's computational genome annotation pipeline and which are not available for manual cuhorseion.
  2. total: the count of objects, per status
  3. status: the type of Gene or RefSeq
    • standard: a Gene that is defined based on data from public sources such as GenBank or RGD. Or, a Gene that was predicted by NCBI's computational genome annotation process and was later determined to have sufficiently strong homology support to track as valid.
    • computed: a Gene record that is defined by NCBI's computational genome annotation pipeline.
Statistics Report:
Last updated: January 7, 2008
 GeneIDs, total                                                           651 standard
 GeneIDs, total                                                         23428 computed
 GeneIDs, protein-coding                                                  598 standard
 GeneIDs, protein-coding                                                15723 computed
 GeneIDs, associated with sequence                                        601 standard
 GeneIDs, with curated RefSeqs                                            303 standard
 GeneIDs, with curated protein-coding RefSeqs                             302 standard
 GeneIDs, with Swiss-Prot accessions                                      147 standard
 GeneIDs, with TrEMBL accessions                                            0 standard
 RefSeqs, curated protein-coding                                          303 standard
 RefSeqs, curated ncRNA                                                     1 standard
 RefSeqs, curated pseudogenes                                               0 standard
 PubMed IDs, total                                                        258 standard
 PubMed IDs, total                                                          1 computed
 GeneIDs, with PubMed IDs                                                 273 standard
 GeneIDs, with PubMed IDs                                                   1 computed
 GeneIDs, with model RefSeqs                                              223 standard
 GeneIDs, with model RefSeqs                                            16167 computed
 GeneIDs, with model protein-coding RefSeqs                               216 standard
 GeneIDs, with model protein-coding RefSeqs                             15421 computed
 RefSeqs, model non-coding (all assemblies)                                 7 standard
 RefSeqs, model non-coding (all assemblies)                               746 computed
 RefSeqs, model protein-coding (all assemblies)                           250 standard
 RefSeqs, model protein-coding (all assemblies)                         17040 computed
 GeneIDs with sequence, not on any build                                    8 standard
 GeneIDs, with GeneRIFs                                                    25 standard
 GeneIDs, with GeneRIFs                                                     1 computed
 GeneIDs associated with markers                                          138 standard
 GeneIDs associated with markers                                         3136 computed