HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
HomoloGene Release 61 Statistics



Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species.

Species   Genes (input)   Genes (grouped)   HomoloGene groups
Homo sapiens 22,849* 19,969   19,357
Pan troglodytes 25,096  17,423   16,938
Canis lupus familiaris 19,766  16,735   16,297
Bos taurus 23,797  18,160   16,687
Mus musculus 25,388  21,537   19,409
Rattus norvegicus 21,991  19,094   17,868
Gallus gallus 17,959  13,033   12,324
Danio rerio 33,859  20,708   15,621
Drosophila melanogaster 14,085  8,187   7,974
Anopheles gambiae 13,909  8,463   7,910
Caenorhabditis elegans 20,077  5,307   5,078
Schizosaccharomyces pombe 5,043  3,210   3,174
Saccharomyces cerevisiae 5,880  4,742   4,591
Kluyveromyces lactis 5,335  4,457   4,426
Eremothecium gossypii 4,722  3,951   3,942
Magnaporthe grisea 12,832  6,841   6,401
Neurospora crassa 10,079  6,131   6,125
Arabidopsis thaliana 26,981  13,378   13,046
Oryza sativa 26,887  12,974   12,603
Plasmodium falciparum 5,266  989   960


'*' indicates organisms for which new genome annotation data are used in this build.
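The share of input genes that ended up in a homology group can be read off the table directly. The following is a minimal Python sketch using three rows copied from the table above; the names `STATS` and `grouped_fraction` are illustrative and are not part of any HomoloGene tool.

```python
# Rows copied from the Release 61 table:
# species -> (input genes, grouped genes, HomoloGene groups)
STATS = {
    "Homo sapiens": (22849, 19969, 19357),
    "Caenorhabditis elegans": (20077, 5307, 5078),
    "Plasmodium falciparum": (5266, 989, 960),
}


def grouped_fraction(species):
    """Return the fraction of input genes placed in a homology group."""
    n_input, n_grouped, _groups = STATS[species]
    return n_grouped / n_input


for name in STATS:
    print(f"{name}: {grouped_fraction(name):.1%} of input genes grouped")
```

The same pattern extends to the remaining rows; only the three tuples shown here were transcribed.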


Last updated on: Fri Mar 7 2008



We have recently adopted a new build procedure that makes use of amino acid sequence searching (blastp) to find more distant relationships, but the procedure still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.
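One way to picture a matching order guided by the taxonomic tree is a post-order walk over a guide tree, so that sister species are compared before more distant groups. The sketch below only illustrates that ordering idea: the tree is a simplified, hand-written example, the names `TREE` and `comparison_order` are hypothetical, and this is not the HomoloGene build code.

```python
# Hypothetical guide tree as nested pairs; leaves are species names.
# The topology is a simplified illustration, not NCBI taxonomy data.
TREE = (
    (("Homo sapiens", "Pan troglodytes"), ("Mus musculus", "Rattus norvegicus")),
    "Gallus gallus",
)


def comparison_order(node, steps):
    """Post-order walk: return the species under `node`, recording each merge.

    Each recorded step is a (left_species, right_species) pair, so the
    closest relatives appear in `steps` before more distant groups.
    """
    if isinstance(node, str):
        return [node]
    left = comparison_order(node[0], steps)
    right = comparison_order(node[1], steps)
    steps.append((left, right))
    return left + right


steps = []
comparison_order(TREE, steps)
for left, right in steps:
    print(" + ".join(left), "vs", " + ".join(right))
```

In this toy tree, the human and chimpanzee gene sets are compared first, then mouse and rat, then the two resulting groups, and finally the chicken against all four mammals.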




Sources of Additional Information



HomoloGene entries have been augmented with homology and phenotype information drawn from the following sources.

Online Mendelian Inheritance in Man (OMIM)

Mouse Genome Informatics (MGI)

Zebrafish Information Network (ZFIN)

Saccharomyces Genome Database (SGD)

Clusters of Orthologous Groups (COG)

FlyBase

 

What's New
HomoloGene release 61 is now public. It incorporates updated annotation for Homo sapiens (NCBI build 36.3, Mar. 4, 2008).



Related Resources


Entrez Genomes


A collection of complete genome sequences that includes more than 1000 viruses and over a hundred microbes.

  Archaea

  Bacteria

  Eukaryota

  Viruses



  COGs

Phylogenetic classification of proteins encoded in complete genomes.