Transitive functional annotation by shortest-path analysis of gene expression data

Proc Natl Acad Sci U S A. 2002 Oct 1;99(20):12783-8. doi: 10.1073/pnas.192159399. Epub 2002 Aug 26.

Abstract

Current methods for the functional analysis of microarray gene expression data implicitly assume that genes with similar expression profiles have similar functions in cells. However, among genes involved in the same biological pathway, not all gene pairs show high expression similarity. Here, we propose that transitive expression similarity among genes can be used as an important attribute to link genes of the same biological pathway. Based on large-scale yeast microarray expression data, we use shortest-path analysis to identify transitive genes between two given genes from the same biological process. We find that the method identifies not only functionally related genes with correlated expression profiles but also functionally related genes whose profiles are not correlated. In the latter case, we compare our method to hierarchical clustering and show that our method can reveal functional relationships among genes more precisely. Finally, we show that our method can be used to reliably predict the function of unknown genes from known genes lying on the same shortest path. We assigned functions to 146 yeast genes that are considered unknown by the Saccharomyces Genome Database and by the Yeast Proteome Database. These genes constitute around 5% of the unknown yeast ORFome.
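
The idea of transitive expression similarity can be illustrated with a small sketch. It is not the authors' implementation: the gene names, the toy expression profiles, the correlation cutoff (`min_abs_corr`), and the edge length `1 - |r|` are all assumptions chosen for illustration, since the abstract does not specify them. Genes are nodes, sufficiently correlated pairs are joined by an edge, and Dijkstra's algorithm finds the shortest path between two genes. Any genes on that path are the "transitive" intermediates.

```python
import heapq
import math


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def build_graph(profiles, min_abs_corr=0.7):
    """Link gene pairs whose |correlation| reaches the cutoff.

    Edge length is 1 - |r| (an assumed choice), so strongly
    correlated pairs are "close" in the graph.
    """
    genes = sorted(profiles)
    graph = {g: {} for g in genes}
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            r = abs(pearson(profiles[g], profiles[h]))
            if r >= min_abs_corr:
                graph[g][h] = graph[h][g] = 1.0 - r
    return graph


def shortest_path(graph, source, target):
    """Dijkstra's algorithm; returns the gene list or None if unreachable."""
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        if u == target:
            break
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist.get(v, math.inf):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in dist:
        return None
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]


# Hypothetical toy profiles (four conditions each), not real yeast data.
# geneA and geneC are only weakly correlated with each other (r ~ 0.45),
# but both correlate well with geneB; geneD is uncorrelated with all three.
PROFILES = {
    "geneA": [2.0, -2.0, 0.0, 0.0],
    "geneB": [2.0, -2.0, 1.0, -1.0],
    "geneC": [1.0, -1.0, 2.0, -2.0],
    "geneD": [1.0, 1.0, -1.0, -1.0],
}
GRAPH = build_graph(PROFILES)

print(shortest_path(GRAPH, "geneA", "geneC"))  # ['geneA', 'geneB', 'geneC']
print(shortest_path(GRAPH, "geneA", "geneD"))  # None
```

In this toy case geneA and geneC share no direct edge, yet they are connected through geneB, which is the kind of indirect link the abstract describes. With real data, the cutoff and the edge-length function would need to be chosen and validated against known annotations.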

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Cell Nucleus / metabolism
  • Cytoplasm / metabolism
  • DNA / genetics*
  • Databases as Topic
  • Gene Expression
  • Genetic Techniques*
  • Genome, Fungal*
  • Mitochondria / metabolism
  • Open Reading Frames
  • Proteins / analysis
  • Saccharomyces cerevisiae / genetics*
  • Statistics as Topic / methods*

Substances

  • Proteins
  • DNA