Rank of correlation coefficient as a comparable measure for biological significance of gene coexpression

DNA Res. 2009 Oct;16(5):249-60. doi: 10.1093/dnares/dsp016. Epub 2009 Sep 18.

Abstract

Information regarding gene coexpression is useful to predict gene function. Several databases have been constructed for gene coexpression in model organisms based on a large amount of publicly available gene expression data measured by GeneChip platforms. In these databases, Pearson's correlation coefficients (PCCs) of gene expression patterns are widely used as a measure of gene coexpression. Although the coexpression measure or GeneChip summarization method affects the performance of the gene coexpression database, previous studies for these calculation procedures were tested with only a small number of samples and a particular species. To evaluate the effectiveness of coexpression measures, assessments with large-scale microarray data are required. We first examined characteristics of PCC and found that the optimal PCC threshold to retrieve functionally related genes was affected by the method of gene expression database construction and the target gene function. In addition, we found that this problem could be overcome when we used correlation ranks instead of correlation values. This observation was evaluated by large-scale gene expression data for four species: Arabidopsis, human, mouse and rat.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Arabidopsis / genetics
  • Arabidopsis / metabolism*
  • Computational Biology / methods
  • Databases, Genetic
  • Gene Expression Profiling* / methods
  • Gene Expression Profiling* / statistics & numerical data
  • Gene Expression Regulation*
  • Humans
  • Mice
  • Oligonucleotide Array Sequence Analysis / methods*
  • Plant Proteins / genetics
  • Plant Proteins / metabolism*
  • Proteins / genetics
  • Proteins / metabolism*
  • Rats
  • Species Specificity
  • Statistics as Topic

Substances

  • Plant Proteins
  • Proteins