BMC Bioinformatics. 2007 Mar 30;8:111.

Evaluation of gene-expression clustering via mutual information distance measure.

Author information

  • Department of Industrial Engineering, Tel Aviv University, Israel. ido.priness@gmail.com



The definition of a distance measure plays a key role in the evaluation of different clustering solutions of gene expression profiles. In this empirical study we compare clustering solutions evaluated with the Mutual Information (MI) measure against those evaluated with the well-known Euclidean distance and Pearson correlation coefficient.
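As a rough illustration of how the three measures named above differ, the following sketch computes each of them for a pair of expression profiles. It is not the estimator used in the study: the equal-width binning, the `bins` parameter, and all function names are assumptions made for this example, and the abstract does not say how MI is converted into a distance.

```python
import math
from collections import Counter


def euclidean(x, y):
    """Euclidean distance between two equally long profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def pearson(x, y):
    """Pearson correlation coefficient (undefined for constant profiles)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def discretize(x, bins=3):
    """Equal-width binning; an assumed scheme, chosen only for simplicity."""
    lo, hi = min(x), max(x)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 for constant profiles
    return [min(int((v - lo) / width), bins - 1) for v in x]


def mutual_information(x, y, bins=3):
    """Plug-in MI estimate (in bits) from the joint histogram of the bins."""
    dx, dy = discretize(x, bins), discretize(y, bins)
    n = len(dx)
    px, py, pxy = Counter(dx), Counter(dy), Counter(zip(dx, dy))
    return sum(
        (c / n) * math.log2(c * n / (px[a] * py[b]))
        for (a, b), c in pxy.items()
    )


# Two perfectly anti-correlated toy profiles (made-up values).
x = [1, 2, 3, 4, 5, 6]
y = [6, 5, 4, 3, 2, 1]
print(euclidean(x, y), pearson(x, y), mutual_information(x, y))
```

On these toy profiles the Pearson coefficient is -1 while the MI estimate is high, because MI reflects statistical dependence regardless of its direction. Real expression data would need a more careful discretization or a different MI estimator.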


Relying on several public gene expression datasets, we evaluate the homogeneity and separation scores of different clustering solutions. We found that the MI measure yields a more significant differentiation among erroneous clustering solutions. The proposed measure was also used to analyze the performance of several known clustering algorithms. A comparative study of these algorithms reveals that their "best solutions" are ranked almost oppositely under different distance measures, despite the correspondence found between these measures when analyzing the averaged scores of groups of solutions.
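The abstract does not define the homogeneity and separation scores. One common formulation, assumed here purely for illustration, takes homogeneity as the average distance between profiles in the same cluster and separation as the average distance between profiles in different clusters. The function name `homogeneity_separation`, the pairwise definition, and the toy data are hypothetical; any distance function can be passed in place of the Euclidean default.

```python
import math
from itertools import combinations


def euclidean(x, y):
    """Euclidean distance between two equally long profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def homogeneity_separation(profiles, labels, dist=euclidean):
    """Return (mean within-cluster distance, mean between-cluster distance).

    Assumed pairwise definitions: with a distance measure, a good
    solution has a LOW first value and a HIGH second value.
    """
    within, between = [], []
    for i, j in combinations(range(len(profiles)), 2):
        d = dist(profiles[i], profiles[j])
        (within if labels[i] == labels[j] else between).append(d)
    h = sum(within) / len(within) if within else float("nan")
    s = sum(between) / len(between) if between else float("nan")
    return h, s


# Four made-up profiles forming two obvious groups.
profiles = [[0, 0], [0, 1], [5, 5], [5, 6]]
good = [0, 0, 1, 1]  # matches the obvious grouping
bad = [0, 1, 0, 1]   # deliberately erroneous solution

print(homogeneity_separation(profiles, good))
print(homogeneity_separation(profiles, bad))
```

Scoring the same solutions with a different `dist` argument is one way to see how the choice of measure changes which solution looks best.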


In view of the results, further attention should be paid to the selection of a proper distance measure for analyzing the clustering of gene expression data.

[PubMed - indexed for MEDLINE]
Free PMC Article
