Appl Bioinformatics. 2003;2(4):197-208.

Overcoming confounded controls in the analysis of gene expression data from microarray experiments.

Author information

Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15232, USA.


Data from microarray experiments are of limited value when improper control samples are used. In cancer research, comparing tumour expression profiles with those from normal samples is challenging because of tissue heterogeneity (mixed cell populations). A specific example exists in a published colon cancer dataset, in which tissue heterogeneity was reported among the normal samples. In this paper, we show how to overcome or avoid the problem of using normal samples that do not derive from the same tissue of origin as the tumour. We advocate an exploratory unsupervised bootstrap analysis that can reveal unexpected and undesired, but strongly supported, clusters of samples that reflect tissue differences instead of tumour versus normal differences. All of the algorithms used in the analysis, including the maximum difference subset algorithm, unsupervised bootstrap analysis, the pooled variance t-test for finding differentially expressed genes, and the jackknife to reduce false positives, are incorporated into our online Gene Expression Data Analyzer ( http:// ).
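
As an illustration of two of the techniques named above, the following is a minimal sketch of a per-gene pooled variance t-test combined with a leave-one-out jackknife. It is not the Gene Expression Data Analyzer's implementation: the abstract does not say how the jackknife is applied, so taking the smallest |t| over all leave-one-sample-out datasets is an assumption, as are the function names `pooled_t` and `jackknife_min_abs_t` and the toy data.

```python
import numpy as np


def pooled_t(x, y):
    """Pooled variance t statistic per gene (rows = genes, columns = samples)."""
    nx, ny = x.shape[1], y.shape[1]
    vx = x.var(axis=1, ddof=1)  # unbiased sample variance, group 1
    vy = y.var(axis=1, ddof=1)  # unbiased sample variance, group 2
    sp2 = ((nx - 1) * vx + (ny - 1) * vy) / (nx + ny - 2)  # pooled variance
    se = np.sqrt(sp2 * (1.0 / nx + 1.0 / ny))
    return (x.mean(axis=1) - y.mean(axis=1)) / se


def jackknife_min_abs_t(x, y):
    """Smallest |t| per gene over the full data and every leave-one-out subset.

    Assumption: a gene is only as convincing as its weakest leave-one-out
    result, so a difference driven by a single sample scores low.
    """
    stats = [np.abs(pooled_t(x, y))]
    for j in range(x.shape[1]):
        stats.append(np.abs(pooled_t(np.delete(x, j, axis=1), y)))
    for j in range(y.shape[1]):
        stats.append(np.abs(pooled_t(x, np.delete(y, j, axis=1))))
    return np.min(stats, axis=0)


# Hypothetical toy data: 2 genes x 5 samples per group.
# Gene 0 differs consistently; gene 1 differs only through one outlying sample.
tumour = np.array([[5.0, 5.1, 4.9, 5.2, 4.8],
                   [1.0, 1.1, 0.9, 1.0, 9.0]])
normal = np.array([[1.0, 1.1, 0.9, 1.2, 0.8],
                   [1.0, 0.9, 1.1, 1.0, 1.0]])

t_full = pooled_t(tumour, normal)
t_jack = jackknife_min_abs_t(tumour, normal)
```

In this toy example the jackknifed score for gene 0 stays large, while the score for gene 1 collapses towards zero once the outlying sample is left out. This is the sense in which a jackknife can reduce false positives.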

[Indexed for MEDLINE]
