The effect of replication on gene expression microarray experiments

Bioinformatics. 2003 Sep 1;19(13):1620-7. doi: 10.1093/bioinformatics/btg227.

Abstract

Motivation: We examine the effect of replication on the detection of apparently differentially expressed genes in gene expression microarray experiments. Our analysis is based on a random sampling approach using real data sets from 16 published studies. We consider both the ability to find genes that meet particular statistical criteria as well as the stability of the results in the face of changing levels of replication.

Results: While dependent on the data source, our findings suggest that stable results are typically not obtained until at least five biological replicates have been used. Conversely, for most studies, 10-15 replicates yield results that are quite stable, and there is less improvement in stability as the number of replicates is further increased. Our methods will be of use in evaluating existing data sets and in helping to design new studies.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Animals
  • DNA Replication / genetics
  • Data Interpretation, Statistical*
  • Databases, Nucleic Acid
  • Gene Expression Profiling / methods*
  • Humans
  • Models, Genetic*
  • Models, Statistical*
  • Neoplasms / genetics
  • Oligonucleotide Array Sequence Analysis / methods*
  • Quality Control
  • Reproducibility of Results
  • Research Design
  • Sample Size
  • Sensitivity and Specificity
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods