A comprehensive and universal method for assessing the performance of differential gene expression analyses

PLoS One. 2010 Sep 9;5(9):e12657. doi: 10.1371/journal.pone.0012657.

Abstract

The number of methods for pre-processing and analysis of gene expression data continues to increase, often making it difficult to select the most appropriate approach. We present a simple procedure for comparative estimation of a variety of methods for microarray data pre-processing and analysis. Our approach is based on the use of real microarray data in which controlled fold changes are introduced into 20% of the data to provide a metric for comparison with the unmodified data. The data modifications can be easily applied to raw data measured with any technological platform and retains all the complex structures and statistical characteristics of the real-world data. The power of the method is illustrated by its application to the quantitative comparison of different methods of normalization and analysis of microarray data. Our results demonstrate that the method of controlled modifications of real experimental data provides a simple tool for assessing the performance of data preprocessing and analysis methods.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / methods*
  • Gene Expression Profiling / standards*
  • Humans
  • Oligonucleotide Array Sequence Analysis / standards

Associated data

  • GEO/GSE22630