Power and stability properties of resampling-based multiple testing procedures with applications to gene oncology studies

Comput Math Methods Med. 2013:2013:610297. doi: 10.1155/2013/610297. Epub 2013 Nov 20.

Abstract

Resampling-based multiple testing procedures are widely used in genomic studies to identify differentially expressed genes and to conduct genome-wide association studies. However, the power and stability properties of these popular resampling-based multiple testing procedures have not been extensively evaluated. Our study focuses on investigating the power and stability of seven resampling-based multiple testing procedures frequently used in high-throughput data analysis for small sample size data through simulations and gene oncology examples. The bootstrap single-step minP procedure and the bootstrap step-down minP procedure perform the best among all tested procedures, when sample size is as small as 3 in each group and either familywise error rate or false discovery rate control is desired. When sample size increases to 12 and false discovery rate control is desired, the permutation maxT procedure and the permutation minP procedure perform best. Our results provide guidance for high-throughput data analysis when sample size is small.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Computer Simulation
  • Female
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Neoplastic
  • Genome-Wide Association Study
  • Humans
  • Medical Oncology / methods*
  • Models, Theoretical
  • Neoplasms / genetics*
  • Neoplasms / pathology
  • Oligonucleotide Array Sequence Analysis / methods*
  • Ovarian Neoplasms / genetics
  • Ovarian Neoplasms / metabolism
  • Research Design*
  • Sample Size