Comparison of Sample Size by Bootstrap and by Formulas Based on Normal Distribution Assumption

Ther Innov Regul Sci. 2019 Mar;53(2):170-175. doi: 10.1177/2168479018778280. Epub 2018 May 28.

Abstract

Bootstrapping technique is distribution-independent, which provides an indirect way to estimate the sample size for a clinical trial based on a relatively smaller sample. In this paper, sample size estimation to compare two parallel-design arms for continuous data by bootstrap procedure are presented for various test types (inequality, non-inferiority, superiority, and equivalence), respectively. Meanwhile, sample size calculation by mathematical formulas (normal distribution assumption) for the identical data are also carried out. Consequently, power difference between the two calculation methods is acceptably small for all the test types. It shows that the bootstrap procedure is a credible technique for sample size estimation. After that, we compared the powers determined using the two methods based on data that violate the normal distribution assumption. To accommodate the feature of the data, the nonparametric statistical method of Wilcoxon test was applied to compare the two groups in the data during the process of bootstrap power estimation. As a result, the power estimated by normal distribution-based formula is far larger than that by bootstrap for each specific sample size per group. Hence, for this type of data, it is preferable that the bootstrap method be applied for sample size calculation at the beginning, and that the same statistical method as used in the subsequent statistical analysis is employed for each bootstrap sample during the course of bootstrap sample size estimation, provided there is historical true data available that can be well representative of the population to which the proposed trial is planning to extrapolate.

Keywords: bootstrap; equivalence test; inequality test; non-inferiority test; power; sample size; superiority test.

MeSH terms

  • Algorithms
  • Clinical Trials as Topic / statistics & numerical data*
  • Humans
  • Normal Distribution
  • Sample Size*