Sample size and power calculations for 1-way ANOVA rolling in FDR
- 4-group exposure (
$n_1 = 127$ ,$n_2 = 74$ ,$n_3 = 45$ ,$n_4 = 32$ ) - can be generalized to$G$ groups. Let$N$ be the total sample size. - Continuous outcome (can assumed to be normally distributed and standardized)
- Comparisons via ANOVA
- Assume group 1 has mean
$\delta$ and remaining groups have mean 0. See below section on defining the effect size relative to Cohen's definition (1988) in the standard ANOVA with equal group sizes.
- Assume group 1 has mean
- 3000 proteins (outcomes) so want to control FDR
- Ideally, assumed N, 80% and produce minimal detectable effect sizes
- Write a function to calculate power (via simulations) for 1-way ANOVA with possibly unequal group sizes
- Suppose that of the
$m=3000$ hypothesis tests performed,$\pi_0$ denotes the proportion that are truly null. The goal is to design a study so that we are powered at$(1-\beta)$ to declare significance for the$(1-\pi_0)\times m$ tests that have a signal, while controlling for the FDR, denoted$f$ . Jung (2005 Bioinformatics) has shown that the p-value threshold ($\alpha$ ) used to declare significance is estimated by the formula below. We could the bespoke power function described above to calculate the minimum detectable effect sizes, given the group sample sizes and desired power based on value of$\alpha$ .
For ANOVA with equal group sizes
which for the case of 4 groups is
Cohen (1988) defined the effect size
In our case, we assume
So for a given effect size