A visualization of the relationships among the five elements in experimental power analysis:
Effect Size, Sample Size (
It's important to determine the required power level and smaple size before running an A/B testing.
Inspired by several excellent power analysis tools / sample size calculators, I generated this small chart to show the trade-off relationships between the key metrics in power analysis and how we could increase statistical power:
| How to | Visualization | Note |
|---|---|---|
| Increase effect size | Larger distance between |
Stronger "signal" is easier to detect |
| Derease SD ( |
Narrower distributions | Reduce noise, ask clear questions |
| Increase sample size ( |
Narrower distributions | More data points shrink the standard error |
| Change test type ( |
Modifing the shape of distributions | Choice depends on the underlying data/assumptions |
| Increase significance level ( |
Move the reject line left | Dangerous, could be wrong |
In visual eyes, statistical power is determined by: the relative positions of the
Methods 1–4 shift the former by reducing distribution overlap. The "dangerous" 5th affects the latter by simply lowering the rejection bar. Among these, increasing sample size (
The main code drawing the chart is in scripts/plot.py, and the future plan is a web-based chart for interactive exploration.
Reference: