Power Curve and Sample Size
Learn how statistical power curves change with sample size through formulas and interactive visualization.
What Is Statistical Power?
Statistical power is the probability that a test correctly rejects the null hypothesis when the alternative is true—that is, the chance of detecting an actual effect. It answers the question:
“If there truly is a difference, how likely is the test to detect it?”
A high power means a low risk of Type II error (false negative). Power depends on four factors:
- Significance level
- Effect size
- Population standard deviation
- Sample size
This article focuses on how increasing the sample size improves power and reshapes the power curve.
The Power Function: A Step-by-Step Derivation
We consider a one-sided -test with the following setup:
- Null hypothesis:
- Alternative hypothesis:
- Known standard deviation
- Sample mean
1. The Test Statistic
The standardized test statistic is:
2. Rejection Region
We reject the null hypothesis if:
where is the upper quantile of the standard normal distribution, i.e.,
Noncentral Distribution Under the Alternative
Now suppose the alternative is true, i.e., . Then:
So, under the alternative hypothesis:
The mean shift is called the noncentrality parameter:
This tells us the test statistic follows a noncentral normal distribution when the null is false.
Power Function
Power is the probability that this noncentral distribution falls into the rejection region:
where is the standard normal cumulative distribution function.
Example: How Power Changes with Sample Size
Let’s fix parameters as follows:
- Effect size
- Standard deviation
- Significance level
Then compute power for different :
| Sample Size | Power |
|---|---|
| 10 | 0.158 |
| 30 | 0.385 |
| 50 | 0.553 |
| 100 | 0.809 |
As increases, the noncentral distribution shifts right, increasing the chance it exceeds , and hence increasing power.
Key Takeaways
- Power increases as sample size increases.
- Under the alternative, the test statistic shifts right → higher chance to reject .
- The shift is quantified by the noncentrality parameter .
- The power function shows how likely we are to detect a true effect.
- Visualizing power curves helps in planning sample size effectively.