4.10. Sample size and power#
We have seen that the power of a test depends on the sample size, and a sample of a certain size is required for an effect to be detected with, say, 80% power.
Most people will accept the intuition that increasing the sample size should make an experiment ‘better’, more reliable or more believable. However it is worth understanding that increasing the sample size primarily affects the proportion of false negatives NOT false positives. Hence sample size is relevant for the power of a test more than its
4.10.1. The proportion of false positives is fixed by design#
In null hypothesis testing, the whole procedure is based on assuming
Choose an
valueAssume
is true
we then
Choose the critical value of the test statistic, so that the proportion of false positives
is true) is , eg 5%.Reminder - the test statistic is a value like
or on which a statistical test is performedthe critical value is the minimum value of that test statistic for which the test is significant, for a given
This is equivalent to calculating the test statistic, finding the
So by definition the proportion of false positives, given the null is true, is fixed at 5% (or whatever
Example#
Consider again the example from the Power by Simulation notebook:
I collect data on end-of year exam scores in Maths and French for 50 high school studehts. Then I calculate the correlation coefficient, Pearson’s
Under the null hypothesis there is no correlation between maths scores and French scores Under the alternative hypothesis, there is a positive correlation
This is a one-tailed test, at the
I can then work out the critical value of
is the smallest value of for which , for a given

As

4.10.2. Proportion of false negatives is not fixed#
In power analysis, the whole procedure is based on assuming
Assuming
One way to do this is simulating a correlated population and sampling from it, as we did in a previous notebook.

Note that
Looking at some samples drawn from a population with a fixed effect size (

This is because of a “double whammy”:
The distribution of
from the correlated population stays centred on and gets tighter around that valuetherefore the tail of the correlated distribution or
is pulled in towards and away from zero
retreats towards zero as the sample size increasesthis is because as
increases, the null distribution ( values from the null population) gets tighter around , so r_{crit}$ moves towards zero therefore to maintain a tail of 5% of false positives
This interaction means that as
ie there are fewer false negatives as
increases
4.10.3. Summary#
The proportion of false positives (if the null were true) is fixed regardless of
The proportion of false negatives increases as
gets smaller

4.10.4. Implications#
Studies with insufficient power (because the sample size is too small) are unlikely to detect an effect even if there is one present in the underlying population. It is a waste of time and resources to carry our such a study.
Correlations, in particular, tend to need large sample sizes to achieve adequate power.