4.5. Effect size (Pearson’s r)#
The effect size for a correlation is quantified simply by the correlation coefficient Pearson’s
Remember an effect size tells us how large the effect of interest is, in comparison to random variation.
In the case of a correlation, the effect of interest is the relationship between
Pearson’s

The correlation coefficient
Correlation gives us a ‘pure’ measure of the strength of the relationship between
4.5.1. Effect size statistical significance#
Note that for any given correlation, whether it is believable or statistically significant also depends on the sample size
Consider the following examples:
the top row all have sample size
the bottom row have sample size

The red oval captures the shape of the dot cloud; an elongated shape is a stronger correlation and greater effect size.
Nonetheless, for a given effect size, the correlation is less convincing for small
Statistical significance of #
For a given values of Pearson’s
We calculate a
and then the
However you can get straight to the scipy.stats
- note that we are using scipy.stats
as the other correlation functions we met in numpy
and pandas
do not return a
Here is the syntax to test the correlation between Maths and English scores, assuming these are columns in a dataframe scores
:
stats.pearsonr(scores.maths, scores.english).pvalue
What counts as a large effect#
Wikipedia tells me that

… but bear in mind this is just a rule of thumb.