3.5. The t distribution#
The
3.5.1. Sampling distribution of the mean#
The sampling distribution of the mean is the distribution you wuld get if you took many different samples of sie
from the population and calculated each of our meansThe null distribution is the sampling distribution of the mean (or another test statistic) that we would expect to get if the null hypothesis were true
3.5.2. Scaling by sample size#
Recall from the previous lecture that the spread of the sampling distribution of the mean depends on the sample size. More precisely:
The standard deviation of the sampling distribution of the mean is the standard error,
where is the sample standard deviation and is the sample size

3.5.3. Pointy top, heavy tails#
The
The
when the sample is tiny (
) we get an extreme pointy top and heavy tailsas the sample size gets large (about 30) the t distribution is almost identical to the normal distribution

This distinction may appear subtle but remember that when a value is statistically significant, it lies in the tails of the null distribution. So if the
Why heavy tails?#
You can skip this bit if you don’t fancy it; it is sufficient to understand that the

3.5.4. Analogy: and #
We saw that if data are normally distributed, that when we standardize data by converting the data to Z-scores, we create a distribution (of Z) with a fixed mean (zero) and standard deviation (1). If the data mean is
From this standardied distribution, we can directly read off the probability of a given data value (eg the probability ot a Z-score greater than 1 is 15%)
Similarly for sample means, if we standardize by converting to
This standardized distribution tells us the probability of a sample mean as large as our observed one
This is the probability of the sample mean arising due to chance under the null hypothesis, if the null hypothesis is that the population mean is zero.
3.5.5. Link to t-test#
Of course, we don’t always expect our population mean to be zero under the null hypothesis. In the following sections, we see three examples
Single sample t-test#
Under the null,
For example we hypothesise that Oxford students’ IQs are higher than the average (by definition the average IQ is 100)
Then our equation to get the standardied
Paired samples t-test#
Under the null,
For example we hypothesise that men are taller than women:
Then our equation to get the standardied
Independent samples t-test#
Under the null,
For example we hypothesise that Oxford students’ IQs (
Then our equation to get the standardized
… noting that scipy.stats
.