3.2. Learning Objectives#
The learning objectives for this week are closely related to those of the previous week
3.2.1. Conceptual#
After this week you should be able to:
Define the term sample and population
Understand the difference between sampling with and without replacement
Define the sampling distribution of a statistic (such as the sample mean or proportion)
Explain how the standard error depends on sample size
and the population variabilityUnderstand that the estimated sampling distribution of the mean for a population can be obtained in several ways including:
Calculated empirically from the population - draw many samples of size
from the population and plot their meansModelled using a known distribution (Normal if central limit theorem applies)
This material is covered in the lecture and recapped in the worked examples in Python
3.2.2. Python skills#
This week there is an emphasis on simulating the process of drawing a large number of (re)samples from a sample distribution
The key skill practiced this week is building a for
loop to
repeat a process many times
(such as drawing a random sample and getting its mean)
You might need to change some variable (such as sample size n) on each pass through the loop.
Additional new(ish) Python skills:
sample from a
pandas
dataframe usingpandas.df.sample()
Plot an expected distribution such as a curve from the function
stats.norm.pdf()
over a histogram of simulated dataPlot a Q-Q plot
This material is covered in the Jupyter Notebooks in this section