6.4. Causality#

Here is a recap of the three criteria for establishing causality (you saw these in Week 6 last term, “Regression Models, Multivariate Analysis”).

The three criteria for causation are:

  1. Association between the variables, where we must show that \(x\) and \(y\) are associated, i.e., if \(x \rightarrow y\), then as \(x\) changes, the distribution of \(y\) should change in some way.

  2. An appropriate time order: the two variables have the appropriate time order, with the cause preceding the effect.

  3. The elimination of alternative explanations: when two variables are associated and have the proper time order to satisfy a casual relation, this is still insufficient to imply causality. The association may have an alternative explanation, such as a confounding factor on which we have no information.

  • How do experiments help to establish causality?

  • When analysing experimental data, do you need control variables?

https://raw.githubusercontent.com/jillxoreilly/StatsCourseBook/main/images/regression6_health.png

Example: alcohol consumption and health

In a research project about healthy ageing, the research team had to interpret the outcome of the regression table below. The outcome variable is “frailty”, a commonly used measure of functional health among older people, where higher scores indicate higher frailty. One explanatory variable for health behaviours was a categorical variable of alcohol consumption. The reference category (omitted from the table) is “drinking alcohol daily or almost daily”.

Dep. Variable: Frailty

Model: OLS

No. Observations: 10,520

Coef

srd err

P>[t]

Intercept

-6.567

0.448

0.000

Age

0.350

0.006

0.000

Income

-1.064

0.038

0.000

Education: degree

-3.799

0.163

0.000

Alcohol

Once or twice a week

-0.271

0.120

0.024

Once or twice a month

0.896

0.165

0.000

Special occasions only

1.929

0.146

0.000

Not at all

5.596

0.180

0.000

  • Given that we are usually told that drinking alcohol is bad for our health, what is surprising about the result?

  • And, how do you think we can explain this surprising result?