2.4. Multiple Regression Q & A#
The term multiple regression simply means having multiple x variables. (Whereas “simple regression” like last week’s examples have just a single
variable). Note that the variable may be called a “control variable” or an “explanatory variable”. What is the difference between a control variable and an explanatory variable?
Click to reveal answer
An explanatory variable is one that is important for the hypothesis being tested. A control variable is a potential confounder. Note that for the statistics/ for Python, a control and an explanatory variable are the same thing, just an
How many
variables can you include in a regression analysis?
Click to reveal answer
As many as you like! Although there will usually be constraints due to sample size and practicality. We want to keep the model parsimonious and efficient. And all the control variables should be there for good reason.
Last week, we saw the regression equation
What is the form of the regression equation when there are several variables?
Click to reveal answer
The regression equation is additive. We can add more slope*
How does the interpretation of the intercept change in multiple regression?
Click to reveal answer
Let’s take the health example from above, where we specify a regression equation as follows:
health=intercept +
We interpret the intercept to be the level of health when income is zero, education is zero, and smoking is zero. (The exact interpretation will of course depend on how I have measured the