An Introduction to Multiple Regression Flashcards Preview

Question 1

Q

How do you calculate a residual?

Answer

A

It is the observed value minus the predicted value (residual = observed - predicted).
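
A minimal Python sketch of the subtraction, using made-up observed and predicted scores (the function name and the numbers are illustrative assumptions, not taken from the deck):

```python
def residuals(observed, predicted):
    """Return observed minus predicted for each case."""
    return [o - p for o, p in zip(observed, predicted)]

# Hypothetical scores, chosen only to illustrate the subtraction.
observed = [3.0, 5.0, 7.5]
predicted = [2.5, 5.5, 7.0]
res = residuals(observed, predicted)
print(res)  # [0.5, -0.5, 0.5]
```

A positive residual means the model under-predicted that case; a negative one means it over-predicted.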

Question 2

Q

What is a Partial Correlation?

Answer

A

It is the correlation between two variables while controlling for the effect of a third variable on both of them.
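
One common way to compute it is the first-order formula built from the three pairwise correlations. The sketch below assumes that formula; the function name and the correlation values are hypothetical:

```python
import math

def partial_corr(r_xy, r_xz, r_yz):
    """Correlation of x and y with z removed from BOTH x and y."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz**2) * (1 - r_yz**2))
    return numerator / denominator

# Hypothetical pairwise correlations.
r = partial_corr(r_xy=0.5, r_xz=0.4, r_yz=0.3)
print(round(r, 3))  # 0.435
```

If the third variable is unrelated to both of the others (r_xz = r_yz = 0), the result is simply r_xy.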

Question 3

Q

What is a Semi-Partial Correlation?

Answer

A

It is the correlation between two variables while controlling for the effect of a third variable on only one of them.
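
A sketch of the usual first-order formula, assuming the third variable z is removed from the predictor x only and the outcome y is left untouched (the function name and values are hypothetical):

```python
import math

def semi_partial_corr(r_xy, r_xz, r_yz):
    """Correlation of y with x after removing z from x ONLY."""
    numerator = r_xy - r_xz * r_yz
    # Only the predictor x is residualised, so only r_xz appears here.
    denominator = math.sqrt(1 - r_xz**2)
    return numerator / denominator

# Hypothetical pairwise correlations.
sr = semi_partial_corr(r_xy=0.5, r_xz=0.4, r_yz=0.3)
print(round(sr, 3))  # 0.415
```

The squared value can be read as the share of total outcome variance that x explains uniquely.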

Question 4

Q

What are the 3 main things we can learn from a Multiple Regression model?

Answer

A

How well the model explains the outcome.
How much variance in the outcome our model explains.
The importance of each individual predictor.

Question 5

Q

What are the 3 main types of Multiple Regression?

Answer

A

Forced entry (all predictors entered at once).
Hierarchical (the researcher decides the order in which variables are entered).
Stepwise (the software, e.g. SPSS, decides the variable order using statistical criteria).

Question 6

Q

What program should you use to determine the sample size needed (which depends on the effect size)?

Answer

A

G*Power.

Question 7

Q

What is R-Squared?

Answer

A

It is the proportion of variance in the outcome (DV) that is accounted for by the model.
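
As a rough sketch, for a least-squares model with an intercept this proportion can be computed as 1 minus the ratio of residual to total sum of squares. The data are made up; the predicted values are the two group means a dichotomous predictor would give:

```python
def r_squared(observed, predicted):
    """Proportion of outcome variance explained: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - ss_res / ss_tot

# Hypothetical outcome scores and model predictions.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 1.5, 3.5, 3.5]
r2 = r_squared(observed, predicted)
print(round(r2, 2))  # 0.8
```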

Question 8

Q

How do we know if our model generalises well?

Answer

A

The closer the Adjusted R-Squared is to R-Squared, the better the model is likely to generalise to other samples.
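
A sketch of the widely used adjustment formula, assuming n cases and k predictors (the sample sizes below are hypothetical). It shows the gap widening as the sample gets small relative to the number of predictors:

```python
def adjusted_r_squared(r2, n, k):
    """Adjust R-squared for sample size n and number of predictors k."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# Same R-squared, same number of predictors, different sample sizes.
large_sample = adjusted_r_squared(0.8, n=50, k=3)
small_sample = adjusted_r_squared(0.8, n=10, k=3)
print(round(large_sample, 3))  # 0.787
print(round(small_sample, 3))  # 0.7
```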

Question 9

Q

Why is R not useful?

Answer

A

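With several predictors, R is the multiple correlation between the observed and predicted outcome values, so it does not describe the relationship between the outcome and any single predictor.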

Question 10

Q

Why are the Standardised Coefficients (Beta) important?

Answer

A

They put all predictors on the same scale, so we can compare them to decide which are the most important. The larger the absolute value, the more important the variable is as a predictor.
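
A sketch of the standard conversion from an unstandardised coefficient, beta = B × SD(predictor) / SD(outcome). The data and units are hypothetical; the point is that beta stays the same when the predictor's unit changes, while B does not:

```python
from statistics import stdev

def standardised_beta(b, predictor, outcome):
    """Convert an unstandardised coefficient B into a standardised beta."""
    return b * stdev(predictor) / stdev(outcome)

outcome = [2.0, 4.0, 6.0, 8.0, 10.0]

# The same predictor measured in metres and in centimetres (made-up data).
height_m = [1.0, 2.0, 3.0, 4.0, 5.0]
height_cm = [100.0, 200.0, 300.0, 400.0, 500.0]

beta_m = standardised_beta(2.0, height_m, outcome)     # B = 2 per metre
beta_cm = standardised_beta(0.02, height_cm, outcome)  # B = 0.02 per cm
print(round(beta_m, 3), round(beta_cm, 3))  # 1.0 1.0
```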

Question 11

Q

When reporting the regression equation, what are the coefficients called in SPSS?

Answer

A

Unstandardised B.
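
A sketch of how those values slot into the regression equation, predicted outcome = constant + B1 × X1 + B2 × X2 + ... The predictor names and coefficient values below are hypothetical:

```python
def predict(constant, b_values, case):
    """Apply the regression equation using unstandardised B values."""
    return constant + sum(b_values[name] * case[name] for name in b_values)

# Hypothetical constant and B values, as they might be read from output.
constant = 1.0
b_values = {"hours": 2.0, "age": 0.5}

# One hypothetical case to predict.
case = {"hours": 3, "age": 20}
y_hat = predict(constant, b_values, case)
print(y_hat)  # 17.0
```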

Question 12

Q

What are the three assumptions of Multiple Regression pre-experiment?

Answer

A

The outcome variable should be continuous.
The predictor variables should be continuous or dichotomous.
There should be reasonable theoretical ground for including variables.

Question 13

Q

What are the four assumptions of Multiple Regression post-experiment?

Answer

A

Linearity.
Homoscedasticity.
Normal distribution of residuals.
No multicollinearity.

Question 14

Q

What is meant by linearity?

Answer

A

There should be a linear relationship between each predictor and the outcome. Partial plots should be checked for this.

Question 15

Q

What is meant by homoscedasticity?

Answer

A

The variance of the residuals should be constant across all predicted values.

Question 16

Q

What shape indicates heteroscedasticity?

Answer

A

Funnel/cone shape.
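
A funnel means the spread of the residuals changes as the predicted value changes. The sketch below is a rough numerical illustration of that idea, not a formal test and not a substitute for inspecting the plot: it splits the cases at the middle of the predicted values and compares the residual variance in the two halves. The function name and data are made up:

```python
from statistics import pvariance

def spread_ratio(predicted, residuals):
    """Residual variance in the upper half of predictions / lower half."""
    pairs = sorted(zip(predicted, residuals))  # order cases by prediction
    half = len(pairs) // 2
    low = [r for _, r in pairs[:half]]
    high = [r for _, r in pairs[half:]]
    return pvariance(high) / pvariance(low)

predicted = [1, 2, 3, 4, 5, 6, 7, 8]
funnel = [0.1, -0.1, 0.2, -0.2, 1.0, -1.0, 2.0, -2.0]  # spread grows
even = [0.5, -0.5, 0.5, -0.5, 0.5, -0.5, 0.5, -0.5]    # spread constant

funnel_ratio = spread_ratio(predicted, funnel)
even_ratio = spread_ratio(predicted, even)
```

A ratio far from 1 points to a changing spread; a ratio near 1 is what constant variance would look like in this toy setup.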

Question 17

Q

What graph should be looked at when checking for homoscedasticity?

Answer

A

Graph of standardised residuals by standardised predicted values (ZRESID by ZPRED).

Question 18

Q

What two graphs should be looked at when checking for normal distribution of residuals?

Answer

A

Histogram (should be bell shaped) + normal probability plot (points should be close to the diagonal).

Question 19

Q

What two statistics should you look at to check for no multicollinearity?

Answer

A

Tolerance + VIF statistic.

Question 20

Q

What are the rules of thumb for the tolerance and VIF statistics that indicate no multicollinearity?

Answer

A

VIF value should not be larger than 10.

Tolerance value should not be less than 0.1 (although 0.2 is already a concern).
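
A sketch of how the two statistics relate, assuming the standard definitions: tolerance = 1 - R-Squared from regressing one predictor on all the others, and VIF = 1 / tolerance. The cut-offs are the ones stated above; the function names and R-Squared values are hypothetical:

```python
def tolerance_and_vif(r2_with_other_predictors):
    """Return (tolerance, VIF) for one predictor."""
    tolerance = 1 - r2_with_other_predictors
    return tolerance, 1 / tolerance

def is_multicollinear(r2_with_other_predictors):
    """Apply the rules of thumb: VIF > 10 or tolerance < 0.1."""
    tolerance, vif = tolerance_and_vif(r2_with_other_predictors)
    return vif > 10 or tolerance < 0.1

# A predictor that is 95% explained by the others, and one that is 50%.
tol_high, vif_high = tolerance_and_vif(0.95)
tol_ok, vif_ok = tolerance_and_vif(0.5)
print(is_multicollinear(0.95), is_multicollinear(0.5))  # True False
```

Because VIF is the reciprocal of tolerance, the two cut-offs (10 and 0.1) describe the same boundary.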

Question 21

Q

Why is multicollinearity an issue?

Answer

A

A good predictor might be rejected.

It may lead to errors in estimation of regression coefficients.

Question 22

Q

What are two possible solutions for multicollinearity?

Answer

A

Combine predictors.

Remove one of the variables.

Question 23

Q

What is an alternative indication of multicollinearity (not including the VIF + tolerance statistics)?

Answer

A

A high R-Squared with non-significant beta coefficients.

Question 24

Q

Why must all the assumptions in Multiple Regression be met?

Answer

A

Because violations of the assumptions can affect the fit and generalisability of the model.
