Linear regression assumptions are the conditions your data must meet for the model's coefficients, standard errors, and p-values to be trustworthy. The core set is linearity, independence of errors, homoscedasticity, and normality of residuals, with no severe multicollinearityoften added as a fifth. Check them before you interpret a single coefficient, because a violated assumption can quietly invalidate your whole results chapter.
Why your committee asks you to check assumptions first
A regression will always print numbers, whether or not the assumptions hold, so the output looks convincing even when it is misleading. Your supervisor knows this, which is why a credible results chapter shows the diagnostic checks before the interpretation. Reporting a tidy table of coefficients without evidence that the residuals behave is a common way to lose marks. Treat the checks as part of the analysis, not an optional appendix.
The assumptions, one diagnostic at a time
Linearity means the relationship between each predictor and the outcome is genuinely straight; you check it with a plot of residuals against fitted values, looking for a flat band with no curve. Independence of errors means one observation's error does not predict the next, which matters most for time-ordered or clustered data and is tested with a Durbin-Watson statistic. Homoscedasticity means the spread of residuals stays constant across fitted values; a funnel shape in that same residual plot signals the heteroscedasticity you want to avoid. Normality of residuals means the errors follow a normal distribution, which you judge with a Q-Q plot rather than the raw outcome variable.
Read the Q-Q plot above as the assumption-check for normality: when the points track the dashed reference line, the residuals are close to normal. Bends away from the line, especially at the tails, point to skew or heavy tails. This is the same logic you use when testing a variable for normality, except here you apply it to the model's residuals, not to the raw data.
The fifth assumption and the Gauss-Markov conditions
The fifth item people add is no severe multicollinearity: your predictors should not be so highly correlated that the model cannot separate their effects, which you check with the variance inflation factor. The narrower Gauss-Markov conditions, which guarantee that ordinary least squares gives the best linear unbiased estimates, are a related but distinct list: a linear model, a zero-mean error term, constant error variance, and uncorrelated errors. Gauss-Markov does not require normality at all; normality is what you add to justify the p-values and confidence intervals. If any predictor is itself binary, you may be drifting toward interpreting a logistic regression instead.
What to do when an assumption fails
A failed check is a finding, not a dead end. Curvature in the residuals often responds to a transformation or a polynomial term; heteroscedasticity can be handled with robust standard errors; influential points show up in leverage diagnostics and may need justified removal during reading the SPSS output. Remember too that a strong correlation between predictors is about association, and association is not the same thing as the causal story you tell, a distinction worth keeping straight from correlation versus causation. Document every check, every fix, and the reasoning behind it, and your regression will read as careful rather than convenient.