The Bias-Variance Tradeoff Deep Dive
The bias-variance tradeoff is the central theoretical lens for understanding why models fail — either because they are too simple (high bias) or too sensitive to training data (high variance).
Decomposing Prediction Error
The expected prediction error for any model can be decomposed as: Error = Bias\u00b2 + Variance + Irreducible Noise. You can reduce bias or variance, but never both simultaneously without more or better data.
Bias: Systematic Error
Bias is the error from wrong assumptions in the learning algorithm. A high-bias model is too simple — it consistently misses the true pattern (e.g., fitting a straight line to clearly curved data). High bias leads to underfitting.
Variance: Sensitivity to Training Data
Variance is the error from excessive sensitivity to small fluctuations in training data. A high-variance model learns the noise in the training set and fails to generalise (e.g., a degree-15 polynomial on sparse data). High variance leads to overfitting.
The Tradeoff in Practice
As model complexity grows, bias decreases but variance increases. The optimal model sits at the sweet spot where total error is minimised.