Calibration
The property that a model's stated probabilities match observed frequencies in the long run. The single most important quality criterion for any betting model.
A model is calibrated if, among all events to which it assigned probability p, the observed frequency of those events occurring is also p. Calibration is distinct from accuracy: a model can be highly accurate (often correct) but poorly calibrated (overconfident or underconfident in its probabilities), and it can be perfectly calibrated while having modest accuracy. For betting and decision-making under uncertainty, calibration is the dominant quality criterion because every downstream decision (Kelly sizing, edge detection, expected value) depends on the probability being a faithful representation of true frequency.
- Kelly stake sizing amplifies probability error. A miscalibrated 70% prediction (when the true frequency is 60%) bet at full Kelly overbets so badly that expected log growth turns negative, driving the bankroll toward zero even though the model wins more often than not (see the sketch after this list).
- Edge detection is the difference between your probability and the market's. Both must be calibrated for the difference to be meaningful.
- Calibrated models compose. You can combine probabilities from a calibrated team-strength model with those from a calibrated injury-impact model. You cannot meaningfully combine miscalibrated ones.
- Calibration is testable empirically. Bin predictions by stated probability, observe actual frequency in each bin, plot the calibration curve. The diagonal is perfect; deviation is the calibration error.
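A minimal sketch of the Kelly point above, in Python. The decimal odds of 1.60 and the 70%/60% probabilities are illustrative assumptions, and `kelly_fraction` and `expected_log_growth` are hypothetical helper names, not functions from any particular library.

```python
import math

def kelly_fraction(p: float, decimal_odds: float) -> float:
    """Full-Kelly stake fraction for a binary bet at the given decimal odds."""
    b = decimal_odds - 1.0          # net payout per unit staked
    return (b * p - (1.0 - p)) / b  # (bp - q) / b

def expected_log_growth(f: float, p_true: float, decimal_odds: float) -> float:
    """Expected log bankroll growth per bet when staking fraction f,
    evaluated at the TRUE win probability."""
    b = decimal_odds - 1.0
    return p_true * math.log(1.0 + f * b) + (1.0 - p_true) * math.log(1.0 - f)

odds = 1.60                         # hypothetical decimal odds (implied probability 62.5%)
p_stated, p_true = 0.70, 0.60

f_stated = kelly_fraction(p_stated, odds)   # stake the miscalibrated model recommends
f_true = kelly_fraction(p_true, odds)       # stake the true probability would recommend

print(f"stake from stated 70%: {f_stated:.3f}")   # 0.200 of bankroll
print(f"stake from true 60%:   {f_true:.3f}")     # negative -> no bet at these odds
print(f"log growth at stated stake: {expected_log_growth(f_stated, p_true, odds):+.4f}")
```

With these illustrative numbers the stated 70% recommends staking 20% of the bankroll on a bet the true 60% says to skip entirely, and expected log growth at that stake is roughly -0.02 per bet: the model wins 60% of the time and the bankroll still decays.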
Bin predictions by stated probability (e.g., bins of width 0.05). For each bin, compute the average stated probability and the observed frequency. Plot the points; perfect calibration is a 45-degree line. Quantitative measures include the Brier score (mean squared difference between stated probability and outcome) and expected calibration error (the count-weighted average gap between stated probability and observed frequency across bins); the reliability diagram is the visual counterpart. For binary outcomes the Brier score is the standard single-number summary, rewarding both calibration and sharpness.
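A sketch of that procedure under the stated assumptions (bins of width 0.05, outcomes encoded 0/1): the function below returns the per-bin points for a reliability diagram along with the Brier score and expected calibration error. `calibration_report` is a hypothetical name, not an existing library function.

```python
import numpy as np

def calibration_report(p: np.ndarray, y: np.ndarray, bin_width: float = 0.05):
    """Bin predictions by stated probability and compare with observed frequency.

    p: stated probabilities in [0, 1]; y: binary outcomes (0/1).
    Returns per-bin (mean stated probability, observed frequency, count),
    the Brier score, and expected calibration error (count-weighted mean gap)."""
    edges = np.linspace(0.0, 1.0, int(round(1.0 / bin_width)) + 1)
    idx = np.clip(np.digitize(p, edges) - 1, 0, len(edges) - 2)

    rows, ece = [], 0.0
    for b in range(len(edges) - 1):
        mask = idx == b
        n = int(mask.sum())
        if n == 0:
            continue
        mean_p = float(p[mask].mean())   # average stated probability in this bin
        obs = float(y[mask].mean())      # observed frequency in this bin
        rows.append((mean_p, obs, n))
        ece += (n / len(p)) * abs(mean_p - obs)

    brier = float(np.mean((p - y) ** 2))  # mean squared error of the probabilities
    return rows, brier, ece
```

Usage would look like `rows, brier, ece = calibration_report(model_probs, outcomes)` on held-out predictions, where `model_probs` and `outcomes` are hypothetical arrays of stated probabilities and realized binary results.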
Take 1,000 model predictions and consider the bin with stated probability between 0.6 and 0.7. Suppose the average stated probability in that bin is 0.65. If the observed frequency in that bin is 0.50, the model is overconfident in this region by 15 percentage points. A reliability diagram makes the deviation visible across all bins at once.
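A quick way to draw that diagram, assuming matplotlib is available. The per-bin points below are made-up numbers that mirror the example (0.65 stated vs 0.50 observed in one bin), not real model output; in practice they would come from something like the `calibration_report` sketch above.

```python
import matplotlib.pyplot as plt

# Illustrative per-bin points: (mean stated probability, observed frequency).
bins = [(0.35, 0.36), (0.45, 0.44), (0.55, 0.51), (0.65, 0.50), (0.75, 0.62)]
stated = [b[0] for b in bins]
observed = [b[1] for b in bins]

plt.plot([0, 1], [0, 1], linestyle="--", label="perfect calibration")  # 45-degree line
plt.scatter(stated, observed, label="model bins")
plt.xlabel("stated probability")
plt.ylabel("observed frequency")
plt.legend()
plt.show()
```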
- Reporting accuracy without reporting calibration. A model that is 70% accurate but systematically overconfident at the tails is still a bad bet; the headline number hides the failure mode.
- Calibrating in-sample. Calibration must be measured on held-out test data; in-sample calibration is meaningless because the model has seen the labels.
- Treating calibration as binary. Models are calibrated to varying degrees across different probability ranges. A model can be well calibrated at 0.4-0.6 and badly calibrated at the tails.
What is the difference between accuracy and calibration?
Accuracy measures how often the model's top-1 prediction is correct. Calibration measures whether the probabilities the model assigns match the actual frequencies of outcomes. A model that always says '60% chance of A' for an event that occurs 50% of the time achieves 50% top-1 accuracy however you grade it, but it is poorly calibrated: it claims 60% and delivers 50%. For betting, calibration is dominant because it controls every downstream EV calculation.
How is calibration improved if a model is found to be miscalibrated?
Post-hoc calibration methods like Platt scaling (logistic regression on the model output) or isotonic regression learn a monotonic transform from raw model probabilities to calibrated probabilities, fit on a held-out calibration set. This is standard practice in production ML systems for sports prediction. The transform is then applied to all live predictions.
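A hedged sketch of both transforms using scikit-learn, which provides `IsotonicRegression` and `LogisticRegression`. The synthetic "overconfident model" below is only for illustration; in a real system the calibration set would be held-out predictions and outcomes the model never trained on.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Synthetic held-out calibration set: raw probabilities pushed toward the
# extremes relative to the true ones, i.e. an overconfident model.
true_p = rng.uniform(0.05, 0.95, size=5000)
outcomes = (rng.uniform(size=true_p.size) < true_p).astype(int)
raw_probs = np.clip(true_p + 0.6 * (true_p - 0.5), 0.01, 0.99)

# Isotonic regression: learns a monotonic map from raw to calibrated probability.
iso = IsotonicRegression(out_of_bounds="clip")
iso.fit(raw_probs, outcomes)

# Platt scaling: logistic regression on the raw score (here, the log-odds).
log_odds = np.log(raw_probs / (1 - raw_probs)).reshape(-1, 1)
platt = LogisticRegression()
platt.fit(log_odds, outcomes)

# At prediction time, apply the learned transform to every live probability.
live_raw = np.array([0.25, 0.55, 0.80])
live_log_odds = np.log(live_raw / (1 - live_raw)).reshape(-1, 1)
print("isotonic:", iso.predict(live_raw))
print("platt:   ", platt.predict_proba(live_log_odds)[:, 1])
```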
Can a model be too confident or not confident enough?
Yes, both happen. Models trained with cross-entropy loss often become overconfident, pushing probabilities toward 0 or 1, especially under class imbalance. Averaging probabilities across an ensemble can pull them toward the middle and leave the combined model underconfident. Calibration assessment catches both failure modes.
Why is calibration central to HVP?
Because every win-rate or ROI claim implicitly assumes the model's probability outputs correspond to real frequencies. If they do not, headline numbers are an artifact of bin selection rather than a real edge. HVP rule 2 (Beta-Binomial CI lower bound) and rule 5 (empirical CLV haircut) are both corrections for the gap between model-stated and actually-realized probabilities.
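As an illustration of the rule-2 idea, assuming it amounts to reporting the lower bound of a Beta posterior credible interval on the observed win rate rather than the raw point estimate; the uniform prior and 95% level below are illustrative assumptions, not the actual HVP parameters.

```python
from scipy.stats import beta

def win_rate_lower_bound(wins: int, losses: int, conf: float = 0.95,
                         prior_a: float = 1.0, prior_b: float = 1.0) -> float:
    """Lower bound of the Beta posterior credible interval on the true win rate.

    With a Beta(prior_a, prior_b) prior and a binomial likelihood, the posterior
    is Beta(prior_a + wins, prior_b + losses); report its (1 - conf) quantile
    instead of the raw win rate wins / (wins + losses)."""
    return float(beta.ppf(1.0 - conf, prior_a + wins, prior_b + losses))

# 60 wins in 100 bets: the raw 60% headline shrinks to roughly 0.52
# with a uniform prior, a more honest claim about the realized win rate.
print(win_rate_lower_bound(60, 40))
```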