Glossary/Basketball Methodology

RAPM

Also called Regularized Adjusted Plus-Minus

A regression-based estimate of each basketball player's per-possession impact on team scoring margin, regularized to handle the small-sample collinearity that makes raw adjusted plus-minus unstable. The methodological backbone of modern basketball player evaluation.

Definition

Regularized Adjusted Plus-Minus (RAPM) estimates each player's contribution to net scoring margin per possession by regressing possession-level scoring margin on indicator variables for each player on the court. Raw Adjusted Plus-Minus (APM) suffers from severe multicollinearity (teammates share court time, so their indicators are correlated) and small-sample noise (most player coefficients are estimated from limited possessions). RAPM applies ridge regression or Bayesian shrinkage toward zero, which trades a small amount of bias for a large reduction in variance and produces stable, comparable player coefficients across seasons. RAPM is the input layer for most modern basketball player-evaluation frameworks; the major public composites (RAPTOR, EPM, LEBRON) all build on RAPM-style estimates.

Why It Matters
How to Compute
y = X·β + ε, where y is possession-level scoring margin, X is the player-on-court indicator matrix, β is the vector of player impact coefficients. Solve β = (Xᵀ·X + λ·I)⁻¹·Xᵀ·y for ridge regression with regularization strength λ.

Construct a possession-level dataset where each row is one possession with the ten players on court and the points scored. Build a sparse indicator matrix X (rows = possessions, columns = players, entries = +1 for offensive players, -1 for defensive players). Fit ridge regression of possession margin y on X with a regularization strength λ chosen by cross-validation. The resulting coefficients β are each player's regularized impact per possession. Convert to per-100-possession units for league comparability.

Example

For an NBA player with 1,500 offensive possessions and 1,400 defensive possessions across a season, ridge regression with cross-validated λ might produce a coefficient of +3.4 per 100 possessions on offense and -1.1 per 100 on defense, for a net +2.3 impact. A 95% credible interval would be reported alongside the point estimate, derived either analytically (for ridge) or via Bayesian shrinkage with posterior sampling.

Common Mistakes
Frequently Asked

How is RAPM different from box-score composites like PER or BPM?

Box-score composites are weighted sums of measurable per-game statistics (points, rebounds, assists, etc.) and do not isolate a player's impact from their teammates' and opponents'. RAPM is regression-based and explicitly conditions on the other nine players on court. Composites are easier to compute and interpret; RAPM is harder but reflects what a player actually contributes to scoring margin.

How many possessions are needed for a stable RAPM estimate?

Single-season RAPM is noisy for most players; multi-season RAPM (typically 2-3 seasons pooled) is the standard for reliable estimates. Players with low minutes have wider credible intervals; honest RAPM reporting includes the interval, not just the point estimate.

How does RAPM compose with scheme fit?

RAPM gives a player's average impact across the schemes they've played in. Scheme fit estimates how that impact would change under a specific destination's scheme. The two are complementary: RAPM is the league-average input; scheme fit is the destination-conditioned projection.

See Also