A self-updating rating that needs nothing but match results. It is the transparent floor every other model on this site must clear, and the history state the richer models build on.
0.1938
OOS RPS · expanding
the headline skill (realism protocol)
0.1937
OOS RPS · LOTO
optimistic ceiling (leaks future folds)
#7
Leaderboard rank
of 7 · CI 0.1546–0.2281
§ 01
The intuition
In plain English, before any mathematics.
A self-updating strength rating: a team gains points for beating a stronger opponent and loses them otherwise, scaled by margin of victory and match importance. It is the transparent, reproducible floor every other model must clear.
§ 02
Mathematical specification
The exact scheme, from the expected score to a W/D/A forecast. Ratings are recomputed from all 49,445 internationals, so a rating exists for any team on any date — built strictly from matches played before that date.
Expected result (logistic on the rating gap) and the rating update
01The expected score
Two ratings give an expected result through a logistic curve on the rating gap. The 400-point scale fixes the units: a 400-point gap makes the stronger side a 10-to-1 favourite on expected score. The 100-point home-field bonus is added only when a side genuinely plays in its own country — neutral-venue matches, the bulk of tournament football, get none.
Ehome=1+10−(Ri−Rj+H1home)/4001,H=100
Expected score — logistic in the rating gap, base 10, scale 400
02The update
After every match the rating moves in proportion to the surprise — the gap between what happened and what the ratings expected. The step size K scales with match importance; the multiplier G lets the margin of victory speak.
Ri′=Ri+KG(δ)(W−Ehome),W∈{1,21,0}
The rating update after every match
03Margin of victory
G is flat for one-goal results, jumps for two, then grows gently with the rout — a big win moves the rating more than a narrow one, but not without limit.
G(δ)=⎩⎨⎧123811+∣δ∣∣δ∣≤1∣δ∣=2∣δ∣≥3
The margin-of-victory multiplier on the goal difference δ
04Match importance
K follows the eloratings.net importance tiers: a World Cup match moves ratings three times as far as a friendly, with qualifiers and continental finals in between.
K=⎩⎨⎧6050403020World Cup finalscontinental finalsqualifiers ⋅ Nations Leagueother tournamentsfriendlies
The importance-weighted step size
05From rating to forecast
A rating gap alone gives an expected score, not a draw probability. The Elo-only forecast — the floor on the leaderboard — adds a two-parameter draw curve: the draw probability decays as the gap grows, and the remaining mass is split by the logistic expected score. Both parameters are calibrated by minimising RPS on recent internationals.
OOS RPS — expanding (headline) and LOTO (optimistic)
On the headline expanding window this model scores 0.1938 — 0.0000 versus the Elo floor (0.1938) and +0.0032 versus the market ceiling (0.1905).
Elo floor 0.1938market 0.1905
Expanding0.1938
LOTO0.1937
Bar fills to the model’s RPS on the floor–ceiling axis; the whisker on the expanding bar is the conservative 95% CI (0.1546–0.2281). Lower (left) is better.
Its match-RPS sits at the floor; the gap to the market is small and — at n = 3 — inside the bootstrap interval.
Source · Oxford Football Forecasting model · Bookmaker consensus (de-vigged closing odds) · 152 matches · 3 tournaments
§ 06
Strengths & limits
What this model is good for — and where it is weak.
Strengths
Fully reproducible from results alone
No squad/market data needed — works for every nation