How Elo Works

Understanding the rating system behind F1 Elo

What is Elo?

The Elo rating system, originally developed for chess by Arpad Elo, is a method for calculating the relative skill of players. Every driver starts at a rating of 2000. After each race, ratings are adjusted based on finishing position — beating a higher-rated driver gains you more points than beating a lower-rated one.

How It Works in F1

Each race is treated as a series of head-to-head matchups between every pair of drivers. For each pairing:

  • The driver who finishes ahead "wins" the matchup
  • An expected score is calculated based on the rating difference: E = 1 / (1 + 10^((opponent_rating - player_rating) / 400))
  • The rating adjustment is: K × (actual_score - expected_score)
  • Base K-factor is 48, scaled by season length (12.0 reference races) and normalized by √(grid size − 1)
  • At each season's end, ratings regress 3% toward the starting Elo of 2000

A driver's total rating change for a race is the sum of adjustments across all opponents. With 20 drivers on the grid, that's 19 head-to-head matchups per race.

Why Elo for F1?

Unlike championship points, Elo ratings account for the strength of the field. Winning against a grid of highly-rated drivers is worth more than winning against weaker competition. This makes it possible to compare drivers across different eras — a 2600-rated driver in 1955 demonstrated the same level of dominance as a 2600-rated driver in 2024.

Across 1149 races and 864 drivers in F1 history, the Elo system produces a natural distribution of skill tiers.

Elo Tiers

Elite 2600+

The all-time greats. Drivers who achieved sustained dominance against the strongest competition. Only 6 drivers (0.7%) have reached this level.

World Class 2450–2599

Championship contenders and race winners who consistently performed at the highest level. 36 drivers (4.2%).

Strong 2300–2449

Solid midfield performers and occasional podium finishers. Competitive drivers who proved themselves against strong fields. 84 drivers (9.7%).

Average 2100–2299

Reliable competitors who carved out respectable F1 careers without reaching the top step. 247 drivers (28.6%).

Developing Below 2100

The largest group — short careers, backmarkers, or drivers who never had the machinery to climb higher. Many only raced a handful of grands prix. 491 drivers (56.8%).

Worked Example

Let's walk through a simplified 4-driver race to see exactly how the math works.

Step 1: Set up the race

P1 — Verstappen 2650
P2 — Hamilton 2520
P3 — Norris 2480
P4 — Alonso 2200

Step 2: Calculate K per pair

The K-factor for each head-to-head matchup is adjusted for season length and grid size:

Kpair = Kbase × (Reference Races ÷ Season Races) ÷ √(n − 1)
= 48 × (12 ÷ 24) ÷ √(4 − 1)
= 13.86 per matchup

Fewer season races → higher K (each race matters more). Larger grids → lower K per pair (more matchups to sum over).

Step 3: Head-to-head matchups

Every pair of drivers is compared. The higher finisher "wins" the matchup. Here are all 6 pairings:

Matchup Winner Expected Surprise Winner Δ Loser Δ
Verstappen vs Hamilton Verstappen 67.9% 32.1% +4.45 -4.45
Verstappen vs Norris Verstappen 72.7% 27.3% +3.79 -3.79
Verstappen vs Alonso Verstappen 93.0% 7.0% +0.97 -0.97
Hamilton vs Norris Hamilton 55.7% 44.3% +6.13 -6.13
Hamilton vs Alonso Hamilton 86.3% 13.7% +1.9 -1.9
Norris vs Alonso Norris 83.4% 16.6% +2.3 -2.3

Expected = probability the higher-rated driver wins (based on rating gap). Surprise = how unexpected the result was. Beating a much stronger opponent = high surprise = big gain.

Step 4: Sum all adjustments

Each driver's total change is the sum of their gains/losses across all matchups:

Verstappen P1
Before 2650
Change +9.2
After 2659.2
Hamilton P2
Before 2520
Change +3.6
After 2523.6
Norris P3
Before 2480
Change -7.6
After 2472.4
Alonso P4
Before 2200
Change -5.2
After 2194.8

Step 5: Season-end regression

At the end of each season, every driver's rating regresses 3% toward the starting Elo of 2000. This prevents ratings from inflating forever and gives returning drivers a slight pull toward the mean:

New Elo = Current × 0.97 + 2000 × 0.03
e.g. Verstappen: 2650 × 0.97 + 2000 × 0.03 = 2630.5 (19.5 toward mean)

Key takeaways

  • Zero-sum per matchup — the winner's gain exactly equals the loser's loss
  • Upsets pay more — a low-rated driver beating a high-rated one gains big; the expected winner gains little
  • Every position matters — even P10 vs P11 creates a rating adjustment
  • Scale with the grid — with 20 drivers, each race has 190 matchups, producing meaningful Elo swings

When an Underdog Wins

Now imagine the same four drivers, but this time Alonso — rated 450 points below Verstappen — wins the race. Verstappen, the highest-rated driver, finishes last. This is where Elo gets interesting.

P1 — Alonso 2200
P2 — Norris 2480
P3 — Hamilton 2520
P4 — Verstappen 2650

Alonso's expected win probability against Verstappen is just 7.0%. By winning that matchup, the surprise factor is enormous — and the Elo reward reflects it. Meanwhile Verstappen, expected to beat everyone comfortably, loses all three matchups and pays a steep price.

Matchup Winner Expected Surprise Winner Δ Loser Δ
Alonso vs Norris Alonso 16.6% 83.4% +11.55 -11.55
Alonso vs Hamilton Alonso 13.7% 86.3% +11.96 -11.96
Alonso vs Verstappen Alonso 7.0% 93.0% +12.89 -12.89
Norris vs Hamilton Norris 44.3% 55.7% +7.72 -7.72
Norris vs Verstappen Norris 27.3% 72.7% +10.07 -10.07
Hamilton vs Verstappen Hamilton 32.1% 67.9% +9.41 -9.41

Compare the outcomes

Notice how the same drivers with the same starting Elos produce dramatically different results depending on who finishes where. Alonso's upset win earns him nearly as much as Verstappen loses — the system heavily rewards beating opponents you're not expected to beat.

Alonso P1
Before 2200
Change +36.4
After 2236.4
vs expected finish +41.6
Norris P2
Before 2480
Change +6.2
After 2486.2
vs expected finish +13.9
Hamilton P3
Before 2520
Change -10.3
After 2509.7
vs expected finish -13.9
Verstappen P4
Before 2650
Change -32.4
After 2617.6
vs expected finish -41.6

"vs expected finish" shows the difference compared to the first example, where everyone finished in Elo order. Alonso swings by 41.6 Elo compared to finishing last, while Verstappen's P4 costs him 41.6 more than his expected P1 win.

Elo In Action

Baku City Circuit — 2025 (Round 17)

Average field Elo: 2171 · Grid size: 18

C.Sainz gained +48.3 by finishing P3 against a field averaging 2171 Elo.

O.Piastri dropped -85.7 after finishing P20 — underperforming relative to their 2528 Elo.

# Driver Elo Before Elo After Change
1 M.Verstappen Red Bull 2443 2462 +19.6
2 G.Russell Mercedes 2398 2416 +17.6
3 C.Sainz Williams 2075 2123 +48.3
4 A.Antonelli Mercedes 2038 2085 +47.0
5 L.Lawson RB F1 Team 1989 2036 +46.9
6 Y.Tsunoda Red Bull 2044 2078 +34.6
7 L.Norris McLaren 2474 2456 -17.8
8 L.Hamilton Ferrari 2235 2234 -0.3
9 C.Leclerc Ferrari 2314 2299 -14.9
10 I.Hadjar RB F1 Team 2100 2105 +4.4
11 G.Bortoleto Sauber 1985 1997 +12.4
12 O.Bearman Haas F1 Team 2027 2029 +1.7
13 A.Albon Williams 2169 2147 -21.5
14 E.Ocon Haas F1 Team 2014 2005 -8.4
15 F.Alonso Aston Martin 2074 2053 -21.5
16 N.Hülkenberg Sauber 2089 2060 -29.0
17 L.Stroll Aston Martin 2077 2044 -33.4
20 O.Piastri McLaren 2528 2442 -85.7

Limitations

Elo is a powerful tool for measuring relative performance, but it has inherent blind spots — especially when applied to a sport as complex as Formula 1. Keep these caveats in mind when interpreting ratings.

Car performance is invisible

Elo treats every result as a reflection of driver skill. In reality, machinery plays a massive role. A dominant car inflates its driver's rating; a poor car suppresses it. A driver switching from a backmarker to a frontrunner will see a sharp Elo rise that's partly car, not skill. Head-to-head teammate comparisons (which Elo captures well) help, but inter-team matchups always blend driver ability with engineering.

DNFs and mechanical failures

Elo penalises any finishing position — including a retirement on lap 1. A driver who crashes out and one whose engine fails are treated identically: both lose Elo from head-to-head matchups against every driver who finished ahead. This means reliability-plagued seasons can drag a rating down through no fault of the driver.

Team orders and strategic results

When a driver yields a position on team orders, Elo records the result at face value. A number-two driver told to hold station will accumulate a lower rating than their raw pace might deserve. Elo has no way to distinguish a genuine on-track battle from a choreographed swap.

Grid size and era context

A 20-car grid generates fewer head-to-head matchups per race than a 26-car grid. Our K-factor scales with grid size to compensate, but smaller fields still mean fewer data points per event. Additionally, the talent depth of the grid varies across decades — dominating a shallow field and dominating a deep one may look the same in raw Elo terms.

Cross-era comparison nuance

The system is designed so that a given Elo value represents the same level of dominance relative to contemporaries regardless of era. However, it cannot tell you whether a 2600-rated driver from the 1950s would beat a 2600-rated driver from the 2020s in the same car — it only says both dominated their respective fields to a similar degree.

What Elo doesn't capture

Qualifying pace, race craft in wheel-to-wheel battles, tyre management, wet-weather ability, and development feedback are all crucial driver skills that Elo compresses into a single number. Sprint races and qualifying results are also not factored into ratings — only the main Grand Prix finishing order matters.