Performance rating in chess
Performance rating
Definition
Performance rating (often abbreviated TPR for “Tournament Performance Rating”) is the rating level implied by a player’s results against a set of rated opponents, typically over a single event. It answers the question: “If this event were the only evidence we had, what rating would best explain this score against those opponents?” Performance rating is not a permanent rating; it is an event-based statistic derived from the ratings of your opponents and your score against them.
How it’s used in chess
- Event summaries: Organizers and broadcasters list TPR to show how strongly a player performed relative to the field.
- Title norms: FIDE uses performance rating thresholds (e.g., 2600+ for a GM norm, alongside other requirements) when verifying norms. See also Norm.
- Self-assessment: Players gauge form or strength in a particular tournament independent of their current published Elo_rating.
- Historical context: Exceptional events are often described by their TPR (e.g., “near 3000 performance”).
Calculation (FIDE-standard approach)
Let N be the number of rated games in the event, S your total points, and Ro the average rating of your opponents (with some norm-specific adjustments; see Notes below). The performance rating Rp is chosen so that your expected score versus Ro matches your actual score S.
- For 0 < S < N:
- Rp = Ro + 400 × log10(S / (N − S))
- Boundary cases (FIDE convention for norms and event summaries):
- If S = N (a perfect score), Rp = Ro + 400.
- If S = 0, Rp = Ro − 400.
- Quick intuition:
- 50% vs an average field ⇒ Rp ≈ Ro.
- More than 50% ⇒ Rp above Ro; less than 50% ⇒ below.
Notes and practical details:
- For FIDE title norms, very large rating gaps are capped: when computing Ro, an opponent more than 400 points below you is treated as “your rating minus 400.” This avoids inflating TPR by stacking many much-lower-rated opponents.
- Unrated players and forfeits have special handling in norm calculations; consult the current FIDE Handbook for exact procedures.
Quick reference: score percentage to rating difference
Against an opponent pool averaging Ro:
- 60% score ⇒ about Ro + 70
- 65% ⇒ about Ro + 108
- 66.7% (2/3) ⇒ about Ro + 120
- 70% ⇒ about Ro + 147
- 75% ⇒ about Ro + 191
- 80% ⇒ about Ro + 241
- 85% ⇒ about Ro + 301
- 90% ⇒ about Ro + 382
FIDE caps extremes at ±400 for practical reporting and norm checks.
Worked examples
- Balanced result:
- You score 5/10 against opponents averaging 2400. Since S/N = 50%, Rp = 2400.
- Strong plus score:
- Opponents average Ro = 2250. You score S = 6/9.
- S / (N − S) = 6 / 3 = 2 ⇒ 400 × log10(2) ≈ 120.
- Rp ≈ 2250 + 120 = 2370.
- Small open tournament snapshot:
- Opponents’ ratings: 2100, 2150, 2200, 2250, 2300 ⇒ Ro = 2200.
- Your score S = 3.5/5 (e.g., wins in rounds 1, 2, 4; draw in round 3; loss in round 5).
- S / (N − S) = 3.5 / 1.5 ≈ 2.333 ⇒ 400 × log10(2.333) ≈ 147.
- Rp ≈ 2200 + 147 = 2347.
- Perfect score:
- Opponents average 2200 and you score 5/5 ⇒ by convention, Rp = 2200 + 400 = 2600.
Historical and strategic significance
- Benchmarking greatness: Exceptional events are often remembered by their TPR. At Sinquefield Cup 2014, Fabiano Caruana scored 8.5/10 against a super-elite field (average ~2802), yielding a widely cited performance rating around 3097 for the event.
- Classic dominance: Anatoly Karpov’s 11/13 at Linares 1994 produced a TPR close to 2985—one of the most celebrated single-event performances of the classical era.
- Impact on norms: A TPR at or above a norm threshold (e.g., 2600 for GM) signals that, combined with opponent and round requirements, a title norm may be achieved. However, TPR alone does not guarantee a norm; composition rules also apply.
- Preparation insight: Coaches use TPR splits across phases (classical/rapid/blitz) or opponent categories to assess where results outpace or lag behind current rating.
Common pitfalls and caveats
- Small sample size: A few games can yield volatile TPRs—great for headlines, but not a stable indicator of long-term strength.
- Distribution matters: The average Ro hides whether points came vs much higher- or lower-rated players; rating change is calculated game-by-game, not from TPR.
- Different formulas exist: Some federations and online platforms use approximations, such as Rp ≈ Ro + (400 × (wins − losses)) / N. This is convenient but can differ from the FIDE logistic-based method, especially for extreme scores.
- Not your new rating: TPR is descriptive of an event; your published rating updates via the rating system’s per-game formulas and K-factors.
Interesting facts
- Terminology: You’ll see TPR, PR, “event performance,” and “tournament performance rating” used interchangeably. FIDE documents often use Rp (performance rating) and Ro (opponents’ average rating).
- Caps vs reality: The logistic model implies that scoring 90% corresponds to roughly a 382-point rating advantage; FIDE’s ±400 cap ensures perfect scores don’t yield “infinite” performance ratings in reports.
- Headline numbers: During Caruana’s 7/7 start at Sinquefield Cup 2014, some live trackers (using uncapped or approximate formulas) briefly displayed astronomical TPRs far above 3200; with FIDE’s ±400 cap, a perfect score vs ~2800 opposition tops out near 3200.