Model Transparency

We believe predictions are only valuable when you can trust them. This page shows how well-calibrated our AI models are: when we say 70%, it should happen roughly 70% of the time.

What Is Calibration?

A well-calibrated model produces probabilities that match real-world frequencies. For example, if our model assigns a 60% win probability to 100 different matches, roughly 60 of those matches should end as predicted. The reliability diagrams below plot predicted probabilities against observed outcomes on a held-out test set that the models never saw during training.

How to read the charts: In each chart, the dashed diagonal line represents perfect calibration. Points close to the diagonal indicate reliable probabilities. The bar chart below each curve shows how many predictions fall in each probability range, giving context on where the model is most confident.
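The points of such a chart can be computed directly from predictions and outcomes. A minimal sketch of the binning step (the function name and the ten-bin layout are illustrative, not our production code):

```python
import numpy as np

def reliability_table(probs, outcomes, n_bins=10):
    """For each probability bin, pair the mean predicted probability with
    the observed outcome frequency -- the points of a reliability diagram."""
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Assign each prediction to a bin; clip so p == 1.0 lands in the last bin.
    idx = np.clip(np.digitize(probs, edges) - 1, 0, n_bins - 1)
    return [(float(probs[idx == b].mean()),     # x: mean predicted probability
             float(outcomes[idx == b].mean()),  # y: observed frequency
             int((idx == b).sum()))             # bar chart: bin count
            for b in range(n_bins) if (idx == b).any()]
```

A perfectly calibrated bin produces a point on the diagonal: ten predictions of 60% with six positive outcomes yield the point (0.6, 0.6).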

Example: What Good and Bad Calibration Looks Like

An illustrative example of good and bad calibration. The green point is perfectly calibrated: the model predicted 60% and the event occurred 60% of the time. The red point is overconfident: it predicted 80% but events occurred only 55% of the time. The orange point is underconfident: it predicted 30% but events occurred 50% of the time.

The calibration results shown below are based on our latest deployed models. We continuously train, evaluate, and release new model versions; each release is validated on a held-out test set before deployment. The charts on this page always reflect the models currently serving predictions on the platform.

Our NBA prediction system uses neural networks trained on over 40,000 historical games. We predict match winners, total points, point spreads, and individual player statistics including points, rebounds, assists, three-pointers made, steals, and blocks.

Evaluated on 2024-2025 season test set

Match Winner

Binary classifier predicting the probability of each team winning. The model processes ELO ratings, team statistics, injury data, and starting lineup information.

Reliability diagram for the Match Winner model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.
Number of predictions in each probability bin. More samples in a bin means the calibration measurement there is more reliable.

Total Points

Regression model generating a full probability distribution over possible total points scored. Provides expected value and confidence intervals.
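To illustrate how a full distribution yields an expected value, confidence intervals, and over/under probabilities, here is a sketch using a made-up bell-shaped distribution over totals (the support, mean, and spread are invented for the example and do not come from our models):

```python
import numpy as np

# Hypothetical discretised probability mass function over total points.
support = np.arange(180, 261)                       # possible total points
pmf = np.exp(-0.5 * ((support - 221) / 12.0) ** 2)  # bell-shaped sketch
pmf /= pmf.sum()                                    # normalise to a distribution

expected_total = float(np.dot(support, pmf))        # expected value
cdf = np.cumsum(pmf)
lower = support[np.searchsorted(cdf, 0.05)]         # 5th percentile
upper = support[np.searchsorted(cdf, 0.95)]         # 95th percentile -> 90% interval
p_over = float(pmf[support > 220.5].sum())          # P(total > 220.5)
```

The same distribution therefore answers every question about the total at once: its mean, any prediction interval, and the probability of clearing any line.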

Reliability diagram for the Total Points model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Point Spread

Regression model predicting the point differential between teams, with a complete probability distribution for spread outcomes.

Reliability diagram for the Point Spread model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Player Points

Per-player points projection with full probability distribution.

Reliability diagram for the Player Points model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Player Rebounds

Per-player rebounds projection with full probability distribution.

Reliability diagram for the Player Rebounds model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Player Assists

Per-player assists projection with full probability distribution.

Reliability diagram for the Player Assists model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Player 3-Pointers Made

Per-player three-pointers made projection with full probability distribution.

Reliability diagram for the Player 3-Pointers Made model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Our football models cover five major European leagues: Serie A, La Liga, Bundesliga, Premier League, and Ligue 1. We predict match results, under/over 2.5 goals, goal/no goal, expected corners, and expected shots.

Evaluated on 2024-2025 season test set

Match Result (1X2)

Multiclass classifier predicting home win, draw, and away win probabilities. Calibration is shown per class: Home Win, Draw, and Away Win.
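Per-class calibration can be computed one-vs-rest: each class's predicted probabilities are compared with a binary indicator for that class. A minimal sketch (function name and ten-bin layout are illustrative):

```python
import numpy as np

def per_class_calibration(probs, labels, n_bins=10):
    """One-vs-rest reliability tables for a multiclass forecaster.
    probs: (n, 3) array of [home, draw, away] probabilities;
    labels: integer class indices (0=home, 1=draw, 2=away)."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    tables = []
    for k in range(probs.shape[1]):
        p = probs[:, k]
        y = (labels == k).astype(float)  # did class k actually occur?
        idx = np.clip(np.digitize(p, edges) - 1, 0, n_bins - 1)
        tables.append([(float(p[idx == b].mean()),
                        float(y[idx == b].mean()),
                        int((idx == b).sum()))
                       for b in range(n_bins) if (idx == b).any()])
    return tables
```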

Reliability diagram for the Match Result (1X2) model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.
Number of predictions in each probability bin. More samples in a bin means the calibration measurement there is more reliable.

Under/Over 2.5 Goals

Binary classifier predicting the probability of under or over 2.5 total goals in a match.

Reliability diagram for the Under/Over 2.5 Goals model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.
Number of predictions in each probability bin. More samples in a bin means the calibration measurement there is more reliable.

Goal / No Goal

Binary classifier predicting whether both teams will score at least one goal.

Reliability diagram for the Goal / No Goal model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.
Number of predictions in each probability bin. More samples in a bin means the calibration measurement there is more reliable.

Expected Corners

Poisson regression model generating a full probability distribution for the total number of corner kicks in a match.
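For intuition: a Poisson distribution with mean lam assigns probability lam^k * e^(-lam) / k! to a count of k, so over/under probabilities for any corner line follow directly from the predicted mean. A sketch with an invented mean (the deployed model's per-match mean differs):

```python
from math import exp, factorial

def poisson_pmf(k, lam):
    """P(N = k) for a Poisson-distributed count with mean lam."""
    return lam ** k * exp(-lam) / factorial(k)

lam = 10.2    # hypothetical expected corner count for one match
line = 10.5
# P(corners > 10.5) = 1 - P(corners <= 10)
p_over_line = 1.0 - sum(poisson_pmf(k, lam) for k in range(int(line) + 1))
```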

Reliability diagram for the Expected Corners model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Expected Shots

Poisson regression model generating a full probability distribution for the total number of shots in a match.

Reliability diagram for the Expected Shots model (2024-2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Our tennis prediction system covers the full ATP Tour, from Grand Slams to ATP 250 events. Models are trained on 30,000+ ATP matches and leverage surface-specific indices, ELO/Glicko-2 ratings, and over 500 features.

Evaluated on 2025 season test set

Match Winner

Binary classifier predicting the probability of each player winning. The model incorporates surface-specific form indices, ELO ratings, Glicko-2 ratings, and head-to-head statistics.

Reliability diagram for the Match Winner model (2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.
Number of predictions in each probability bin. More samples in a bin means the calibration measurement there is more reliable.

Total Games

Regression model predicting the total number of games played in a match, with a full probability distribution.

Reliability diagram for the Total Games model (2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Game Spread

Regression model predicting the game differential between players.

Reliability diagram for the Game Spread model (2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Aces

Regression model predicting total aces in a match with full probability distribution.

Reliability diagram for the Aces model (2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Double Faults

Regression model predicting total double faults with full probability distribution.

Reliability diagram for the Double Faults model (2025 season). Points closer to the dashed diagonal indicate better-calibrated probabilities.

Understanding the Metrics

ECE (Expected Calibration Error)

The weighted average gap between predicted probabilities and actual outcomes across all bins. Lower is better. Values below 0.05 indicate good calibration.
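A minimal sketch of how ECE can be computed from binned predictions (ten equal-width bins are assumed here; binning choices vary):

```python
import numpy as np

def expected_calibration_error(probs, outcomes, n_bins=10):
    """Weighted mean |predicted - observed| over probability bins."""
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    idx = np.clip(np.digitize(probs, edges) - 1, 0, n_bins - 1)
    ece = 0.0
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            weight = mask.mean()  # fraction of all samples in this bin
            gap = abs(probs[mask].mean() - outcomes[mask].mean())
            ece += weight * gap
    return float(ece)
```

A perfectly calibrated model scores 0; a model that says 80% when events happen 50% of the time scores 0.30.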

Brier Score

Measures both calibration and sharpness of probability forecasts. Ranges from 0 (perfect) to 1 (worst). Lower is better.
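For binary outcomes the Brier score is simply the mean squared difference between the forecast probability and the 0/1 outcome:

```python
import numpy as np

def brier_score(probs, outcomes):
    """Mean squared error of probability forecasts against 0/1 outcomes."""
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    return float(np.mean((probs - outcomes) ** 2))
```

A forecaster that always says 50% scores 0.25 regardless of outcomes, which is why sharpness (confident, correct predictions) drives the score below that baseline.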

Calibration MAE

Mean Absolute Error between predicted and observed probabilities across quantile thresholds. Used for regression models. Lower values indicate better calibration.
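One common way to compute this for a distributional forecaster: for each nominal quantile level, count how often the actual outcome falls at or below the predicted quantile, then average the absolute gaps between nominal and observed levels. A sketch (the array layout is illustrative):

```python
import numpy as np

def calibration_mae(quantile_preds, actuals, taus):
    """Mean absolute gap between nominal quantile levels `taus` and the
    observed fraction of actuals falling below each predicted quantile.
    quantile_preds: (n_samples, len(taus)) predicted quantiles."""
    quantile_preds = np.asarray(quantile_preds, dtype=float)
    actuals = np.asarray(actuals, dtype=float)
    # observed[j] = fraction of outcomes at or below the tau_j quantile
    observed = (actuals[:, None] <= quantile_preds).mean(axis=0)
    return float(np.abs(observed - np.asarray(taus)).mean())
```

If the median prediction really splits outcomes 50/50, the gap at tau = 0.5 is zero; systematic bias in the predicted quantiles inflates the MAE.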

90% Coverage

The fraction of actual outcomes that fall within the model's 90% prediction interval. A well-calibrated model should achieve approximately 90% coverage.
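Checking coverage is a one-line computation once per-match interval bounds are available (the names here are illustrative):

```python
import numpy as np

def interval_coverage(lower, upper, actuals):
    """Fraction of actual outcomes that fall inside [lower, upper]."""
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    actuals = np.asarray(actuals, dtype=float)
    return float(np.mean((actuals >= lower) & (actuals <= upper)))
```

Coverage well below 90% means the intervals are too narrow (overconfident); coverage well above means they are too wide (underconfident).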

See Our Models in Action

Explore today's predictions powered by these calibrated models.