The headline numbers
Across the major European leagues, well-built AI football predictors land in these accuracy bands over rolling samples of several hundred fixtures:
| Market | Typical accuracy | Notes |
|---|---|---|
| 1X2 (match result) | 50β55% | Theoretical ceiling ~55-58%. Higher claims = cherry-picking. |
| Over/Under 2.5 goals | 60β65% | Less sensitive to single events than 1X2. |
| Both Teams to Score (BTTS) | 60β65% | Similar dynamics to O/U 2.5. |
| Asian Handicap (closest line) | 50β55% | Sharpest market β model edge vs market is smallest here. |
| Correct Score (any specific) | 10β15% | Large outcome space; high price is paid for low accuracy. |
Bands derived from public model performance across Premier League, La Liga, Bundesliga, Serie A and Ligue 1 over the 2024-25 and 2025-26 seasons. Comparable to academic xG-Poisson literature (Dixon-Coles 1997 baseline + modern extensions).
Why 55% is the ceiling on 1X2
Football is a genuinely high-variance sport. A single deflected shot, a contentious VAR overturn, a red card in the 28th minute β any of these flips the outcome of a fixture an objectively stronger team should win. The probability of these βflipβ events is real, not noise the model is failing to capture. It's the sport's inherent randomness.
The theoretical ceiling for 1X2 accuracy in elite leagues sits around 55-58% even with perfect information about both squads. AI predictors hitting 53-55% aren't bumping against a methodology limit β they're bumping against the variance of the sport itself. Higher claimed accuracy almost always reflects either cherry-picking (only publishing predictions on obvious fixtures where every model would agree), market-aligned predictions (just picking the favourite, which the bookmaker also βpredictsβ correctly 60-65% of the time), or simply mis-stating.
For broader context on how the underlying math works, see our walkthrough of how AI football predictions work and the pillar piece on AI football predictions.
Why βhow accurate is itβ is the wrong question
Asking βhow accurate is the AI?β treats predictions as outcomes (the AI said home win, was it home win?). That's the wrong frame. Predictions are probabilities. The question that matters is whether the probabilities are calibrated β when the AI says 65%, does the outcome happen 65% of the time?
A calibrated 65% probability that is βwrongβ on a specific fixture isn't a model failure. It's the model being right 65% of the time and this happening to be one of the 35% where the underdog wins. The accuracy that matters is across the distribution, not on any single match.
Practically, that means: don't use AI predictions to pick which team will win the next match. Use them to find spots where the model's probability is higher than the bookmaker's implied probability, then bet the size that makes sense given the edge β the Kelly criterion is the standard tool. Over hundreds of bets, the edge compounds. Over five bets, it's indistinguishable from luck.
The right way to measure prediction quality
Raw win rate over a small sample is a noisy measure. Three better metrics:
1. Brier score
Measures how close the predicted probabilities are to the actual outcomes across the full sample. A perfect predictor scores 0; a predictor that just guesses uniformly scores 0.667 on a three-outcome market. Lower is better. The Brier score is the standard academic measure of forecast quality.
2. Log loss
Penalises confident wrong predictions more heavily than uncertain wrong ones. A predictor that says 95% home win and gets it wrong is heavily punished; a predictor that says 40% home win and gets it wrong is mildly punished. Log loss rewards humility and punishes overconfidence.
3. Closing Line Value (CLV)
The most useful metric for bettors. CLV measures whether the predictions identified outcomes that the market subsequently agreed were underpriced β comparing the odds you took against the closing price right before kickoff. Consistent positive CLV over a few hundred bets is the strongest available evidence that a model has edge. Our CLV explainer walks through the full definition.
How to spot mis-stated accuracy claims
Four red flags when an AI prediction service publishes an accuracy number:
- Sample size missing.β75% accuracyβ on 40 fixtures means almost nothing β variance dominates. A credible accuracy claim cites the sample size and ideally publishes the underlying records.
- Market unspecified.Accuracy on 1X2 is much lower than accuracy on over/under 2.5. Citing β75% accuracyβ without saying which market is conflating two very different bands.
- Cherry-picked sample.Some services only publish predictions on a subset of fixtures (e.g. only when the model is βconfidentβ). The visible win rate then reflects the cherry-pick, not the full slate. Look for sites that publish probabilities for every fixture, every week.
- No methodology disclosure.An accuracy claim without a documented methodology is unfalsifiable. The model can be re-tuned post-hoc to fit historical results; the published number then doesn't predict future performance. KiqIQ's full methodology is documented at /methodology.
What KiqIQ publishes β and what we don't claim
KiqIQ publishes probability predictions on every upcoming fixture in the leagues we cover β no cherry-picking, no βonly when confidentβ filtering. The methodology is documented in the open. The model output runs through the same odds-consensus + Poisson + xG stack on every fixture; there is no human curation step that selectively re-rates the probabilities.
We deliberately do not publish a headline single-number accuracy claim like βour AI is 74% accurate.β That number, even when honestly computed, is misleading without the full distribution β and most users do not have time to validate the distribution. Instead, the live probability outputs are visible on every fixture and the methodology page is public, so the analytically curious can audit specific predictions and the broader population can use the probabilities as inputs to their own decisions.
Inspect the predictions yourself
Every fixture on /football carries an AI probability prediction generated by the documented stack. The methodology page explains exactly how. The Poisson and no-vig calculators let you reproduce the math.
For informational and educational purposes only. 18+. Predictions are probability estimates, not guarantees. Only bet what you can afford to lose.