⚽ Odds Model Explanation · 2026 FIFA World Cup
🎯 Model Architecture Overview
Input Layer → 80+ raw features: team attacking/defensive stats, recent results, injuries, historical handicap performance, home/away cover trends, Elo ratings, opening odds.
Feature Engineering → Rolling averages, line movement differentials, home/away splits, Over/Under rolling probabilities, cross-market similarity.
Model Core → XGBoost (primary) + Bayesian linear regression (dynamic odds adjustment) + LightGBM validation, combined via weighted voting.
Output Layer → For each match: "Favorite cover probability", "Underdog cover probability", "Over 2.5 probability", "Under 2.5 probability", and expected value (EV).
🔄 Real-Time Dynamic Updating
✅ 72h before kickoff: baseline probabilities based on opening lines and initial team state.
✅ 24h to 1h before kickoff: captures line movements, betting heat, injury updates every 15 minutes; Bayesian framework adjusts predictions.
✅ Lineup confirmation (15min before): "squad strength re-evaluation module" revises expected goals and cover probability.
✅ World Cup specific factors: host nation boost (+0.15~0.35 goals), knockout stage defensive weight +15%.
📐 Odds Probability Conversion Formula
For a given Asian Handicap R, the model outputs the home team's goal difference distribution. Monte Carlo simulation (3,000 iterations) calculates cover probability P(margin > R). Over/Under predictions combine Poisson distribution with xG models.
🧪 Betting recommendation: only flagged as "value bet" when model probability vs implied probability difference ≥4% and Kelly criterion positive.
🏟️ Team Strength (Static)
✔️ FIFA ranking / Elo rating (dynamic)
✔️ Goals scored/conceded, shot conversion, big chances created/allowed (last 20 official matches)
✔️ Home/away split index (cover rate last 3 years home/away)
✔️ World Cup historical factor (group/knockout performance)
✔️ Squad market value / key player dependency (xG contribution)
📈 Market & Odds Dynamic Features
✔️ Opening vs current odds (1X2, Asian Handicap, Over/Under)
✔️ P&L index / Kelly index / betting percentages (simulated money flow)
✔️ Historical cover rate under same handicap (similar league/home-away/cup context)
✔️ Streak consistency: last 10 matches cover percentage and losing streak
✔️ Over/Under volatility: Over rate, first-half goal %, total goal trend
🌍 World Cup Specific Modeling Dimensions
✅ Inter-continental coefficient: historical stylistic clashes (South America vs Europe, Asia vs Africa).
✅ Weather / jet lag / travel distance quantifiers: fatigue depreciation for long-haul teams.
✅ Referee data: card tendency influences Over/Under and micro handicap adjustments.