V5

V5 Machine Learning Model

XGBoost gradient boosting with comprehensive feature engineering

~240 FeaturesRolling StatsSituational ContextPhysical MatchupsPlay Style Data

Model Training In Progress

V5 models are currently being trained. Showing placeholder data below.

Training Samples

~75K

3 seasons of data

Total Features

239

after encoding

Avg Val MAE

4.8

points prediction

vs Baseline

+14%

improvement

How V5 Works

Feature Engineering

  • Rolling Stats (L5, L10, L15, Season) - Recent form and consistency
  • Situational Context - Opponent defense, pace, rest days
  • Matchup Features - Historical vs team, physical differentials
  • Advanced Stats - Synergy play types, hustle, clutch performance

Model Architecture

  • Algorithm - XGBoost Gradient Boosting
  • Separate Models - One per stat type (pts, reb, ast, etc.)
  • Time-Based Split - Train on historical, validate on recent
  • Early Stopping - Prevents overfitting

Performance by Stat Type

StatVal MAEVal RMSEVal R²Baseline MAEImprovementStatus
pts4.806.400.6505.60+14.3%Strong
reb2.102.800.5802.40+12.5%Good
ast1.802.500.6202.10+14.3%Strong
stl0.700.900.3500.80+12.5%Weak
blk0.600.800.4000.70+14.3%Good
tov1.001.400.4501.20+16.7%Good
fg3m0.901.300.5001.10+18.2%Good

MAE = Mean Absolute Error (avg points off). RMSE = Root Mean Square Error (penalizes large errors). = Coefficient of determination (1.0 = perfect).

V5 vs V3 vs V4

Head-to-head comparison once V5 predictions are generated

V3
Baseline Model
Rolling averages + simple factors
V4
Bayesian Model
Probability distributions + regime detection
V5
ML Model
XGBoost + comprehensive features