Methodologies | Obsidic

What exactly does Obsidic do?

Obsidic is a predictive sports analytics platform. We run machine learning models and thousands of Monte Carlo simulations on every event to generate win probabilities, edge calculations, player prop projections, and full probability distributions, all surfaced in a clean, daily-updated dashboard.

How is this different from other prediction sites?

Most prediction sites give you a number with no methodology behind it. Obsidic is fully transparent: every model's data sources, feature engineering, training process, and backtest results are publicly documented. We show you the model probability alongside the market-implied probability and let you decide.

What sports do you cover?

We currently cover Tennis (ATP and WTA) and MLB (game-level and player props), both with full backtested results. PGA Golf is in development and will launch next. All three sports are accessible from a single dashboard.

What does "backtested" mean and why does it matter?

Backtesting means we test the model on historical data it did not see during training. The model makes predictions, then we score them against actual results. This is the gold standard for validating any predictive model, because it shows how the model performs on events it could not have memorized. Our Tennis model was backtested on 2,381 matches and our MLB model on 2,416 games, all out-of-sample, so there is no hindsight bias.
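To make the idea concrete, here is a minimal sketch of an out-of-sample scoring loop. It is an illustration only, not Obsidic's actual pipeline: the `Prediction` record, the `held_out` helper, the cutoff date, and the choice of accuracy and Brier score as metrics are all assumptions made for the example, and the sample data is made up.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Prediction:
    event_date: date
    model_prob: float  # model's probability that side A wins (hypothetical field)
    a_won: bool        # what actually happened


def held_out(predictions, training_cutoff):
    """Keep only events played after the last date used for training."""
    return [p for p in predictions if p.event_date > training_cutoff]


def accuracy(predictions):
    """Share of events where the side the model favored actually won."""
    hits = sum((p.model_prob >= 0.5) == p.a_won for p in predictions)
    return hits / len(predictions)


def brier_score(predictions):
    """Mean squared error of the probabilities (lower is better)."""
    return sum((p.model_prob - p.a_won) ** 2 for p in predictions) / len(predictions)


# Made-up example data, not Obsidic results.
history = [
    Prediction(date(2023, 6, 1), 0.70, True),    # before the cutoff, so excluded
    Prediction(date(2024, 1, 10), 0.80, True),
    Prediction(date(2024, 2, 5), 0.60, False),
    Prediction(date(2024, 3, 1), 0.30, False),
    Prediction(date(2024, 3, 15), 0.40, True),
]

test_set = held_out(history, training_cutoff=date(2023, 12, 31))
print(len(test_set), accuracy(test_set), round(brier_score(test_set), 4))
```

The key point is the date filter: only events after the training cutoff are scored, so the model cannot be graded on results it has already seen.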

What are Monte Carlo simulations?

Monte Carlo simulations model thousands of random outcomes based on the underlying probabilities. For tennis, we run 10,000 simulations per match, simulating each set and game to produce win probabilities, set score distributions, and tiebreak likelihoods. For MLB, we run 5,000 simulations per game to model run distributions, first-five outcomes, and more.
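As a rough illustration of the tennis case, the sketch below simulates a best-of-three match game by game and tallies the outcomes. It is a simplified toy, not the Obsidic model: the inputs `hold_a` and `hold_b` (each player's probability of holding serve) are hypothetical, every game is treated as an independent draw, and a tiebreak is reduced to a single weighted coin flip.

```python
import random
from collections import Counter


def simulate_set(rng, hold_a, hold_b, a_serving):
    """Play one set. Returns (a_won_set, tiebreak_played, a_serves_next)."""
    games_a = games_b = 0
    while True:
        if games_a == 6 and games_b == 6:
            # Crude assumption: one draw tilted toward the stronger server.
            p_a = 0.5 + (hold_a - hold_b) / 2
            return rng.random() < p_a, True, not a_serving
        server_holds = rng.random() < (hold_a if a_serving else hold_b)
        if server_holds == a_serving:  # A held serve, or A broke B
            games_a += 1
        else:
            games_b += 1
        a_serving = not a_serving
        if max(games_a, games_b) >= 6 and abs(games_a - games_b) >= 2:
            return games_a > games_b, False, a_serving


def simulate_match(rng, hold_a, hold_b):
    """Best-of-three match. Returns (sets_a, sets_b, any_tiebreak)."""
    sets_a = sets_b = 0
    any_tiebreak = False
    a_serving = rng.random() < 0.5  # coin toss for first serve
    while sets_a < 2 and sets_b < 2:
        a_won, tiebreak, a_serving = simulate_set(rng, hold_a, hold_b, a_serving)
        any_tiebreak = any_tiebreak or tiebreak
        if a_won:
            sets_a += 1
        else:
            sets_b += 1
    return sets_a, sets_b, any_tiebreak


def run_simulations(hold_a, hold_b, n=10_000, seed=0):
    """Repeat the match n times and summarize the outcome distribution."""
    rng = random.Random(seed)
    scores = Counter()
    tiebreak_matches = 0
    for _ in range(n):
        sets_a, sets_b, any_tiebreak = simulate_match(rng, hold_a, hold_b)
        scores[(sets_a, sets_b)] += 1
        tiebreak_matches += any_tiebreak
    return {
        "win_prob_a": sum(c for (a, b), c in scores.items() if a > b) / n,
        "set_scores": {score: count / n for score, count in scores.items()},
        "tiebreak_rate": tiebreak_matches / n,
    }


# Hypothetical hold-serve rates for two players.
print(run_simulations(hold_a=0.85, hold_b=0.78))
```

Because each run produces a full match, the same loop yields the win probability, the set score distribution, and the share of matches with a tiebreak at once, which is the appeal of simulating rather than computing a single number.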

How do you calculate "edge"?

Edge is the difference between our model's probability and the implied probability from the market odds. For example, if our model says a team has a 65% chance of winning but the market implies only 55%, the edge is +10 percentage points. Positive edge means the market is undervaluing that outcome relative to our model.
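The arithmetic can be sketched in a few lines. This example assumes decimal odds and uses the raw implied probability (1 divided by the odds) with no adjustment for the bookmaker's margin; the odds format and the function names `implied_probability` and `edge` are assumptions made for illustration.

```python
def implied_probability(decimal_odds: float) -> float:
    """Convert decimal odds to the raw implied probability (margin not removed)."""
    if decimal_odds <= 1.0:
        raise ValueError("decimal odds must be greater than 1.0")
    return 1.0 / decimal_odds


def edge(model_prob: float, market_prob: float) -> float:
    """Model probability minus market-implied probability, as a fraction."""
    return model_prob - market_prob


# The example from the text: model at 65%, market at 55%.
print(f"{edge(0.65, 0.55) * 100:+.0f} percentage points")

# Starting from odds instead: decimal odds of 1.82 imply roughly 55%.
market = implied_probability(1.82)
print(f"{edge(0.65, market) * 100:+.1f} percentage points")
```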

What data sources power the models?

For Tennis: 46,400+ historical matches across ATP and WTA, 1,335 player Elo profiles, surface-specific stats, recent form, and head-to-head records, all engineered into 113 ML features. For MLB: 3.4M+ Statcast pitches, 709,281 plate appearances, park factors, pitcher/batter platoon splits, bullpen usage, and rolling performance windows.

When does the Golf model launch?

The PGA Golf model is currently in development. It will feature Strokes Gained decomposition (6 components), course-fit scoring, head-to-head matchup projections, and full-field tournament simulations. Sign up for early access to be notified the moment it goes live.

No black boxes.
Read everything.

MLB Prediction Model

Tennis Prediction Model

PGA Golf Model

