Model Dashboard
Phase 40 • Training
Model Story
A story-driven view of each run: trust (performance), drivers (what mattered), adaptation (how it shifts under stress), and blind spots (when it fails).
Select a target and a run to load the model story.
Reliability (Calibration)
Verification: does a predicted "60% probability" actually correspond to a 60% win rate? Points should hug the diagonal line.
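The check behind this chart can be sketched as follows, assuming predicted win probabilities and 0/1 outcomes (the names `probs`, `outcomes`, and the function itself are illustrative, not the dashboard's actual implementation):

```python
# Hedged sketch of a reliability (calibration) computation: bucket
# predictions into probability bins, then compare each bin's mean
# predicted probability with its observed win rate. A well-calibrated
# model yields points that hug the diagonal y = x.

def calibration_points(probs, outcomes, n_bins=10):
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, outcomes):
        i = min(int(p * n_bins), n_bins - 1)  # clamp p == 1.0 into the last bin
        bins[i].append((p, y))
    points = []
    for b in bins:
        if b:
            mean_p = sum(p for p, _ in b) / len(b)   # x: mean predicted probability
            win_rate = sum(y for _, y in b) / len(b)  # y: observed win rate
            points.append((mean_p, win_rate))
    return points  # (predicted, observed) pairs to plot against the diagonal
```

Each returned pair is one point on the reliability curve; empty bins are skipped rather than plotted at zero.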
Reality Check
Select a run to compute backtest performance…
Prediction Confidence
Select a run to compute prediction uncertainty…
Evolution of Trust
Track how allocation rotates across base models over time (stacked to 100%), with the target-specific benchmark overlaid to reveal which base model is driving decisions in each regime. Applies to the StackedEnsemble model.
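The stacked-to-100% view can be sketched like this, assuming a time series of raw base-model weights (the dict structure and names are assumptions for illustration):

```python
# Minimal sketch: rescale each timestep's raw base-model weights to
# percentages summing to 100, so the chart stacks to a full band and
# rotation between models is visible over time.

def stack_to_100(weight_history):
    stacked = []
    for weights in weight_history:  # one dict of model -> raw weight per timestep
        total = sum(weights.values()) or 1.0  # guard against an all-zero timestep
        stacked.append({m: 100.0 * w / total for m, w in weights.items()})
    return stacked
```

The benchmark series would then be overlaid on the resulting stacked-area chart rather than normalized with the weights.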
Signal Forensics
Compare the stacked ensemble (blue) against the actual target (black) and the underlying base models (dotted).
Model Stability (Rolling Log Loss)
Consistency check: Spikes indicate regime failures where the model stopped understanding the market.
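The rolling metric behind this panel can be sketched as below, assuming per-period predicted probabilities and binary outcomes (names and window size are illustrative):

```python
# Hedged sketch of rolling log loss: compute the per-sample binary
# log loss, then average it over a trailing window. Sustained spikes
# mark regimes where predictions broke down.
import math

def rolling_log_loss(probs, ys, window=20, eps=1e-15):
    losses = []
    for p, y in zip(probs, ys):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        losses.append(-(y * math.log(p) + (1 - y) * math.log(1 - p)))
    # trailing mean over each full window
    return [
        sum(losses[i - window:i]) / window
        for i in range(window, len(losses) + 1)
    ]
```

A flat series near the loss of an uninformed 50/50 guess (ln 2 ≈ 0.693) signals no edge; spikes well above it signal regime failure.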
Blind Spots
Select a run to analyze error patterns (e.g., error vs volatility) and identify failure modes…
Comparison
Compare candidate models against the Conservative (Voting) ensemble. Blue marks the Champion; green marks the Voting ensemble.
The Brain
The final feature set used by this run, grouped by type (Volatility / Momentum / Macro / Other).
Numbers represent feature importance scores (relative contribution to prediction).
Drivers
Ranked importance across the base model layer (normalized). Shows what drove predictions most.
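Normalizing and ranking importances across the base-model layer can be sketched as follows, assuming a dict of raw per-model scores (the structure and names are assumptions; the dashboard's actual aggregation may differ):

```python
# Illustrative sketch: normalize each base model's importances to sum
# to 1 (so models on different scales are comparable), aggregate per
# feature, rescale the totals to sum to 1, and rank descending.

def ranked_importance(per_model_scores):
    totals = {}
    for scores in per_model_scores.values():
        s = sum(scores.values()) or 1.0
        for feat, v in scores.items():
            totals[feat] = totals.get(feat, 0.0) + v / s  # per-model normalization
    grand = sum(totals.values()) or 1.0
    ranked = sorted(totals.items(), key=lambda kv: kv[1], reverse=True)
    return [(feat, v / grand) for feat, v in ranked]  # sums to 1, descending
```

The per-model normalization step is the design choice that matters here: without it, a base model with large raw scores would dominate the ranking.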