Validation Dashboard
The validation dashboard tracks two complementary metrics across releases:
- Accuracy (0–1): how closely simulation results match calibration targets
from the original BAM model (Delli Gatti et al., 2011). Higher is better.
- Stability (%): how consistently random seeds produce valid results.
A seed “passes” when all its metric scores exceed the tolerance threshold.
Card border colors reflect the accuracy:
- Green: accuracy ≥ 0.85
- Yellow: accuracy 0.70–0.85 (warning)
- Red: accuracy < 0.70 (failing)
Stability is shown inline with its own coloring (green ≥ 97%, yellow ≥ 96%).
Click a scenario card to drill down into per-metric charts. Click any chart to
expand it.
Loading validation data...
No validation benchmark data available yet.