Model Registry
Picochat runs write many useful artifacts. The model registry turns those run folders into a single auditable table for teams deciding which checkpoint is safe to inspect, export, or deploy.
Build a Registry
From explicit run directories:
picochat registry runs/run-a runs/run-b \
--out reports/model_registry.md \
--json-out reports/model_registry.json
Or discover every run with summary.json under a run bank:
picochat registry --runs-dir runs \
--out reports/model_registry.md \
--json-out reports/model_registry.json
The registry includes:
- gate status and profile
- parameter count and planned token budget
- tokens per parameter
- eval and SFT fit rates
- held-out SFT fit when available
- external benchmark count
- honesty and preflight status
- best/resume checkpoint paths
- tokenizer path
Write a Release Card
For a single run:
picochat registry runs/h200-1b-release \
--release-card reports/h200-1b-release-card.md
The release card is a compact artifact for handoff. It points to required evidence: preflight, honesty, eval, external benchmarks, checkpoints, tokenizer, and gate issues.
Product Rule
The registry is an index, not an approval system. A row with status approved
is useful only when the linked evidence exists and the run’s model card/export
matches the same checkpoint. A blocked run should stay visible so failures are
not lost between experiments.