docs: add training roadmap

scawful
2025-12-30 13:15:23 -05:00
parent c3342100e0
commit 85b16363f5
3 changed files with 50 additions and 6 deletions

@@ -1,13 +1,15 @@
# ROADMAP
## Committed
- Minimal module layout + package stubs
- One small utility
- Plugin config helpers (training paths/resources + mounts)
- Dataset registry indexing CLI
- Resource indexing CLI
## Planned
- Local config template
- Example generator
- Validator base + initial validators (ASM/C++/KG/ASAR)
- Generator base + one small generator
- Training monitor schema validation
## Ideas
- Idea: Local validation hooks for training data
- Idea: Minimal dataset manifest generator
- Idea: Dataset manifest + QA summary reports
- Idea: Lightweight training campaign runner (local-only)

docs/TRAINING_ROADMAP.md Normal file

@@ -0,0 +1,27 @@
# TRAINING ROADMAP
Scope: AFS Scawful training data pipelines and monitoring. Research-only.
## Committed (exists now)
- Dataset registry indexing (local)
- Resource indexing (local)
- Plugin config loader for training paths/resources
## Planned (near-term)
- Validator base + initial validators (ASM/C++/KG/ASAR)
- Generator base + one small generator
- Training monitor schema validation and config docs
## Planned (mid-term)
- Generator runner that produces dataset manifests
- QA summary reports (counts, sizes, validation results)
- Repeatable dataset build scripts (local-only)
## Ideas (later)
- Training campaign runner (batching + resume)
- Per-generator metrics export (JSON)
- Optional remote sync hooks (explicit opt-in, no auto-network)
## Unknown / needs verification
- Which generators can be safely generalized outside local workflows
- Which validation checks are still useful vs legacy noise
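
The mid-term item "QA summary reports (counts, sizes, validation results)" could look roughly like the sketch below. It is only an illustration under stated assumptions, not code from this repository: `build_qa_summary`, `non_empty`, `demo`, and the shape of the summary dict are all hypothetical names chosen for the example, and validators are assumed to be plain callables that take file text and return a bool.

```python
import json
import tempfile
from pathlib import Path


def build_qa_summary(dataset_dir, validators):
    """Summarize a local dataset directory.

    Hypothetical helper: reports file count, total size in bytes, and
    per-validator pass/fail counts. Nothing here touches the network.
    """
    files = sorted(p for p in Path(dataset_dir).rglob("*") if p.is_file())
    summary = {
        "file_count": len(files),
        "total_bytes": sum(p.stat().st_size for p in files),
        "validation": {name: {"passed": 0, "failed": 0} for name in validators},
    }
    for path in files:
        text = path.read_text(encoding="utf-8", errors="replace")
        for name, check in validators.items():
            key = "passed" if check(text) else "failed"
            summary["validation"][name][key] += 1
    return summary


def non_empty(text):
    # Hypothetical validator: a sample must contain non-whitespace text.
    return bool(text.strip())


def demo():
    # Build a throwaway two-file dataset so the sketch runs anywhere.
    with tempfile.TemporaryDirectory() as tmp:
        (Path(tmp) / "a.asm").write_bytes(b"LDA #$00\n")
        (Path(tmp) / "b.asm").write_bytes(b"")
        return build_qa_summary(tmp, {"non_empty": non_empty})


if __name__ == "__main__":
    print(json.dumps(demo(), indent=2))
```

Passing validators in as a name-to-callable mapping keeps the report independent of any particular validator base class, which the roadmap still lists as planned.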