Evaluation Framework
The evaluation framework is a continuous improvement loop: pentest a known-vulnerable lab, compare the findings to the lab's answer key, then improve the skills based on the gaps.
- Lab Framework — Available labs, setup, running evals
- Scoring — TP/FN/FP scoring, gap analysis
- Skill Evals — Per-skill evaluation and benchmarking
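
The comparison step of the loop can be illustrated with a minimal sketch. It assumes findings and answer-key entries are reduced to plain string identifiers; the names `Score`, `score_findings`, and the example vulnerability IDs are hypothetical and not taken from the framework itself, which may match findings differently.

```python
from dataclasses import dataclass


@dataclass
class Score:
    """TP/FN/FP breakdown for one eval run (hypothetical structure)."""
    true_positives: set[str]   # reported and in the answer key
    false_negatives: set[str]  # in the answer key but not reported
    false_positives: set[str]  # reported but not in the answer key

    @property
    def precision(self) -> float:
        reported = len(self.true_positives) + len(self.false_positives)
        return len(self.true_positives) / reported if reported else 0.0

    @property
    def recall(self) -> float:
        expected = len(self.true_positives) + len(self.false_negatives)
        return len(self.true_positives) / expected if expected else 0.0


def score_findings(findings: set[str], answer_key: set[str]) -> Score:
    """Compare reported findings to the answer key using set operations."""
    return Score(
        true_positives=findings & answer_key,
        false_negatives=answer_key - findings,
        false_positives=findings - answer_key,
    )


# Illustrative data only: IDs are made up for this sketch.
answer_key = {"sqli-login", "xss-search", "idor-orders"}
findings = {"sqli-login", "xss-search", "open-redirect"}

result = score_findings(findings, answer_key)

# Gap analysis: the false negatives are what the skills missed.
for missed in sorted(result.false_negatives):
    print(f"missed: {missed}")
print(f"precision={result.precision:.2f} recall={result.recall:.2f}")
```

In this sketch the false negatives feed the gap analysis (what to improve), while the false positives indicate findings that need tighter validation before they are reported.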