#verification
4 posts
-
Install It, Plan It, Heal It
Phases 65, 71 and 72 of kodr: a controlled dependency install so generated apps can actually run, a plan-then-execute self-dev acceptance test (and the local tool-call bug it found), and a bounded self-healing repair loop that feeds real verification failures back to the model.
#ai#kodr#local-models#agents#automation#verification -
Scratchpads That Survive, Stages That Verify
Phases 57-58 of kodr: a planning scratchpad that carries between runs so plan-then-execute works on a small model, and staged execution that refuses to call one giant local-model dump a finished app until verification actually passes.
#ai#kodr#local-models#agents#memory#verification -
Evals, Scores, and Prompt Receipts
Phases 37-38 of kodr: a scored eval command so model regressions surface as a number instead of a squint, and prompt versioning so every run can be traced back to the exact prompt that produced it.
#ai#kodr#local-models#agents#verification#prompt-engineering -
Running Checks Without Handing Over a Shell
Phase 09 of kodr: a verification runner that allowlists a handful of commands instead of giving the model a shell.
#ai#local-models#agents#safety#verification#kodr