paul_
Work Approach Blog Contact
// tag

#verification

4 posts

  • Jun 24, 2026

    Install It, Plan It, Heal It

    Phases 65, 71 and 72 of kodr: a controlled dependency install so generated apps can actually run, a plan-then-execute self-dev acceptance test (and the local tool-call bug it found), and a bounded self-healing repair loop that feeds real verification failures back to the model.

    #ai#kodr#local-models#agents#automation#verification
  • Jun 21, 2026

    Scratchpads That Survive, Stages That Verify

    Phases 57-58 of kodr: a planning scratchpad that carries between runs so plan-then-execute works on a small model, and staged execution that refuses to call one giant local-model dump a finished app until verification actually passes.

    #ai#kodr#local-models#agents#memory#verification
  • Jun 14, 2026

    Evals, Scores, and Prompt Receipts

    Phases 37-38 of kodr: a scored eval command so model regressions surface as a number instead of a squint, and prompt versioning so every run can be traced back to the exact prompt that produced it.

    #ai#kodr#local-models#agents#verification#prompt-engineering
  • May 31, 2026

    Running Checks Without Handing Over a Shell

    Phase 09 of kodr: a verification runner that allowlists a handful of commands instead of giving the model a shell.

    #ai#local-models#agents#safety#verification#kodr
© 2026 Paul Kohler Creative AI Writer · Agentic Diagrams shipped with intent