How to use this
Plain language first
Each term here is written in normal English first, with the technical meaning attached to it rather than hidden inside it.
Independent research and operating notes on AI agent governance.
Reference / Glossary
This glossary keeps the CAISI vocabulary plain because governance work gets muddy fast when every team uses different words for the same mechanism. The goal is not new jargon. It is a small set of terms AppSec, platform, and engineering can reuse without losing the underlying control logic.
How to use this
Each term here is written in normal English first, with the technical meaning attached to it rather than hidden inside it.
Best fit
These are the terms that show up most often when teams are trying to compare tools, define controls, or explain incidents.
Whether approval actually changes execution state. If approval only leaves a record and the action can still execute anyway, the mediation is weak.
The artifact bundle a reviewer, auditor, or incident responder can inspect cold. It should explain scope, policy, execution, validation, and residual risk without side-channel narration.
A receipt shows that something was approved or requested. Proof shows what actually happened. Mature workflows need both.
What remains uncertain, deferred, or dependent on human review after the run. Good proof makes residual risk explicit instead of implying certainty.
The moment before an agent can change something real. This is where meaningful policy has to run if the goal is prevention, not documentation after the fact.
A state where the requested action cannot run. This matters because "deny" or "approval required" is only credible if it actually makes the action non-executable.
Any route an agent can use to change code, systems, data, or other production reality. Repos, MCP tools, CI jobs, and local runners can all be write paths.
The runtime contract that determines what happens after a stop request. A stop control is only strong if it prevents new side effects after the state transition.
The docs, commands, fixtures, tests, and boundaries that tell both humans and agents how work is supposed to happen in a repo.
A background delivery system that turns bounded work items into reviewable artifacts and pull requests with explicit states, retries, and handoff points.
The deterministic workflow around the model: planning, execution, validation, evidence packaging, and shipping. It is what makes the work reusable across teams and repos.
An isolated workspace that is already bootstrapped enough to run quickly, but still separate enough to keep retries, secrets, and side effects bounded.
Tests or scenarios the agent cannot overfit to during the build loop. These matter when visible tests alone are too easy to game.
The measured ability of the control layer to change what can execute and leave behind proof that survives review.
Whether the evidence packet is complete enough for a third party to reconstruct what happened without guessing.
The question of whether a lesson from one stack or run applies to your own environment. Mechanisms often transfer farther than raw rates.