AppSec
Control failure, proof, and review surfaces
Start where runtime behavior, approval, and evidence quality can be measured.
Independent research, field notes, and frameworks on AI-assisted software delivery.
CAISI / AI-assisted software delivery
CAISI studies how AI-assisted engineering workflows gain authority across code, CI/CD, tools, credentials, approvals, and evidence. The work is written for AppSec, security leadership, engineering, and platform teams that need practical answers without overstating what the evidence proves.
Role routes
AppSec
Start where runtime behavior, approval, and evidence quality can be measured.
CISO / Security leadership
Start where leadership needs a defensible story for risk, audit, and board review.
Engineering / Platform
Start where repo standards, CI/CD control, workflow ownership, and developer adoption become reusable engineering work.
GRC / Audit
Start where proof packets, approval records, validation, and re-review triggers need to hold up after the rollout.
Practical rollout questions
Most teams do not begin by asking for AI Software Delivery Control. They begin with a rollout, audit, platform, or credential question that needs a concrete answer.
Map the first Agent Action BOM for the workflows that can write, call tools, trigger CI/CD, or touch secrets.
Define the proof packet before rollout: actor, owner, credential, action, target, approval, validation, and outcome.
Classify actions as allowed, approval-required, or blocked so review sits where authority changes state.
Treat MCP declarations, tool calls, and invocation context as part of the delivery boundary.
Map tokens, service accounts, OAuth grants, CI secrets, and inherited identities to the action paths they enable.
Review OAuth grants, tokens, and connected-tool permissions by the actions they can perform.
Map skills, MCP servers, agent configs, exposed endpoints, and tool declarations as action paths.
Define evidence that lets security, platform, audit, or leadership defend the action later.
Frameworks
CAISI uses AI Software Delivery Control as the working language after the practical problem is clear: AI-assisted workflows are becoming actors in software delivery. The old review model cannot carry the whole control burden unless teams add visibility, validation, governance, and proof.
Guide
A practical first-review path for mapping action authority, credentials, tool reach, approvals, and proof.
Reference
A proof-packet model for actor, owner, credential, action, target, approval, validation, and outcome.
Reference
A practical approval model for allowing, holding, or blocking actions at the execution boundary.
Reference
The practical artifact for mapping actor, owner, repo, workflow, credential, reachable actions, targets, approval, and proof.
Guide
A concrete control guide for GitHub Actions, CI/CD workflows, credentials, approval, and proof trails.
Reference
A practical guide for mapping tool reach, invocation context, approval triggers, and proof fields.
Reference
A practical guide for reducing standing-token risk across agents, CI/CD, tools, and release paths.
Field note
Why the missing artifact is an Agent Action BOM, not another generic AI inventory.
Field note
Why skills, MCP servers, agent configs, and exposed endpoints belong in the software delivery action graph.
Research
The research hub is the canonical entry point for report pages, methodology, and artifact-backed findings.
Published report
A controlled comparison showing what changes when the system moves from prompt-only constraints to enforceable tool-boundary control with evidence capture.
Published report
A locked 250-target publication cohort showing that public AI and agent adoption is easy to detect, but approved, bound, and well-evidenced use is much harder to prove.
Field notes
Use field notes when you need the interpretation layer behind the research: rollout pressure, approval packets, repo contracts, boundaries, pilots, and proof.
Executive adoption series
Five posts on platform standards, sanctioned pathways, approval discipline, and how leaders move from AI pilots to governed use.
Framework series
A 10-part framework on repo contracts, orchestration, isolation, evaluation, proof, and maturity.
Benchmark series
Five posts on risk scenarios, control efficacy, proof completeness, and pilot evaluation language for evaluators.
About
CAISI stands for the Centre for AI Security and Integrity. It publishes open research and operator field notes on governing AI-assisted software delivery.
Every headline claim maps to published artifacts, deterministic queries, and explicit scope limits. The point is to make AI agent control measurable enough for security, engineering, and platform teams to act on.
Team
Contact
For research questions, publication inquiries, or collaboration around reproducible AI governance work: david@caisi.dev