Capability Floors and the Consequentiality Axis
A team operates a Phase 2 organisation. They have identity controls, basic supply chain verification, behavioural baselines, output validation. They feel rea...
A team operates a Phase 2 organisation. They have identity controls, basic supply chain verification, behavioural baselines, output validation. They feel rea...
A team is auditing their agentic system. They have a spreadsheet. The columns are the controls they have shipped: identity verification, supply chain attesta...
A team builds a framework. They write the technical exposition: precise, systematic, complete. The exposition is correct. Engineers can implement from it. Th...
There is a temptation in technical work to tell the story as if it begins with you. The framework is yours; the architecture is yours; the discipline is your...
A man visits an old garden. At the centre is a banyan tree. The trunk is wider than three men’s outstretched arms. Roots descend from branches, find the grou...
There is a habit, in technical writing, of explaining the framework first and showing the application second. The framework comes with definitions, decomposi...
A team builds a defence around their AI agent. Threats emerge: a new prompt-injection technique, a new model-capability frontier, a new supply-chain compromi...
A team builds a Zero Trust framework for AI agents. They write a long document. They keep adding sections. New threat? Add a section. New control? Add a sect...
When the agentic-systems community talks about “AI safety,” we talk like we are inventing the discipline. We are not. We are receiving it.
The framework names a Vitality Formula: the engineered outcome of an agentic system, the experienced fruit of trust and safety done right. The formula has a ...
How do you know what an AI agent’s output is actually true?
Conventional security promises prevention. The discipline tries to stop bad things from happening: block the input, patch the bug, eliminate the vulnerabilit...
There is a clean version of the Trust × Safety frame that says: trust is built from competence and character; safety is built from alignment, resilience, and...
In conventional thinking, safety is a subset of security. You harden the system; harm doesn’t land. Secure is safe is the implicit mental model.
There is a tempting failure mode in agent design. The agent is asked to do something. The instruction is mildly ambiguous. The context is incomplete. The rig...
A team builds an AI agent. They invest deeply in trust: the agent’s identity is verified, its supply chain is attested, its behavior is profiled, its outputs...
A reviewer reads a Zero Trust framework. The framework claims its evaluation dimensions are orthogonal. The reviewer pushes:
A team is evolving their framework. They add a new constraint — a regulatory requirement, a new threat class, a new product surface, a new dimension of evalu...
A team is evolving an agentic Trust framework. They have built it for some time. The framework computes verdicts; the verdicts gate actions; the system works.
A Phase 2 organization deploys a frontier model. They have identity controls, basic supply chain verification, and competence testing. They feel ready.
Your agent spent $4,200 at 3am. The CISO has five questions. You have until the morning standup to answer them.
Every AI platform today makes the same claim: “We are secure. We are safe. We are trustworthy.”
It is 3:14am. Your purchasing agent detects that printer paper is running low — 15 units against a 50-unit threshold. It places a reorder. $50,000 spent. No ...
Every approach to constraining autonomous agents eventually faces the same question: what happens when the constraint is tested?
AI agents are not users and they are not microservices. They occupy a new category — autonomous software that reasons, acts, and transacts on behalf of human...
When a human buys something online, they click “Purchase.” That click is their consent — implicit, instantaneous, revocable by calling their bank.
The first question every security leader asks about a new framework: “What happens to what I already have?”
This week I had a reading on critical infrastructure policy across the world. While there is a recognition of the importance of critical infrastructure, diff...
Now more than ever in today’s digital age, it’s more important than ever to take steps to protect personal information online. Cyber attacks are becoming inc...
I was intrigued by the release of ChatGPT for research last week. I set out to explore by putting a few of my questions.
An interesting Ted talk that resonated with me a lot. Seemingly complex endeours can indeed be put together using simple concepts.
You’ll find this post in your _posts directory. Go ahead and edit it and re-build the site to see your changes. You can rebuild the site in many different wa...