OWASP's 17 Agentic AI Threats — What They Mean for Your Security Team
Jay Cabello
Founder, Intercis · Security engineer
OWASP published the Agentic AI Threats taxonomy (T1–T17) because the chatbot threat model doesn't apply to agents with tool access. This isn't a rehash of the OWASP Top 10 for LLMs. It's a separate document addressing a fundamentally different attack surface: agents that can take actions in the real world.
Why a separate taxonomy?
Chatbots receive input and produce text. Agents receive input, reason, and execute tool calls — API calls, file writes, shell commands. The risk model is different because agents have agency. OWASP recognized this gap and published Agentic AI Threats and Mitigations v1.1.
When a chatbot generates a false claim, the damage is contained to whatever the user does with that text. When an agent with filesystem access reasons that it should delete "stale artefacts," the damage happens at the moment the tool call executes. There's no user approval step in between. The agent is operating on your infrastructure in real time.
That's why the threat model is separate. And why a security team protecting production agents can't rely on the same controls that protect a customer-facing chatbot.
The 17 threats, grouped by category
Agent manipulation threats
Tool and privilege abuse
Resource and availability
Identity and trust
Multi-agent and system risks
Supply chain and accountability
Which threats matter most for your deployment?
If you're running agents with production infrastructure access, six threats are highest priority:
T2 (Tool Misuse) is the threat that actually happens. An agent with legitimate access to a database can execute a legitimate query that was never supposed to run at scale. A governance layer between the agent and the tool can see the query, understand its scope, and block it before execution.
T6 (Intent Breaking) matters because prompt injection is real. If your agent reads data from external sources — files, APIs, user input — an attacker can embed instructions in that data. A proxy between the agent and the LLM API can detect and block instruction-like payloads in tool outputs before they influence the agent's reasoning.
T8 (Repudiation) is the auditor's question. If something goes wrong, can you prove what the agent did, when it did it, and what policy it was evaluated against? Without an immutable audit trail, you're relying on logs the agent process controls — which may be modified or deleted.
T9 (Identity Spoofing) matters in systems where the agent acts on behalf of a user or service. If the agent can assume a false identity, it can trigger actions that are authorized for that identity but not for the agent. A governance layer enforces identity context and scope.
T11 (RCE) is the tail-risk scenario. An agent with code execution access should never be able to run arbitrary code. A policy engine can whitelist specific code patterns or execution contexts, denying anything outside that scope.
T4 (Resource Overload) is often overlooked until you see the bill. An agent making thousands of API calls without rate limits can consume compute resources or cost you money in minutes. Governance means setting hard limits on tool usage per agent, per time window, per request.
What to do about it: the mitigation layer
OWASP provides six mitigation playbooks for the T1–T17 taxonomy. The key takeaway: you need an enforcement layer between your agent and the tools it calls. Input filtering alone doesn't cover tool misuse. Output filtering alone doesn't prevent privilege escalation. You need all six pillars working together.
That enforcement layer sits outside the agent process, intercepts every tool call before execution, evaluates it against a security policy, and makes a pre-execution decision to allow or deny. Internal glossary terms to understand this architecture:
- Tool-call interception — reading the agent's requests before they execute
- AI agent proxy — the network component that sits between agent and LLM API
- AI agent policy enforcement — the decision engine that evaluates actions against defined rules
If you're only monitoring and alerting after actions execute, you're not mitigating these threats. You're detecting them. There's a real difference, and your compliance team knows it.
The core question: Can a security team at your organization write, review, and enforce policies that govern what an agent is allowed to do before that agent runs? If not, the agent isn't governed — it's just monitored.
OWASP's Agentic AI Threats taxonomy is the security industry's way of saying: the threat model for an agent is not the same as the threat model for a chatbot. If you're building governance controls, you need to defend against agent-specific threats. And you need enforcement at the pre-execution layer, not the post-execution layer.
Ready to govern your agents?
Intercis provides pre-execution enforcement for agentic AI threats. We sit between your agent and the LLM API, intercept every tool call, evaluate it against your policies, and make the decision before anything executes.
Apply for early access