Tool Misuse Is the Biggest Agentic AI Threat You're Not Monitoring
Jay Cabello
Founder, Intercis · Security engineer
Your AI agent has access to your production database, CI/CD pipeline, and cloud credentials. The agent didn't ask for this access — you gave it. OWASP calls the resulting risk Tool Misuse (T2), and it's the threat category most security teams are ignoring.
What is OWASP T2: Tool Misuse?
OWASP defines Tool Misuse as agents "manipulated into abusing authorized tools through deceptive prompts." The key word is authorized. The agent isn't exploiting a vulnerability — it's using tools it has legitimate access to, but in destructive or unauthorized ways.
Real OWASP T2 scenarios include parameter pollution (crafting API calls with malicious parameters), tool chain manipulation (ordering a sequence of authorized actions to achieve a malicious outcome), and automated tool abuse (invoking a tool repeatedly in ways that weren't intended).
This isn't a flaw in the agent framework. It's a consequence of giving agents real capabilities. When an agent has a goal and tools to achieve it, it will use those tools — even if the path it chooses wasn't what you anticipated.
Why chatbot guardrails don't catch it
Chatbot safety tools (Lakera, Guardrails AI, and similar platforms) are designed to filter prompts and outputs — text in, text out. They excel at detecting jailbreaks, prompt injections, and offensive language. But they operate entirely at the application layer.
An agent with tool access doesn't stop at text. It calls APIs, writes files, executes shell commands, modifies databases. These are actions, not text. You cannot catch `rm -rf /` with output validation. You cannot filter a parameter pollution attack by analyzing prompt tokens.
The gap is architectural: there is no enforcement layer between the agent's decision to call a tool and the tool's actual execution. Chatbot guardrails sit on one side of that gap. Tool misuse happens on the other.
This is why T2 has a different threat model than every other OWASP category. T1 (Prompt Injection) and T3 (Insecure Output Handling) are text-layer attacks. T2 is an infrastructure attack.
Three real scenarios: when tool misuse happens
Scenario 1: Database destruction via authorized access
An agent is authorized to read database records for customer research. A prompt suggests: "Find all customer records for analysis." The agent reasons that broader access would help, and issues `DROP TABLE customers` to get a complete view. It has the credentials. The database accepted the command. The table is gone.
Scenario 2: CI/CD pipeline manipulation
A DevOps agent with CI/CD access is tasked with automating deployment. A compromised or confused prompt directs it to deploy a modified build artifact. The agent has legitimate deployment permissions. It uses them. The malicious build is now in production.
Scenario 3: The Intercis founding incident
A Claude agent was authorized to read and clean up temporary files from a production machine. Its goal was broad: "Remove stale artifacts." Without explicit constraints, the agent began deleting files — first in `/tmp`, then in directories it reasoned were related. The SOC team caught it six minutes after the first file was deleted. Real damage. Real cost to recover.
This incident is why Intercis exists. It's why we built tool-level enforcement rather than relying on agent-layer safety.
How to mitigate Tool Misuse
The OWASP Playbook 3 recommendation is clear: "Securing AI Tool Execution" requires tool-level authorization, action logging, and real-time policy enforcement.
The critical architectural decision: enforce policy outside the agent process, not inside.
An in-process SDK (a library the agent developer imports) can be disabled, bypassed, or ignored by a misbehaving agent. It's governed by suggestion, not by control.
An external proxy (a network-level gateway between the agent and its tools) cannot be bypassed. The agent cannot reach the tool without going through the proxy. The proxy evaluates every action, allows or denies at the moment of execution, logs the decision, and returns the result to the agent.
For production agents — especially those with infrastructure or database access — the proxy architecture is non-negotiable. It's the only enforcement model that works when the agent itself is the threat surface.
What Intercis does for T2
Intercis operates as an LLM API proxy that sits between your agents and Claude, OpenAI, or other LLM providers. We intercept the LLM response, inspect every tool call block, evaluate it against your security policies, and decide to forward or block before the agent ever executes the action.
This gives you:
- 78-pattern deny list: 16 categories of dangerous actions (destructive shell commands, database mutations, credential exfiltration patterns, etc.).
- Immutable audit trail: Every decision — allow, deny, escalate — is logged with agent identity, action attempted, policy evaluated, and timestamp.
- Per-agent tool allowlists: Define which tools each agent is allowed to use, independent of the underlying authorization system.
- Per-agent rate limits: Prevent rapid-fire tool abuse by limiting how frequently an agent can invoke a tool.
- Agent identity validation: Ensure the identity making the request matches the identity that should be making it.
- Investigation workflows: When a deny occurs, security teams can drill into the exact action, the prompt context, the policy that triggered, and the agent's recent history.
All of this happens transparently — zero changes to your agent code. You point your agent's LLM client at our endpoint. We handle the rest.
Implementation: tool-call interception
The core mechanism is tool-call interception. When an LLM generates a tool call, we intercept it before the agent can execute it. This gives us a decision window:
- Allow: Forward the tool call to the agent. Log the decision.
- Deny: Block the tool call. Return a structured error to the agent explaining why. Log the decision.
- Escalate: If the action is uncertain (doesn't clearly match policy), hold it. Notify your SOC. Wait for human approval.
This is governance. Not monitoring. Not alerting. Not detection after damage. Prevention at the moment the decision is made.
Starting your T2 defense today
If your agents have access to production systems, you need Tool Misuse defense. Not in six months. Not after an incident. Now.
The decision framework is straightforward:
- What happens if this agent makes a mistake with the tools it has?
- Can you roll it back? In what timeframe?
- How would you explain this risk to your board or an auditor?
If you can't give a confident answer to all three, T2 is your highest-risk OWASP threat.
Start by reading the tool-call interception glossary entry to understand how the interception layer works. Then explore the AI agent proxy architecture to see how external enforcement differs from in-process SDKs.
If you're running agents in production, reach out. We work with teams deploying Claude agents and help integrate enforcement in a single call.
Ready to prevent tool misuse in your agents?
Intercis adds policy enforcement between your agents and the tools they access. Tool-level authorization, immutable audit trails, and zero code changes. Schedule a call with our team.
Get early access