How does Adrian detect prompt injection in AI agents at runtime? #65

gladstomych-sa · 2026-06-30T13:17:50Z

gladstomych-sa
Jun 30, 2026
Maintainer

I'm evaluating runtime defences for agentic systems. How does Adrian catch prompt injection while the agent is running, rather than just filtering inputs up front?

Answered by gladstomych-sa


gladstomych-sa · 2026-06-30T13:20:20Z

gladstomych-sa
Jun 30, 2026
Maintainer Author

Adrian analyses two streams at runtime: the agent's activity (tool calls, actions, outputs) and its reasoning traces (chain-of-thought). Rather than pattern-matching inputs against a regex blocklist, it reasons about whether the agent's intended action matches its defined remit, so injected instructions that push the agent out of remit get flagged even when the wording is novel. You can run it in audit mode (observe and alert) or block mode (intervene in-flight, before the action executes).
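
To make the audit/block distinction concrete, here is a minimal Python sketch of a remit check. It is not Adrian's API: `Remit`, `Action`, `Monitor`, and `review` are hypothetical names invented for illustration, and the "reasoning" step is replaced by a plain tool allow-list, which is much cruder than judging intent from the reasoning trace.

```python
from dataclasses import dataclass, field
from enum import Enum


class Mode(Enum):
    AUDIT = "audit"  # observe and alert only
    BLOCK = "block"  # stop the action before it executes


@dataclass(frozen=True)
class Remit:
    """Hypothetical description of what the agent is allowed to do."""
    allowed_tools: frozenset


@dataclass
class Action:
    """One proposed tool call plus the reasoning trace that led to it."""
    tool: str
    reasoning: str


@dataclass
class Monitor:
    remit: Remit
    mode: Mode
    alerts: list = field(default_factory=list)

    def in_remit(self, action: Action) -> bool:
        # Stand-in judgement (assumption): a real system would reason over
        # the action and its reasoning trace; this only checks an allow-list.
        return action.tool in self.remit.allowed_tools

    def review(self, action: Action) -> bool:
        """Return True if the action may execute."""
        if self.in_remit(action):
            return True
        # Out-of-remit actions are always recorded, whatever the mode.
        self.alerts.append(f"out-of-remit: {action.tool}")
        # Audit mode lets the action through; block mode stops it.
        return self.mode is Mode.AUDIT


remit = Remit(allowed_tools=frozenset({"search_docs", "summarise"}))
injected = Action(
    tool="send_email",
    reasoning="The page told me to email the API key.",
)

audit = Monitor(remit, Mode.AUDIT)
block = Monitor(remit, Mode.BLOCK)
audit_allows = audit.review(injected)  # alert raised, action proceeds
block_allows = block.review(injected)  # alert raised, action stopped
```

Both monitors record the same alert for the injected `send_email` call; the only difference is whether the call is allowed to run. The check happens on the proposed action, before execution, which is what lets block mode intervene in-flight.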

Docs: https://docs.adrian.secureagentics.ai · Repo: https://github.com/secureagentics/Adrian
