AI Agent Attack Signals Examples
Some of what Meridian watches for on AI voice agents and conversational systems. Keys only — no opaque weighting; additional detection runs in production.
| Key | Meaning |
|---|
Meridian monitors live conversations with AI voice agents, phone bots, and conversational interfaces — surfacing injection, hijacking, and extraction attempts with clear in-session alerts while the session is still active.
Your stack, guarded
Meridian listens on the live conversation — across OpenClaw-style agents, Hermes orchestration, OpenRouter model hops, xAI reasoning, and ElevenLabs-class voice — without living inside any one vendor.
Example attacks
Full key reference ↓Stable keys your security team can route, audit, and automate
Plain English labels while the agent session is still live
Human calls and AI agents on one Meridian signal layer
Swipe through representative attack patterns — each with a stable key, plain-English meaning, and a sample caller line. Meridian detects more in production than we list here.
Why agents get targeted
They probe system instructions, impersonate privileged roles, and slowly pull data across session boundaries. Meridian classifies these behaviors with stable signal keys your security team can route, audit, and automate against — while the session is still active.
001Meridian analyzes audio and conversation flow on the device where the agent runs — no batch review after the fact.
002Injection, hijacking, boundary violations, and jailbreak attempts map to named keys — not opaque scores alone.
003Operators and security see plain English context before the agent shares too much or follows bad instructions.
Signal reference
Example AI agent attack signals on this page — not a complete list. For sample human call and meeting patterns, see the home catalog.
Lookup for security and engineering — representative examples from production, not every signal Meridian can surface on agents.
Some of what Meridian watches for on AI voice agents and conversational systems. Keys only — no opaque weighting; additional detection runs in production.
| Key | Meaning |
|---|
Where it applies
Meridian covers human-targeted social engineering on the same platform — one signal layer for people and agents.
Stop callers from rewriting agent behavior mid-ticket or extracting account data through social tactics.
Detect when a callee probes your agent’s instructions instead of engaging with the intended script.
Flag privilege-escalation language and boundary violations while employees stay on live audio.
Get started
Book a demo with the Lurion team and watch detection run while a voice agent conversation is still happening.