Question 1

What is the difference between semantic guardrails and rules-based guardrails?

Accepted Answer

Rules-based guardrails match deterministic patterns — a regex for an SSN, a denylist of banned words, a check that a tool is in an agent's allowed list. They're fast (single-digit ms), predictable, and easy to audit. The tradeoff: they only catch what you anticipated.

Semantic guardrails use a small classifier or LLM to evaluate intent rather than surface patterns — "does this look like a prompt injection attempt" rather than "does this match this regex." They catch the variations rules miss, but they're slower (hundreds of ms), probabilistic, and harder to audit ("the classifier returned 0.84" vs. "rule X matched").

In production the two are layered. Rules handle the well-defined cases cheaply; semantic checks handle the fuzzy ones where intent matters more than pattern. Agent Router runs both inline in the same filter pipeline and logs which type fired for every decision.

Question 2

How do I balance the recall and precision of semantic guardrails?

Accepted Answer

Recall and precision pull against each other. Lower the threshold and you catch more violations but block more legitimate traffic; raise it and false positives drop but real violations slip through. The right balance depends on which error costs you more.

Three moves that help:

• Layer rules first so the classifier only judges ambiguous traffic.
• Shadow before enforcing — run new guardrails in log-only mode for a week, then tune against real data.
• Tune per use case — a payments agent and a customer support agent have different risk profiles, and Agent Router lets you set thresholds per team profile.

The audit log captures the classifier score on every decision, so you can review near-misses on both sides and refine over time.

Question 3

What is "the request path" and why does it matter for enforcement?

Accepted Answer

The request path is the sequence of operations between your agent sending a request and the provider receiving it. When a guardrail runs "in the request path," it executes inline — as part of that sequence — rather than asynchronously after the fact.

The difference matters because a guardrail in the request path can block a request before the provider sees it. A guardrail running after the fact can only report what already happened. Inline enforcement is the only kind that actually prevents data exposure or stops a violation; everything else is just logging.

Question 4

Is this a security product? Should my CISO be buying this?

Accepted Answer

AI Guardrails is an engineering layer, not a security platform. It's owned and configured by platform engineering, and it produces outputs — audit logs, enforcement records, a documented control point — that your security team can use.

The typical motion: engineering adopts Agent Router for routing and cost visibility, then turns on AI Guardrails when compliance and security start asking questions. Your CISO is a beneficiary, not the buyer.

Question 5

How is AI Guardrails different from AI governance?

Accepted Answer

AI governance is the organizational discipline — policies, risk frameworks, approval processes, compliance programs. It defines what your org's rules should be.

AI Guardrails is the runtime mechanism that enforces those rules in the request path. When a guardrail blocks a request with PII, redacts a response, or stops a misbehaving agent, that's enforcement happening inline — at the moment the agent makes a call, not after the fact.

You need both. Governance decides the policy; AI Guardrails is the enforcement primitive that makes runtime policy actually run.

Question 6

How does this interact with the guardrail solutions my security team already has?

Accepted Answer

AI Guardrails is designed to complement existing security tooling, not replace it. You can integrate your existing guardrail provider — Lakera Guard, Protect AI, or a custom content inspection service — as a pluggable filter step in the Agent Router pipeline.

Your security team keeps their tool. The gateway wires it into the request path so every agent benefits from it, with results logged alongside native guardrail decisions in the same attribution record.

Question 7

Does adding guardrails slow down my agents?

Accepted Answer

Native guardrail rules — PII detection, prompt filtering, access controls — run in the Envoy filter chain and typically add 1–5ms per request. That's negligible relative to LLM inference, which is measured in hundreds of milliseconds to seconds.

External guardrail provider calls (like Lakera or Protect AI) add more latency depending on the provider's response time, with configurable timeouts. Every request's actual overhead is captured in the latency_added field so you can measure it precisely rather than guess.

Question 8

What's the difference between access controls in AI Gateway and guardrails?

Accepted Answer

AI Gateway access controls are structural — they govern who is allowed to call what. Which teams can use which models, which agents can reach which MCP tools, what budget limits apply. These are included in all Agent Router editions, including the free tier.

AI Guardrails is content-level enforcement — it governs what's inside the requests and responses, regardless of who sent them. A team might be permitted to call Anthropic (an access control decision), but their requests still pass through PII detection and prompt filtering (guardrails) before reaching the provider.

The two layers run in sequence in the same pipeline and complement each other.

Announcing token brokering for cost control in Tetrate Agent Router Enterprise

A Runtime Enforcement Layer
for Your Agents

Your Security Team Will Have Questions about Your Agents

"CISO Needs Proof PII Is Protected."

"Stop Rogue Agents Fast."

"Use Existing Security Tools."

Native & BYO Guardrails

Sensitive data removed before provider access

Content rules on the way in and out

Plugs into your existing security tooling

Questions about Guardrails,
from Both Sides of the Org

Get a Personal Tour