Announcing Built On Envoy: Making Envoy Extensions Accessible to Everyone

Learn more

Enterprise-Scale AI Gateway, Built on Envoy’s Proven Data Plane

Production-grade circuit breaking, weighted routing, and native OpenTelemetry. Change one line of code to get started.

Based on Envoy

The Infrastructure That Runs Production Traffic at Hyperscale, Applied to LLM Requests

  • C++ data plane, not a Python proxy. Handles millions of requests per second without becoming a bottleneck at scale.
  • Production-grade circuit breaking and outlier detection built for service-to-service traffic, applied to model providers.
  • Deploys as sidecar, edge proxy, regional instance, or central gateway. Same binary, any topology, no redeployment to switch.
  • Native OpenTelemetry emission at the proxy layer. Every request is a trace span with zero instrumentation in agent code.

Every LLM Request is Evaluated, Protected, and Logged in Six Stages

Every request your agents make passes through the same six-stage pipeline inside Al Gateway. Each stage runs in order and can short-circuit — returning a response directly — or pass to the next. The pipeline runs in the Envoy filter chain: each stage adds microseconds, not milliseconds.

Look up the inbound API key to resolve owner, project, and team. All downstream stages use this context for quota checks, routing, and log attribution.

What Each Stage Gets You in Practice

Routing

Route by model, team, token threshold, or request metadata in combination.

Failover

Automatic recovery from provider failures.

Logging

Every request produces a structured log record written to your configured sink.

Cost & Token Budgets

Enforce spend limits before requests hit the provider.

Traffic Splitting

Distribute traffic across multiple models.

Works with What You Already Have

Agent Router is provider-neutral, framework-agnostic, and OpenAI-compatible.
It doesn't replace your stack, it sits in front of it.

Model Providers

Self-hosted
models

Agent Frameworks

Observability

Any OTEL-
compatible
stack

Identity and Access

Any OIDC/SAML
provider
Any SSO provider

Ready to Start Routing?