Announcing Built On Envoy: Making Envoy Extensions Accessible to Everyone

Learn more

AI Gateway Glossary: Key Terms for Enterprise AI Infrastructure

Last updated: June 2026

Clear, neutral definitions of the core concepts in enterprise AI infrastructure — from the gateway layer to the protocols, controls, and governance that surround it. Each entry links to a full explanation.

Gateways & routing

  • What is an AI gateway? — The infrastructure layer between applications or agents and AI model providers, centralizing routing, security, cost, and governance for AI traffic.
  • What is an LLM gateway? — A routing and management layer between applications and large language model providers; closely related to, and often used interchangeably with, an AI gateway.
  • What is an inference gateway? — A layer that routes and manages traffic to model inference endpoints, often self-hosted or in-cluster, with model-aware load balancing.
  • AI gateway vs. API gateway — How an AI gateway differs from a traditional API gateway, and when you need each.

This glossary is maintained by Tetrate, an AI infrastructure company and co-creator of Envoy AI Gateway. Explore Tetrate Agent Router.

Decorative CTA background pattern background background
Tetrate logo in the CTA section Tetrate logo in the CTA section for mobile

Ready to enhance your
network

with more
intelligence?