Back to glossary

API Gateway

A server that acts as a single entry point for client requests, handling cross-cutting concerns like authentication, rate limiting, request routing, and protocol translation before forwarding to backend services.

An API gateway consolidates client-facing API concerns into a single managed layer. Instead of each microservice implementing its own authentication, rate limiting, logging, and CORS handling, the gateway handles these uniformly. Clients interact with one endpoint; the gateway routes requests to the appropriate backend service.

Key features include request routing (path-based, header-based), authentication and authorization, rate limiting, request/response transformation, caching, and API analytics. Products like Kong, AWS API Gateway, Apigee, and Traefik provide these capabilities with different trade-offs in flexibility, performance, and managed vs. self-hosted operation.

For AI products, API gateways serve as the control point for model routing. A single API endpoint can route requests to different model versions based on request content, user tier, or A/B test assignment. The gateway can enforce per-customer token quotas, cache frequent inference requests, and transform between different model API formats, creating a unified interface that abstracts the complexity of multiple AI backends.

Related Terms