multi-provider llm api routing with unified interface
Kong routes LLM requests to multiple AI providers (OpenAI, Anthropic, Azure, Ollama, etc.) through a single standardized API endpoint, translating request/response formats between providers' native schemas. The gateway maintains a provider registry with format adapters that normalize chat completion, embedding, and streaming requests into provider-specific protocols, enabling seamless provider switching and fallback without client-side changes.
Unique: Implements provider-agnostic LLM routing at the gateway layer using Lua-based request/response transformers that normalize OpenAI-compatible, Anthropic, Azure, and Ollama APIs into a unified contract, eliminating the need for client-side provider abstraction libraries
vs alternatives: Unlike client-side SDKs (LiteLLM, Langchain) that add dependency weight, Kong's gateway-level routing centralizes provider management, enables real-time provider switching without redeployment, and provides observability across all LLM traffic in one place
llm request/response transformation and enrichment
Kong intercepts LLM API requests and responses to apply transformations including prompt injection detection, token counting, cost calculation, response filtering, and header injection. The transformation pipeline uses Lua plugins that execute before requests reach the LLM provider and after responses return, enabling cost tracking, security scanning, and response normalization without modifying client or backend code.
Unique: Implements a pluggable transformation pipeline at the gateway layer that intercepts both requests and responses, enabling cost calculation, security scanning, and response normalization as middleware rather than requiring changes to client applications or LLM provider integrations
vs alternatives: Compared to application-level libraries (Guardrails, LangChain middleware), Kong's gateway-level transformations apply uniformly across all clients, reduce code duplication, and enable centralized security policies that can be updated without redeploying applications
control plane and data plane separation for hybrid deployments
Kong supports a hybrid architecture where a control plane (Admin API, configuration management) is separated from data planes (request processing) that connect to the control plane via RPC. The control plane manages configuration and pushes updates to data planes, which apply changes without restarting. Data planes can be deployed in different environments (on-prem, cloud, edge) and sync configuration from the control plane, enabling centralized management with distributed request processing.
Unique: Implements a control plane-data plane architecture with RPC-based configuration synchronization, enabling centralized management of distributed Kong deployments across multiple environments without requiring data plane restarts for configuration changes
vs alternatives: Unlike single-node Kong deployments or service mesh control planes, Kong's hybrid mode enables centralized configuration management with distributed data planes, supports multiple deployment environments, and allows configuration updates without downtime
automatic mcp server generation from rest apis
Kong can automatically generate MCP servers from existing REST APIs by introspecting API schemas (OpenAPI/Swagger) and converting REST endpoints into MCP tools. The generated MCP server exposes REST endpoints as callable tools with parameter schemas derived from API specifications, enabling LLM agents to interact with REST APIs via MCP without manual MCP server implementation.
Unique: Implements automatic MCP server generation from OpenAPI/Swagger specifications, converting REST endpoints into MCP tools with parameter schemas derived from API specs, enabling LLM agents to discover and call REST APIs via MCP without manual server implementation
vs alternatives: Unlike manual MCP server implementation or REST-only agent integrations, Kong's automatic generation reduces boilerplate, enables agents to discover available tools from API specs, and maintains consistency between REST API and MCP tool schemas
openresty/nginx-based reverse proxy with lua extensibility
Kong is built on OpenResty (Nginx + Lua JIT), providing a high-performance reverse proxy foundation with Lua scripting for custom logic. The Nginx core handles connection management, TLS termination, and HTTP protocol processing, while Lua runs in the request processing pipeline for plugins, routing, and transformations. This architecture enables Kong to handle high request volumes (>10K req/sec per node) while remaining extensible via Lua without requiring C module compilation.
Unique: Builds on OpenResty (Nginx + Lua JIT) to provide a high-performance reverse proxy with Lua-based extensibility, enabling custom gateway logic without C module compilation while maintaining throughput of >10K req/sec per node
vs alternatives: Unlike pure Nginx (limited extensibility without C modules) or application-level proxies (higher latency), Kong's OpenResty foundation provides Nginx-level performance with Lua scripting for custom logic, enabling both high throughput and extensibility
kong manager ui for visual configuration and monitoring
Kong Manager is a web-based UI that provides visual configuration of routes, services, plugins, and consumers without requiring Admin API calls or YAML editing. The UI displays real-time metrics (request count, latency, error rates), plugin status, and upstream health, enabling operators to manage Kong via a dashboard. The UI integrates with Kong's Admin API and supports role-based access control for multi-user environments.
Unique: Provides a web-based UI for Kong configuration and monitoring with real-time metrics display, role-based access control, and audit logging, enabling visual management without requiring Admin API or YAML knowledge
vs alternatives: Unlike command-line Admin API or raw YAML configuration, Kong Manager provides a visual interface with real-time metrics and audit trails, making Kong more accessible to non-technical operators and enabling better visibility into gateway state
model context protocol (mcp) traffic governance and routing
Kong provides native MCP server support, routing MCP client requests to backend MCP servers with authentication, authorization, and observability. The gateway implements MCP protocol handling via Lua plugins that parse MCP JSON-RPC messages, enforce access control policies, and forward requests to configured MCP server upstreams, enabling centralized governance of agentic LLM-to-tool interactions.
Unique: Implements native MCP protocol support at the gateway layer with JSON-RPC message parsing, tool authorization policies, and automatic MCP server generation from REST APIs, enabling centralized governance of agentic LLM tool access without requiring custom MCP server implementations
vs alternatives: Unlike client-side MCP implementations (Claude SDK, LangChain MCP), Kong's gateway-level MCP routing provides centralized access control, audit logging, and tool discovery across all agents, and can automatically expose existing REST APIs as MCP tools without backend changes
dynamic request routing with regex and semantic path matching
Kong's router uses a tree-based matching algorithm that supports exact path matching, regex patterns, and semantic matching (e.g., matching by HTTP method, hostname, headers) to route requests to backend services. The router compiles routes into an optimized tree structure at startup, enabling O(1) lookup for exact matches and efficient regex evaluation for pattern-based routes, with support for route priorities and weighted load balancing across multiple upstreams.
Unique: Implements a tree-based router compiled at startup that supports exact, regex, and semantic path matching with O(1) lookup for exact routes and efficient regex evaluation, enabling high-performance routing for thousands of routes without linear search overhead
vs alternatives: Compared to simple regex-based routers (basic reverse proxies), Kong's tree-based approach provides O(1) lookup for exact matches and supports semantic matching on multiple dimensions (path, method, hostname, headers) simultaneously, enabling complex routing logic without performance degradation
+6 more capabilities