Skip to content
🎯 New workshop: Govern AI Costs in Real Time — Hands-On with agentgateway agentgateway has joined the Agentic AI FoundationLearn more

For the complete documentation index, see llms.txt. Markdown versions of all docs pages are available by appending .md to any docs URL.

Page as Markdown

API types

Supported LLM API endpoint types and route configurations

Agentgateway natively supports multiple LLM API endpoint types. These are automatically exposed on the gateway, and translated as appropriate based on the provider.

The following API types have dedicated guides:

  • Chat completions: The OpenAI /v1/chat/completions endpoint. This is the most widely used API type for text generation and chat applications.
  • Responses: The OpenAI /v1/responses endpoint for stateful, multi-step model interactions.
  • Messages: The Anthropic /v1/messages endpoint for Claude models.
  • Embeddings: The OpenAI-compatible /v1/embeddings endpoint for creating vector representations of text.
  • Realtime: The OpenAI Realtime API for low-latency, streaming voice and text interactions over WebSockets.
  • Rerank: The Cohere-compatible /v2/rerank endpoint for ranking documents by relevance to a query.
  • Models: The OpenAI-compatible /v1/models endpoint for listing available models.
  • Token count: The Anthropic /v1/messages/count_tokens endpoint for estimating input tokens.
  • Passthrough: Forwards requests directly to the backend provider without transformation.
Was this page helpful?
Agentgateway assistant

Ask me anything about agentgateway configuration, features, or usage.

Note: AI-generated content might contain errors; please verify and test all returned information.

Tip: one topic per conversation gives the best results. Use the + button in the chat header to start a new conversation.

Switching topics? Starting a new conversation improves accuracy.
↑↓ navigate select esc dismiss

What could be improved?

Your feedback helps us improve assistant answers and identify docs gaps we should fix.

Need more help? Join us on Discord: https://discord.gg/y9efgEmppm

Want to use your own agent? Add the Solo MCP server to query our docs directly. Get started here: https://search.solo.io/.