Background texture

ModelMax

One API, every model. ModelMax is a unified LLM gateway that gives you access to models from multiple providers through a single, OpenAI-compatible API.

Why ModelMax?

  • Single integration — Write once, switch models freely. No provider-specific SDKs.
  • OpenAI-compatible — Drop-in replacement for /v1/chat/completions. Works with any OpenAI SDK.
  • Native Pass-Through (Gateway) — Support for native Anthropic (/v1/messages) and Gemini (/v1beta/models/...) endpoints with zero modification token sniffing.
  • Multi-provider — Access models from AWS Bedrock, Google Gemini, and Anthropic.

Supported capabilities

CapabilityEndpointStatus
OpenAI ChatPOST /v1/chat/completionsAvailable
Anthropic ChatPOST /v1/messagesAvailable
Gemini ChatPOST /v1beta/models/*Available
Streaming (SSE)POST /v1/chat/completionsAvailable
Image generationPOST /v1/images/generationsAvailable
Video generationPOST /v1/queue/{model}Available
Model listingGET /v1/modelsAvailable

Quick links