Background texture

One API, All Models

Access OpenAI, Anthropic, Google, and more through a single OpenAI-compatible endpoint. Zero markup on inference costs.

Chat Completions
~/modelmax $ curl https://api.modelmax.io/v1/chat/completions \ -H "Content-Type: application/json" \ -H "Authorization: Bearer sk-..." \ -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Video Generation
~/modelmax $ curl -X POST https://api.modelmax.io/v1/queue/veo-2.0-high \ -H "Content-Type: application/json" \ -H "Authorization: Bearer sk-..." \ -d '{
    "prompt": "A cinematic shot of a futuristic city...",
    "webhook_url": "https://your-domain.com/webhook"
  }'

Powerful Model Fleet

Access the world's leading foundation models through a single OpenAI-compatible API. No separate accounts, no per-provider SDKs.

25+

Models

5

Providers

4

Capabilities

1

API Key

Gemini + Embedding

Gemini + Embedding

Google's most capable models, excelling at complex reasoning, deep multimodal understanding, and high-quality semantic embeddings.

Featured Capabilities
Gemini 3.1 Pro Preview
gemini-3.1-pro-preview
Chat2M tokens
Gemini 3 Pro Preview
gemini-3-pro-preview
Chat2M tokens
Gemini 3 Flash Preview
gemini-3-flash-preview
Chat1M tokens
Text Embedding 005
text-embedding-005
Embedding
Google Veo

Google Veo

State-of-the-art cinematic video generation with synchronized speech, sound effects, and prompt fidelity.

Featured Capabilities
Veo 3.1
veo-3.1
Video Gen
Veo 3.1 Fast
veo-3.1-fast
Video Gen
Veo 3
veo-3
Video Gen
Veo 3 Fast
veo-3-fast
Video Gen
Kimi

Kimi

Moonshot's extended thinking model with deep chain-of-thought reasoning and unparalleled long context capabilities.

Featured Capabilities
Kimi K2 Thinking
kimi-k2-thinking
Chat128K tokens
Kimi K2.5
kimi-k2.5
Chat128K tokens
MiniMax

MiniMax

Frontier large language model with strong reasoning, creativity, and highly consistent instruction-following.

Featured Capabilities
MiniMax M2
minimax-m2
Chat1M tokens
MiniMax M2.1
minimax-m2.1
Chat1M tokens
DeepSeek

DeepSeek

Top-tier open-source reasoning model with remarkable performance on STEM, coding, and mathematical benchmarks.

Featured Capabilities
DeepSeek R1
deepseek-r1
Chat128K tokens
DeepSeek V3.1
deepseek-v3.1
Chat128K tokens
DeepSeek V3.2
deepseek-v3.2
Chat128K tokens
Qwen

Qwen

Alibaba's robust MoE model excelling at code generation, logic, and comprehensive multilingual capabilities.

Featured Capabilities
Qwen3 Coder 30B A3B
qwen3-coder-30b-a3b
Chat128K tokens
Qwen3 32B
qwen3-32b
Chat128K tokens
Qwen3 235B A22B
qwen3-235b-a22b-2507
Chat128K tokens
Qwen3 Coder 480B A35B
qwen3-coder-480b-a35b
Chat128K tokens

Frequently Asked Questions

Everything you need to know about the product and billing.

Is ModelMax truly a zero-markup platform?

Yes. You pay exactly the inference costs defined by the model providers. We do not add any markup or hidden fees to the per-token pricing.

Do I need to manage separate accounts for each provider?

No. With ModelMax, you only need one account and one API key to access all supported models. We handle the routing and billing across different providers.

How do I pay for the usage?

You can top up your ModelMax balance using credit cards or other supported payment methods. Your usage across all models will be deducted from this single balance.

Is the API fully compatible with OpenAI SDKs?

Absolutely. Our API is built to be a drop-in replacement for OpenAI. You just need to change the base URL to our endpoint and use your ModelMax API key.

Background pattern

Ready to accelerate?

Get your API key in under a minute and start building with the unified, zero-markup LLM gateway.