MiniMax M3
Open-weights multimodal model with 1M token context that matches frontier coding benchmarks at 5-10% of GPT-5.5 pricing.
Overview
MiniMax M3 is an open-weights multimodal model from Chinese AI lab MiniMax, released on June 1, 2026. It competes directly with frontier models like GPT-5.5 and Gemini 3.1 Pro on coding and agentic benchmarks — the key difference being that it does so at roughly 5-10% of the cost. The model supports a 1M token context window natively, handles text, image, and code inputs, and is designed from the ground up for agentic workflows where the model needs to plan, use tools, and execute multi-step tasks.
What makes M3 genuinely interesting for developers is the combination of price and capability. API pricing sits at $0.30 per million input tokens and $1.20 per million output tokens during the current promotional period — dramatically cheaper than comparable closed models. The 1M context window is large enough to ingest entire codebases, long document chains, or extended conversation histories without chunking. Open weights mean you can self-host, fine-tune, and inspect the model, which matters for teams with data sovereignty requirements or specialized use cases.
The main caveats are ecosystem maturity and geographic origin. MiniMax's developer platform and tooling are less polished than OpenAI's or Anthropic's, and third-party integrations (IDE plugins, agent frameworks) are still catching up. As with DeepSeek, the Chinese company origin may raise compliance questions for some enterprise buyers. But on raw price-performance for coding and agentic tasks, M3 is hard to ignore — it's already available on OpenRouter and other inference providers, which lowers the adoption barrier considerably.
Key features
1M Context Window
Supports up to 1 million tokens of context natively, allowing ingestion of entire codebases, long document chains, or extended multi-turn conversations without chunking or summarization.
Code Gen
Scores at or near frontier level on major coding benchmarks. Handles code generation, debugging, refactoring, and code review across popular languages with strong multi-file awareness.
Multimodal
Processes text, image, and code inputs natively within a single model. Useful for tasks like reading diagrams, understanding UI screenshots, or analyzing visual data alongside code.
Agentic
Built for agentic workflows with strong tool-use, planning, and multi-step execution capabilities. Designed to operate autonomously in chains where the model must reason about next actions.
Pricing
Free tier: Free API tier available for evaluation. Check platform.minimax.io for current limits.
| Plan | Price | What's included |
|---|---|---|
| Free Tier | Free | Limited API access for evaluation and testing |
| Plus | $20/mo | Approximately 1.7B tokens per month, priority access |
| API — M3 | $0.30/M input, $1.20/M output | Promotional pricing. 1M context window, multimodal input, tool use |
Limited API access for evaluation and testing
Approximately 1.7B tokens per month, priority access
Promotional pricing. 1M context window, multimodal input, tool use
Pros & cons
Pros
- ✓Frontier-level coding and agentic performance at 5-10% of GPT-5.5 or Gemini 3.1 Pro pricing
- ✓1M token context window handles entire codebases without chunking
- ✓Open weights allow self-hosting, fine-tuning, and full model inspection
- ✓Available on OpenRouter and third-party providers, not locked to a single platform
Cons
- ×Developer platform and tooling less mature than OpenAI or Anthropic ecosystems
- ×Limited native IDE integrations — mostly accessed via API or third-party providers
- ×Chinese company origin may raise data governance concerns for some enterprise buyers
- ×Promotional API pricing may increase once the introductory period ends


