EarlyTerms

MiniMax M3

Emergent · Emerged · 13 days old · Last reviewed

MiniMax M3 is a 428B-parameter Mixture-of-Experts large language model from Shanghai-based MiniMax (稀宇科技), activating 22B parameters per token. It is the first open-weight model to combine frontier-level coding, a 1M-token context window, and native multimodal input in a single architecture.

Released June 1, 2026, M3 introduces MSA (MiniMax Sparse Attention), a sparse attention mechanism that cuts per-token compute at one-million-token context to 1/20th of its predecessor M2.7. On SWE-Bench Pro it scores 59.0%, edging past GPT-5.5 (58.6%) and Gemini 3.1 Pro, with weights open-sourced on Hugging Face shortly after API launch.

Think of MSA as a librarian who scans the index before pulling shelves — skipping 95% of stacks to fetch only the relevant pages.

Search Interest

peak ~13K/mo
updated 2026-06-14
~13K/mo ~6.5K/mo 0
2026-05-16 2026-05-31 2026-06-14
Term Lifecycle
  1. Nascent
    0–7 days
  2. Emergent ← now
    8–30 days
  3. Validating
    31–90 days
  4. Rising
    91–180 days
  5. Established
    180 days +

Why is it emerging now?

TL;DR

MiniMax M3 landed June 1 as the first open-weight model pairing frontier coding (59.0% SWE-Bench Pro) with a true 1M-token context and native multimodal input — all at $0.30/M input tokens, roughly 15x cheaper than Claude Opus 4.7. The weights followed on Hugging Face by June 13, removing the last barrier for self-hosted deployment.

6 forces driving coverage — scroll →

Outlook

6-month signal projection and commercial timeline.

Signal high
Revenue strong

Open weights plus 15x cost advantage over Claude Opus 4.7 make M3 the default open-weight baseline for long-context agentic coding through Q3 2026.

Risk · Benchmark self-reporting without independent replication; DeepSWE leaderboard omission leaves credibility gap.

Analogs · deepseek-v3 · kimi-k2-6 · qwen3

Monetization timeline
  1. now
    API live, weights public

    MiniMax API and OpenRouter both live; weights on Hugging Face enable self-hosted deployments.

  2. 3-6mo
    Comparison and fine-tuning tools

    Benchmark comparison sites and LoRA fine-tuning services targeting the 22B-active parameter sweet spot.

  3. 6-12mo
    Enterprise long-context plays

    1M-context codebase analysis and document-processing SaaS built on M3 or its successors.

Competition & Opportunity for term “MiniMax M3”

Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.

Content Gap
10 queries tracked
Led by General (9), Cost breakdown (1)
10 Suggest-only tails — long-tail opening
Revenue Potential
10% commercial-intent queries
2 monetization angles mapped
Mostly informational — pre-commercial
Build Difficulty
Medium
Stage: emergent — early enough to land
1 / 13 default TLDs taken · oldest incumbent minimaxm3.com (2026-06-01)
7 related terms already published
Heuristic · signals: tracked queries, term monetization cards, cluster neighbors

Ideas for term “MiniMax M3”

Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.

Article
MiniMax M3 vs Claude Opus 4.7 vs GPT-5.5: Which Model Wins for Agentic Coding in 2026?

High-intent comparison query with no authoritative benchmark roundup yet. Cover SWE-Bench Pro, BrowseComp, pricing, and context window side by side.

Article
How to Run MiniMax M3 Locally: VRAM Requirements, SGLang Setup, and First Inference

428B MoE weights create concrete deployment questions. Tutorial gap is real — only a handful of posts exist in the first two weeks.

Article
MiniMax Sparse Attention (MSA) Explained: Why 1M Tokens No Longer Costs 1M Tokens

Technical explainer for the architecture innovation behind M3's efficiency claims — underserved despite being the core differentiator.

Article
Best Open-Weight Models for Long-Context Tasks in 2026: M3, Kimi K2.6, DeepSeek V4, and GLM-5.1 Compared

Category comparison anchored on the Chinese open-weight wave — high-volume query with fragmented coverage.

Product
1M-context document analysis SaaS built on MiniMax M3 API

Legal, finance, and medical teams processing 500-page documents need long-context models; M3's $0.60/M token standard rate enables viable unit economics.

Product
MiniMax M3 fine-tuning service for agentic coding workflows

128-expert MoE supports LoRA on 22B activated parameters. Service targeting teams wanting M3-class performance on proprietary codebases.

Video
'MiniMax M3 vs Kimi K2.6 vs DeepSeek V4 Pro: same repo, same prompt, who fixes the bug?' — YouTube coding benchmark livestream

Head-to-head with real codebases beats benchmark screenshots; 20-30 minute format with live screen capture is currently undersupplied.

Post Newsletter / LinkedIn
The Chinese Open-Weight Wave Just Hit a New High-Water Mark With MiniMax M3

Four Chinese labs released frontier open-weight coding models in April 2026. Then MiniMax showed up in June and reset the benchmark.

Post HN / r/LocalLLaMA / personal blog
I Ran MiniMax M3 on a Million-Token Context. Here's What Actually Changed.

Most 'million-token' models struggle past 200K in practice. M3's MSA architecture makes a different bet — and the inference speed numbers back it up.

Post Tech media / YouTube
MiniMax Went Public in January. Its First Post-IPO Model Is Challenging GPT-5.5.

MiniMax listed on the Hong Kong Stock Exchange on January 9, 2026 — and six months later released a model that outscores GPT-5.5 on SWE-Bench Pro for a fraction of the price.

What People Search

Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.

Keyword
Competition
Content Type
minimax m3
Very Low
General
minimax m3 water softener
Very Low
General
minimax m3 water softener price
Very Low
Cost breakdown
minimax m3.0
Very Low
General
minimax m3 release date
Very Low
General
minimax m3 model
Very Low
General
minimax m3 llm
Very Low
General
minimax m3 ai
Very Low
General
1–8 of 10
1 / 2
Updated 2026-06-14 · sources: Google Trends, Google Suggest · Competition is heuristic

SERP of term “MiniMax M3”

What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.

FAQ

What is MiniMax M3?

MiniMax M3 is a 428B-parameter Mixture-of-Experts large language model from Shanghai-based MiniMax (稀宇科技), activating 22B parameters per token.

Why is MiniMax M3 emerging now?

MiniMax M3 landed June 1 as the first open-weight model pairing frontier coding (59.0% SWE-Bench Pro) with a true 1M-token context and native multimodal input — all at $0.30/M input tokens, roughly 15x cheaper than Claude Opus 4.7. The weights followed on Hugging Face by June 13, removing the last barrier for self-hosted deployment.

When did MiniMax M3 emerge?

Publicly emerged around 2026-06-01 (about 13 days ago as of 2026-06-14). EarlyTerms first recorded a pipeline signal on 2026-06-01.

Related Terms

Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.

Explore next
Also mentioned
  • Also known as 稀宇科技
  • Part of MiniMax
  • Related MoE (Mixture of Experts)·SWE-Bench

Sources

Primary URLs this report cites — open any to verify the claim yourself.

  1. 01 MiniMax M3 — Hugging Face model card (428B MoE, 22B active, MSA, 1M context) huggingface.co
  2. 02 MarkTechPost — MiniMax M3 launch coverage (Jun 1, 2026) marktechpost.com
  3. 03 NVIDIA Developer Blog — MiniMax M3 deployment guide developer.nvidia.com
  4. 04 The Decoder — Open-weight 1M-context model analysis the-decoder.com
  5. 05 Artificial Analysis — M3 benchmark review and caveats artificialanalysis.ai
  6. 06 Wikipedia — MiniMax Group (company background, HK IPO Jan 2026) en.wikipedia.org
  7. 07 GitHub — MiniMax-AI/MiniMax-M3 (official repo, created Jun 1 2026) github.com