MiniMax M3

Emergent · Emerged 2026-06-01 · 13 days old · Last reviewed 2026-06-01

MiniMax M3 is a 428B-parameter Mixture-of-Experts large language model from Shanghai-based MiniMax (稀宇科技), activating 22B parameters per token. It is the first open-weight model to combine frontier-level coding, a 1M-token context window, and native multimodal input in a single architecture.

Released June 1, 2026, M3 introduces MSA (MiniMax Sparse Attention), a sparse attention mechanism that cuts per-token compute at one-million-token context to 1/20th of its predecessor M2.7. On SWE-Bench Pro it scores 59.0%, edging past GPT-5.5 (58.6%) and Gemini 3.1 Pro, with weights open-sourced on Hugging Face shortly after API launch.

Think of MSA as a librarian who scans the index before pulling shelves — skipping 95% of stacks to fetch only the relevant pages.

Search Interest

peak ~13K/mo

updated 2026-06-14

~13K/mo ~6.5K/mo 0

2026-05-16 2026-05-31 2026-06-14

Term Lifecycle

Nascent

0–7 days
Emergent ← now

8–30 days
Validating

31–90 days
Rising

91–180 days
Established

180 days +

Why is it emerging now?

TL;DR

MiniMax M3 landed June 1 as the first open-weight model pairing frontier coding (59.0% SWE-Bench Pro) with a true 1M-token context and native multimodal input — all at $0.30/M input tokens, roughly 15x cheaper than Claude Opus 4.7. The weights followed on Hugging Face by June 13, removing the last barrier for self-hosted deployment.

6 forces driving coverage — scroll →

MarkTechPost

MiniMax releases M3 with MSA, 1M-Token Context, native multimodality

59.0% SWE-Bench Pro, 66.0% Terminal-Bench 2.1, 70.06% OSWorld-Verified; MSA delivers 9x prefill and 15x decoding speedup at 1M context.

Jun 1, 2026

NVIDIA Developer Blog

MiniMax M3 on NVIDIA: 428B MoE, 1M context, TensorRT-LLM support

428B total / 22B activated, 128 experts 4-active; trained on ~100 trillion interleaved tokens. TensorRT-LLM, SGLang, vLLM inference backends confirmed.

Jun 2026

The Decoder

MiniMax M3: open-weight 1M-context model challenges proprietary leaders

BrowseComp 83.5 beats Claude Opus 4.7 (79.3). Output speed ~100 tokens/second, roughly 3x faster than Opus at equivalent context length.

Jun 2026

Artificial Analysis

MiniMax-M3: leading open-weights model once weights release

AA-Omniscience caveat: M3 attempts only 30.9% of questions (low hallucination, low accuracy). License may restrict commercial use as M2.7 did.

Jun 2026

MiniMax-AI/MiniMax-M3

Official model repo — created June 1, weights published by June 13

250 stars

Y Hacker News

MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

Jun 1, 2026 21 points · 2 comments

Outlook

6-month signal projection and commercial timeline.

Signal high

Revenue strong

Open weights plus 15x cost advantage over Claude Opus 4.7 make M3 the default open-weight baseline for long-context agentic coding through Q3 2026.

Risk · Benchmark self-reporting without independent replication; DeepSWE leaderboard omission leaves credibility gap.

Analogs · deepseek-v3 · kimi-k2-6 · qwen3

Monetization timeline

now

API live, weights public

MiniMax API and OpenRouter both live; weights on Hugging Face enable self-hosted deployments.
3-6mo

Comparison and fine-tuning tools

Benchmark comparison sites and LoRA fine-tuning services targeting the 22B-active parameter sweet spot.
6-12mo

Enterprise long-context plays

1M-context codebase analysis and document-processing SaaS built on M3 or its successors.

Competition & Opportunity for term “MiniMax M3”

Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.

Content Gap

10 queries tracked

Led by General (9), Cost breakdown (1)

10 Suggest-only tails — long-tail opening

Revenue Potential

10% commercial-intent queries

2 monetization angles mapped

Mostly informational — pre-commercial

Build Difficulty

Medium

Stage: emergent — early enough to land

1 / 13 default TLDs taken · oldest incumbent minimaxm3.com (2026-06-01)

7 related terms already published

Heuristic · signals: tracked queries, term monetization cards, cluster neighbors

Ideas for term “MiniMax M3”

Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.

Article

MiniMax M3 vs Claude Opus 4.7 vs GPT-5.5: Which Model Wins for Agentic Coding in 2026?

High-intent comparison query with no authoritative benchmark roundup yet. Cover SWE-Bench Pro, BrowseComp, pricing, and context window side by side.

Article

How to Run MiniMax M3 Locally: VRAM Requirements, SGLang Setup, and First Inference

428B MoE weights create concrete deployment questions. Tutorial gap is real — only a handful of posts exist in the first two weeks.

Article

MiniMax Sparse Attention (MSA) Explained: Why 1M Tokens No Longer Costs 1M Tokens

Technical explainer for the architecture innovation behind M3's efficiency claims — underserved despite being the core differentiator.

Article

Best Open-Weight Models for Long-Context Tasks in 2026: M3, Kimi K2.6, DeepSeek V4, and GLM-5.1 Compared

Category comparison anchored on the Chinese open-weight wave — high-volume query with fragmented coverage.

Product

1M-context document analysis SaaS built on MiniMax M3 API

Legal, finance, and medical teams processing 500-page documents need long-context models; M3's $0.60/M token standard rate enables viable unit economics.

Product

MiniMax M3 fine-tuning service for agentic coding workflows

128-expert MoE supports LoRA on 22B activated parameters. Service targeting teams wanting M3-class performance on proprietary codebases.

Video

'MiniMax M3 vs Kimi K2.6 vs DeepSeek V4 Pro: same repo, same prompt, who fixes the bug?' — YouTube coding benchmark livestream

Head-to-head with real codebases beats benchmark screenshots; 20-30 minute format with live screen capture is currently undersupplied.

Post Newsletter / LinkedIn

The Chinese Open-Weight Wave Just Hit a New High-Water Mark With MiniMax M3

Four Chinese labs released frontier open-weight coding models in April 2026. Then MiniMax showed up in June and reset the benchmark.

Post HN / r/LocalLLaMA / personal blog

I Ran MiniMax M3 on a Million-Token Context. Here's What Actually Changed.

Most 'million-token' models struggle past 200K in practice. M3's MSA architecture makes a different bet — and the inference speed numbers back it up.

Post Tech media / YouTube

MiniMax Went Public in January. Its First Post-IPO Model Is Challenging GPT-5.5.

MiniMax listed on the Hong Kong Stock Exchange on January 9, 2026 — and six months later released a model that outscores GPT-5.5 on SWE-Bench Pro for a fraction of the price.

What People Search

Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.

Keyword

Competition

Content Type

minimax m3

Very Low

General

minimax m3 water softener

Very Low

General

minimax m3 water softener price

Very Low

Cost breakdown

minimax m3.0

Very Low

General

minimax m3 release date

Very Low

General

minimax m3 model

Very Low

General

minimax m3 llm

Very Low

General

minimax m3 ai

Very Low

General

1–8 of 10

1 / 2

Updated 2026-06-14 · sources: Google Trends, Google Suggest · Competition is heuristic

SERP of term “MiniMax M3”

What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.

FAQ

What is MiniMax M3?

MiniMax M3 is a 428B-parameter Mixture-of-Experts large language model from Shanghai-based MiniMax (稀宇科技), activating 22B parameters per token.

Why is MiniMax M3 emerging now?

When did MiniMax M3 emerge?

Publicly emerged around 2026-06-01 (about 13 days ago as of 2026-06-14). EarlyTerms first recorded a pipeline signal on 2026-06-01.

Related Terms

Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.

Explore next

Also mentioned

Also known as 稀宇科技
Part of MiniMax
Related MoE (Mixture of Experts)·SWE-Bench

Sources

Primary URLs this report cites — open any to verify the claim yourself.

Domain Availability

minimaxm3.com
minimaxm3.ai
minimaxm3.net
minimaxm3.io
minimaxm3.co
minimaxm3.app
minimaxm3.pro
minimaxm3.top
minimaxm3.org
minimaxm3.info
minimaxm3.xyz
minimaxm3.run
minimaxm3.me
minimax-m3.com
minimax-m3.ai
minimax-m3.net
minimax-m3.io
minimax-m3.co
minimax-m3.app
minimax-m3.pro
minimax-m3.top
minimax-m3.org
minimax-m3.info
minimax-m3.xyz
minimax-m3.run
minimax-m3.me

Checked via RDAP — live from your browser.

EarlyTerms Weekly

5–8 new terms every Tuesday. Research, story angles, buildable ideas — straight to your inbox.

Join the waitlist for issue #1. No spam.

Search Interest

Why is it emerging now?

Outlook

Competition & Opportunity for term “MiniMax M3”

Ideas for term “MiniMax M3”

What People Search

SERP of term “MiniMax M3”

FAQ

Related Terms

Sources

Full access is a paid feature