On March 16, 2026, Mistral released Small 4, a 119B-parameter Mixture-of-Experts model under the Apache 2.0 license that unifies capabilities previously split across four separate Mistral models.
Small 4 is the first Mistral model to combine instruction following (Mistral Small), reasoning (Magistral), multimodal understanding (Pixtral), and agentic coding (Devstral) into a single architecture.
The MoE design uses 128 experts, of which only 4 are activated per token, so each forward pass touches just 6.5 billion active parameters despite the 119B total. The result is 40% lower latency and three times the throughput of Mistral Small 3.
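The efficiency argument rests on top-k routing: a gating network scores all experts, but only the highest-scoring few run for each token. A minimal sketch of that selection, using the 128/4 figures quoted above (the gate scores and layer sizes here are illustrative stand-ins, not Small 4's actual router):

```python
import math

# Toy top-k MoE router: gate scores for all experts are ranked, and only
# the top_k highest-scoring experts execute, so per-token compute scales
# with top_k rather than with the total expert count.
def route(gate_scores, top_k):
    """Return the indices of the top_k experts for one token."""
    ranked = sorted(range(len(gate_scores)),
                    key=lambda i: gate_scores[i], reverse=True)
    return ranked[:top_k]

num_experts, top_k = 128, 4                  # figures quoted for Small 4
scores = [math.sin(i * 0.37) for i in range(num_experts)]  # stand-in gate logits
active = route(scores, top_k)
assert len(active) == top_k                  # only 4 of 128 experts fire

# Fraction of expert parameters touched per token: 4/128 = 3.125%
print(f"{top_k / num_experts:.4%} of expert parameters active per token")
```

This is why the active-parameter count can sit far below the total: the dense, shared layers always run, but only 4/128 of the expert weights are exercised for any given token.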
On benchmarks, Small 4 matches or surpasses peers such as GPT-OSS 120B and Qwen models across reasoning, coding, and multimodal tasks, a remarkable result given its efficiency advantage.
The model is available through the Mistral API, AI Studio, Hugging Face, and Nvidia's NIM containers, with support for vLLM and llama.cpp inference frameworks.
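For the self-hosted path, vLLM exposes an OpenAI-compatible server out of the box. A sketch of launching it, assuming a hypothetical Hugging Face model ID (check Mistral's Hugging Face organization for the actual repository name, and size tensor parallelism to your GPU count):

```shell
# Hypothetical model ID -- the real repository name may differ.
vllm serve mistralai/Mistral-Small-4 \
  --tensor-parallel-size 4 \
  --max-model-len 32768
```

Once running, any OpenAI-client-compatible tooling can point at the local endpoint, which is what makes the on-premises deployments discussed below practical.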
The Apache 2.0 license lets developers use, modify, and commercially deploy the model with minimal restrictions (attribution and notice requirements still apply), positioning Small 4 as a compelling alternative to closed-source frontier models for organizations that need on-premises or self-hosted deployments.