Gemma 4 Models: Complete Specs & Comparison
Google's latest open model family delivers best-in-class performance per parameter. Four powerful variants — from 2B embedded models to 31B dense — all under the permissive Apache 2.0 license.
What is Gemma 4?
Gemma 4 is Google DeepMind's newest open model family, released in early 2026 under the Apache 2.0 license. Named after the Latin word for "gem," this model family represents Google's most advanced open AI technology to date.
The standout achievement at launch: Gemma 4 31B ranked #3 among open models on Arena AI (Elo 1452 as of Apr 1, 2026), and the 26B MoE ranked #6 (Elo 1441). In a later snapshot (Apr 7, 2026), they had moved to #4 and #7 among open models, which is why every ranking claim on this site is explicitly date-scoped.
Available in four variants, from edge-oriented models (E2B, with about 2.3B active parameters) to the research-grade 31B Dense, Gemma 4 adds audio and video input alongside text and image: audio on E2B and E4B, video on 26B MoE and 31B Dense. Whether you're building on a Raspberry Pi or a high-end GPU cluster, there's a Gemma 4 model optimized for your use case.
Model Specifications
Choose the right model for your hardware. From edge devices to enterprise servers, there's an optimal Gemma 4 variant.
| Model | Total Params | Active Params | Context Window | Modalities | Best For |
|---|---|---|---|---|---|
| E2B | 5.1B | ~2.3B | 128K | Text / Image / Audio | Mobile, Edge devices |
| E4B | 8.0B | ~4.5B | 128K | Text / Image / Audio | Mobile, Raspberry Pi |
| 26B MoE | 26B | ~3.8B | 256K | Text / Image / Video | Low-latency local use |
| 31B Dense | 31B | 31B | 256K | Text / Image / Video | Max quality, fine-tuning |
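As a rough aid for matching a variant to hardware, the sketch below estimates how much memory the weights alone would occupy at a given precision. It assumes the total parameter counts from the table, counts 1 GB as 10^9 bytes, and ignores the KV cache, activations, and runtime overhead, so real requirements will be higher. It uses total rather than active parameters because a mixture-of-experts model still has to hold every expert in memory. The function names are illustrative, not part of any Gemma tooling.

```python
# Total parameter counts (in billions), copied from the specification table.
TOTAL_PARAMS_B = {
    "E2B": 5.1,
    "E4B": 8.0,
    "26B MoE": 26.0,
    "31B Dense": 31.0,
}


def weight_memory_gb(total_params_b: float, bits_per_param: int) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    return total_params_b * bits_per_param / 8


def variants_within_budget(budget_gb: float, bits_per_param: int) -> list[str]:
    """Return the variants whose weights alone fit in the given memory budget."""
    return [
        name
        for name, params_b in TOTAL_PARAMS_B.items()
        if weight_memory_gb(params_b, bits_per_param) <= budget_gb
    ]


if __name__ == "__main__":
    for bits in (16, 4):
        print(f"{bits}-bit weights, 16 GB budget:", variants_within_budget(16, bits))
```

Treat the result as an upper bound on what might fit, and leave headroom for long contexts in particular.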
Built for everything
From embedded devices to research environments, Gemma 4 delivers state-of-the-art capabilities across the entire model family.
Advanced Reasoning
State-of-the-art chain-of-thought reasoning for complex problem-solving. Tackle multi-step math problems, logical puzzles, and intricate analytical tasks with unprecedented accuracy.
Agentic Workflows
Built-in function calling, structured JSON output, and robust system instruction following. Deploy reliable AI agents that can interact with external tools and APIs seamlessly.
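To make the agent pattern concrete, here is a minimal sketch of the application side of function calling: parsing a JSON tool call from model output and dispatching it to a local function. The `{"name": ..., "arguments": {...}}` shape, the `get_weather` tool, and the `dispatch_tool_call` helper are all assumptions made for illustration; the actual output format depends on the runtime and prompt you use.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical tool; a real one would call a weather service."""
    return f"Weather lookup requested for {city}"


# Hypothetical registry mapping tool names to Python callables.
TOOLS = {"get_weather": get_weather}


def dispatch_tool_call(model_output: str) -> str:
    """Validate a JSON tool call of the assumed shape and run the named tool."""
    try:
        call = json.loads(model_output)
    except json.JSONDecodeError as exc:
        raise ValueError(f"model output is not valid JSON: {exc}") from exc

    if not isinstance(call, dict):
        raise ValueError("tool call must be a JSON object")

    name = call.get("name")
    args = call.get("arguments", {})
    if not isinstance(name, str) or name not in TOOLS:
        raise ValueError(f"unknown tool: {name!r}")
    if not isinstance(args, dict):
        raise ValueError("arguments must be a JSON object")

    return TOOLS[name](**args)


if __name__ == "__main__":
    sample = '{"name": "get_weather", "arguments": {"city": "Oslo"}}'
    print(dispatch_tool_call(sample))
```

Validating before dispatch keeps a malformed or unexpected call from reaching real tools.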
Code Generation
Expert-level code generation across 140+ programming languages. From quick scripts to complex applications, Gemma 4 serves as your capable local code assistant.
Vision & Audio
Native multimodal understanding processes images, audio, and video alongside text. Analyze diagrams, transcribe speech, and understand visual context in real-time.
Long Context
Process up to 256K tokens in a single context window on the 26B MoE and 31B Dense models, or up to 128K on E2B and E4B. Summarize entire codebases, analyze lengthy documents, or maintain coherent conversations over vast amounts of information.
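Before sending a large document, it can help to check whether it is likely to fit. The sketch below is a rough pre-flight check under two loudly stated assumptions: "K" is read as 1,000 tokens, and token count is approximated as about four characters per token, which is only a heuristic for English text. For an exact count, use the tokenizer that ships with the model you deploy. The helper names are illustrative.

```python
import math

# Context limits from the specification table, reading "K" as 1,000 (assumption).
CONTEXT_TOKENS = {
    "E2B": 128_000,
    "E4B": 128_000,
    "26B MoE": 256_000,
    "31B Dense": 256_000,
}


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Heuristic token estimate; not a substitute for the real tokenizer."""
    return math.ceil(len(text) / chars_per_token)


def fits_context(text: str, model: str, reserve_for_output: int = 1024) -> bool:
    """True if the estimated prompt plus an output reserve fits the model's window."""
    return estimate_tokens(text) + reserve_for_output <= CONTEXT_TOKENS[model]


if __name__ == "__main__":
    document = "x" * 600_000  # stand-in for a long document
    for model in CONTEXT_TOKENS:
        print(model, fits_context(document, model))
```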
140+ Languages
Trained on an exceptionally diverse multilingual corpus, Gemma 4 communicates fluently across a vast array of languages, from English and Mandarin to Swahili and Gaelic.
Image, video, and audio by model tier
Gemma 4 capability is not uniform across all four models. This breakdown helps choose the right variant before deployment or fine-tuning.
E2B / E4B / 26B / 31B
Image Understanding
- Variable resolution and aspect ratio input support
- Strong OCR, chart analysis, and document parsing
- Useful for receipts, PDFs, and screenshot workflows
26B / 31B
Video Processing
- Supports video clips (up to around 60 seconds in published examples)
- Practical for meeting recap and screen-recording summarization
- Best quality on larger models with longer context
E2B / E4B
Audio Input
- Audio-enabled edge variants for speech-centric workloads
- Good fit for lightweight transcription and voice QA
- Can run fully local for privacy-sensitive pipelines
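The tier breakdown above can be expressed as a small lookup that filters variants by the inputs a workload needs. This is a sketch that encodes only what this page states (image on all four models, video on 26B/31B, audio on E2B/E4B); the names are illustrative.

```python
# Input modalities per variant, as described in the breakdown above.
SUPPORTED_INPUTS = {
    "E2B": {"text", "image", "audio"},
    "E4B": {"text", "image", "audio"},
    "26B MoE": {"text", "image", "video"},
    "31B Dense": {"text", "image", "video"},
}


def variants_supporting(required: set[str]) -> list[str]:
    """Return the variants whose inputs cover every required modality."""
    return [
        name
        for name, inputs in SUPPORTED_INPUTS.items()
        if required <= inputs
    ]


if __name__ == "__main__":
    print(variants_supporting({"image", "audio"}))
    print(variants_supporting({"video"}))
    print(variants_supporting({"audio", "video"}))
```

According to this breakdown, no single variant accepts both audio and video, so a pipeline needing both would have to combine two models or extract the audio track separately.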
Prompt format example
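A prompt in this shape can be assembled programmatically. The sketch below assumes the `<|turn|>`, `<|image|>`, and `<|audio|>` markers are used exactly as they appear in the sample on this page; it has not been checked against an official chat template, and in practice the template bundled with your runtime should take precedence. `build_user_turn` is a hypothetical helper.

```python
from typing import Optional

# Marker strings copied from the sample on this page (assumed literal).
TURN = "<|turn|>"
IMAGE = "<|image|>"
AUDIO = "<|audio|>"


def build_user_turn(segments: list[tuple[str, Optional[str]]]) -> str:
    """Join text and media placeholders into a single user turn.

    Each segment is (kind, text), where kind is "text", "image", or "audio".
    The text value is ignored for media segments.
    """
    lines = [f"{TURN}user"]
    for kind, text in segments:
        if kind == "text":
            lines.append(text or "")
        elif kind == "image":
            lines.append(IMAGE)
        elif kind == "audio":
            lines.append(AUDIO)
        else:
            raise ValueError(f"unsupported segment kind: {kind!r}")
    lines.append(TURN)  # closing marker, mirroring the sample
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_user_turn([
        ("text", "Describe this image and summarize what is happening:"),
        ("image", None),
        ("text", "Then transcribe and translate this short audio clip:"),
        ("audio", None),
    ]))
```

The raw template it mirrors reads: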
<|turn|>user
Describe this image and summarize what is happening:
<|image|>
Then transcribe and translate this short audio clip:
<|audio|>
<|turn|>
Truly Open, No Strings Attached
- Commercial use allowed
- No royalties or fees
- Modification permitted
- Distribution allowed
- Redistribution only requires including the license text and keeping existing notices
Use It Your Way
Unlike models with restrictive licenses, Gemma 4's Apache 2.0 license lets you build commercial products, modify the model weights, and distribute them however you like, all without paying royalties. When you redistribute, Apache 2.0 asks you to include a copy of the license and keep any existing copyright and attribution notices.
This makes Gemma 4 ideal for startups, researchers, enterprises, and hobbyists alike. Build a paid SaaS product, create a local code assistant, or fine-tune it for your specific domain — the choice is yours.
Gemma 4 vs Gemma 3
A significant leap forward in capability, efficiency, and scope. See how Gemma 4 compares to its predecessor.
| Feature | Gemma 3 | Gemma 4 |
|---|---|---|
| Context Window | 32K - 128K | 128K - 256K |
| Parameters | Up to 27B | Up to 31B |
| Architecture | Dense | Dense + MoE |
| Modalities | Text / Image | Text / Image / Audio / Video |
| Languages | 40+ | 140+ |
| Arena AI Rank (date-scoped) | #8 (historical) | Apr 1: #3/#6 open, Apr 7: #4/#7 open |
| Performance/Param | Good | Best-in-class |
Ready to get started?
Follow our quickstart guide to run Gemma 4 locally in minutes. No complex setup required.