#3 open on Arena AI (Apr 1, 2026 snapshot)

Gemma 4 Models: Complete Specs & Comparison

Google's latest open model family delivers best-in-class performance per parameter. It ships in four variants, from the E2B edge model (about 2.3B active parameters) to the 31B dense model, all under the permissive Apache 2.0 license.


Overview

What is Gemma 4?

Gemma 4 is Google DeepMind's newest open model family, released in early 2026 under the Apache 2.0 license. Named after the Latin word for "gem," this model family represents Google's most advanced open AI technology to date.

The standout achievement at launch: Gemma 4 31B ranked #3 among open models on Arena AI (Elo 1452 as of Apr 1, 2026), while the 26B MoE ranked #6 (Elo 1441). In a later snapshot (Apr 7, 2026), they had shifted to #4 and #7 among open models, which is why every ranking claim on this site is explicitly date-scoped.

Available in four variants, from embedded-class models (E2B, with about 2.3B active parameters) to the research-grade 31B Dense, Gemma 4 covers text, image, audio, and video input across the family: the E2B and E4B variants accept audio, and the 26B MoE and 31B Dense variants accept video. Whether you're building on a Raspberry Pi or a high-end GPU cluster, there's a Gemma 4 model suited to your use case.

Technical Specs

Model Specifications

Choose the right model for your hardware. From edge devices to enterprise servers, there's an optimal Gemma 4 variant.

Model	Total Params	Active Params	Context Window	Modalities	Best For
E2B	5.1B	~2.3B	128K	Text / Image / Audio	Mobile, Edge devices
E4B	8.0B	~4.5B	128K	Text / Image / Audio	Mobile, Raspberry Pi
26B MoE	26B	~3.8B	256K	Text / Image / Video	Low-latency local use
31B Dense	31B	31B	256K	Text / Image / Video	Max quality, fine-tuning
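As a rough aid for matching a variant to your hardware, the sketch below estimates weight memory from the total parameter counts in the table above. It assumes the weights dominate memory and ignores the KV cache, activations, and runtime overhead, so treat the results as lower bounds. The names TOTAL_PARAMS_B, weight_memory_gb, and variants_that_fit are illustrative and not part of any Gemma tooling.

```python
# Total parameter counts (in billions), copied from the specification table.
TOTAL_PARAMS_B = {
    "E2B": 5.1,
    "E4B": 8.0,
    "26B MoE": 26.0,
    "31B Dense": 31.0,
}


def weight_memory_gb(total_params_b: float, bits_per_param: int) -> float:
    """Lower-bound memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    bytes_per_param = bits_per_param / 8
    return total_params_b * 1e9 * bytes_per_param / 1e9


def variants_that_fit(memory_gb: float, bits_per_param: int) -> list[str]:
    """Variants whose weights alone fit in memory_gb at the given precision."""
    return [
        name
        for name, params in TOTAL_PARAMS_B.items()
        if weight_memory_gb(params, bits_per_param) <= memory_gb
    ]


if __name__ == "__main__":
    # Print 16-bit and 4-bit weight footprints for each variant.
    for name, params in TOTAL_PARAMS_B.items():
        print(name, weight_memory_gb(params, 16), weight_memory_gb(params, 4))
```

The estimate uses total rather than active parameters because a mixture-of-experts model generally still needs all of its weights loaded, even though only a fraction is used per token.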

Key Capabilities

Built for everything

From embedded devices to research environments, Gemma 4 delivers state-of-the-art capabilities across the entire model family.

Advanced Reasoning

State-of-the-art chain-of-thought reasoning for complex problem-solving. Tackle multi-step math problems, logical puzzles, and intricate analytical tasks with unprecedented accuracy.

Agentic Workflows

Built-in function calling, structured JSON output, and robust system instruction following. Deploy reliable AI agents that can interact with external tools and APIs seamlessly.
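To make the agent loop concrete, here is a minimal sketch of dispatching a structured JSON tool call. It assumes the model has been prompted to reply with an object of the form {"name": ..., "arguments": {...}}. That shape, the get_weather tool, and the dispatch_tool_call helper are hypothetical and do not represent Gemma 4's documented calling format.

```python
import json
from typing import Any, Callable


def get_weather(city: str) -> dict[str, Any]:
    """Hypothetical tool; a real one would call an external API."""
    return {"city": city, "forecast": "unknown"}


# Registry mapping tool names the model may request to Python callables.
TOOLS: dict[str, Callable[..., Any]] = {"get_weather": get_weather}


def dispatch_tool_call(model_output: str) -> Any:
    """Parse a JSON tool call and run the matching tool.

    Raises ValueError for malformed JSON, a missing field, or an unknown
    tool, so the caller can decide whether to re-prompt the model.
    """
    try:
        call = json.loads(model_output)
    except json.JSONDecodeError as exc:
        raise ValueError(f"model output is not valid JSON: {exc}") from exc

    if not isinstance(call, dict) or not isinstance(call.get("name"), str):
        raise ValueError("tool call must be an object with a string 'name'")

    name = call["name"]
    if name not in TOOLS:
        raise ValueError(f"unknown tool: {name!r}")

    arguments = call.get("arguments", {})
    if not isinstance(arguments, dict):
        raise ValueError("'arguments' must be a JSON object")
    return TOOLS[name](**arguments)
```

Validating the call before executing it keeps a malformed or unexpected model reply from reaching your tools.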

Code Generation

Expert-level code generation across a wide range of programming languages. From quick scripts to complex applications, Gemma 4 serves as your capable local code assistant.

Vision & Audio

Native multimodal understanding processes images alongside text on every variant, with audio input on E2B and E4B and video input on 26B MoE and 31B Dense. Analyze diagrams, transcribe speech, and understand visual context.

Long Context

Process up to 256K tokens in a single context window on the 26B MoE and 31B Dense variants (128K on E2B and E4B). Summarize entire codebases, analyze lengthy documents, or maintain coherent conversations over vast amounts of information.
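Before sending a large document, it helps to check whether it is likely to fit. The sketch below uses the context window sizes from the specification table and a crude estimate of about 4 characters per token. That ratio is an assumption for English text, as is reading "K" as 1,000, so use the model's own tokenizer when you need exact counts. The helper names are illustrative.

```python
# Context window sizes from the specification table (assuming K = 1,000).
CONTEXT_WINDOW_TOKENS = {
    "E2B": 128_000,
    "E4B": 128_000,
    "26B MoE": 256_000,
    "31B Dense": 256_000,
}

CHARS_PER_TOKEN = 4  # assumption: rough average for English text


def estimate_tokens(text: str) -> int:
    """Rough token estimate, rounded up; not a substitute for a tokenizer."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_context(text: str, variant: str, reserved_for_output: int = 2_000) -> bool:
    """True if the estimated prompt size leaves room for the reply."""
    budget = CONTEXT_WINDOW_TOKENS[variant] - reserved_for_output
    return estimate_tokens(text) <= budget
```

Reserving part of the window for the reply matters because prompt and generated tokens share the same context.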

140+ Languages

Trained on an exceptionally diverse multilingual corpus, Gemma 4 communicates fluently across a vast array of languages, from English and Mandarin to Swahili and Gaelic.

Multimodal Deep Dive

Image, video, and audio by model tier

Gemma 4 capability is not uniform across all four models. This breakdown helps choose the right variant before deployment or fine-tuning.

E2B / E4B / 26B / 31B

Image Understanding

Variable resolution and aspect ratio input support
Strong OCR, chart analysis, and document parsing
Useful for receipts, PDFs, and screenshot workflows

Source: Google AI for Developers - Gemma 4 Model Card

26B / 31B

Video Processing

Supports video clips (up to around 60 seconds in published examples)
Practical for meeting recap and screen-recording summarization
Best quality on larger models with longer context

Source: Hugging Face Blog - Gemma 4

E2B / E4B

Audio Input

Audio-enabled edge variants for speech-centric workloads
Good fit for lightweight transcription and voice QA
Can run fully local for privacy-sensitive pipelines

Source: Google AI for Developers - Gemma 4 Model Card
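The tier breakdown above can be expressed as a small lookup, which is handy when a pipeline has to pick a variant from the inputs it needs. The modality sets mirror the specification table on this page; the MODALITIES dictionary and variants_supporting function are illustrative names.

```python
# Input modalities per variant, as listed in the specification table.
MODALITIES = {
    "E2B": {"text", "image", "audio"},
    "E4B": {"text", "image", "audio"},
    "26B MoE": {"text", "image", "video"},
    "31B Dense": {"text", "image", "video"},
}


def variants_supporting(*required: str) -> list[str]:
    """Return the variants that accept every requested modality."""
    needed = {m.lower() for m in required}
    return [name for name, supported in MODALITIES.items() if needed <= supported]
```

For example, variants_supporting("audio", "video") returns an empty list, because per the table no single variant accepts both.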

Prompt format example

Multimodal Prompt

<|turn|>user
Describe this image and summarize what is happening:
<|image|>

Then transcribe and translate this short audio clip:
<|audio|>
<|turn|>
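If you assemble prompts like this by hand, a small helper keeps the layout consistent. The sketch below only reproduces the example layout shown above. The marker strings are copied from that example and have not been checked against the released tokenizer, and build_user_turn is a hypothetical helper. In practice, prefer the chat template shipped with the model.

```python
TURN = "<|turn|>"
PLACEHOLDERS = {"image": "<|image|>", "audio": "<|audio|>"}


def build_user_turn(segments: list[tuple[str, str]]) -> str:
    """Build one user turn from (instruction, modality) pairs.

    Each instruction is followed by the placeholder for its modality,
    and segments are separated by a blank line, as in the example above.
    """
    blocks = []
    for instruction, modality in segments:
        if modality not in PLACEHOLDERS:
            raise ValueError(f"unsupported modality: {modality!r}")
        blocks.append(f"{instruction}\n{PLACEHOLDERS[modality]}")
    body = "\n\n".join(blocks)
    return f"{TURN}user\n{body}\n{TURN}"


if __name__ == "__main__":
    print(build_user_turn([
        ("Describe this image and summarize what is happening:", "image"),
        ("Then transcribe and translate this short audio clip:", "audio"),
    ]))
```

Because this prompt mixes image and audio input, it applies to the E2B and E4B variants.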

Apache 2.0 License

Truly Open, No Strings Attached

Commercial use allowed
No royalties or fees
Modification permitted
Distribution allowed
License and attribution notices retained on redistribution

Use It Your Way

Unlike models with restrictive licenses, Gemma 4's Apache 2.0 license lets you build commercial products, modify the model weights, and distribute them without paying royalties. When you redistribute, the license asks only that you include a copy of it and preserve existing copyright and attribution notices.

This makes Gemma 4 ideal for startups, researchers, enterprises, and hobbyists alike. Build a paid SaaS product, create a local code assistant, or fine-tune it for your specific domain — the choice is yours.

Read the full Apache 2.0 license

Evolution

Gemma 4 vs Gemma 3

A significant leap forward in capability, efficiency, and scope. See how Gemma 4 compares to its predecessor.

Feature	Gemma 3	Gemma 4
Context Window	32K - 128K	128K - 256K
Parameters	Up to 27B	Up to 31B
Architecture	Dense	Dense + MoE
Modalities	Text / Image	Text / Image / Audio / Video
Languages	40+	140+
Arena AI Rank (date-scoped)	#8 (historical)	Apr 1, 2026: #3/#6 open; Apr 7, 2026: #4/#7 open (31B / 26B MoE)
Performance/Param	Good	Best-in-class

Ready to get started?

Follow our quickstart guide to run Gemma 4 locally in minutes. No complex setup required.

