#3 open on Arena AI (Apr 1, 2026 snapshot)

Gemma 4 Models: Complete Specs & Comparison

Google's latest open model family delivers best-in-class performance per parameter. Four powerful variants — from 2B embedded models to 31B dense — all under the permissive Apache 2.0 license.

Overview

What is Gemma 4?

Gemma 4 is Google DeepMind's newest open model family, released in early 2026 under the Apache 2.0 license. Named after the Latin word for "gem," this model family represents Google's most advanced open AI technology to date.

The standout achievement at launch: Gemma 4 31B ranked #3 among open models on Arena AI (Elo 1452 as of Apr 1, 2026), while 26B MoE ranked #6 among open models (Elo 1441). In a later snapshot (Apr 7, 2026), these shifted to #4 and #7, which is why all ranking claims on this site are explicitly date-scoped.

Available in four variants spanning from embedded devices (E2B, about 2.3B active parameters) to research-grade models (31B Dense), Gemma 4 accepts text and image input on every variant, audio on E2B and E4B, and video on 26B MoE and 31B Dense. Whether you're building on a Raspberry Pi or a high-end GPU cluster, there's a Gemma 4 model optimized for your use case.

Technical Specs

Model Specifications

Choose the right model for your hardware. From edge devices to enterprise servers, there's an optimal Gemma 4 variant.

| Model | Total Params | Active Params | Context Window | Modalities | Best For |
| --- | --- | --- | --- | --- | --- |
| E2B | 5.1B | ~2.3B | 128K | Text / Image / Audio | Mobile, Edge devices |
| E4B | 8.0B | ~4.5B | 128K | Text / Image / Audio | Mobile, Raspberry Pi |
| 26B MoE | 26B | ~3.8B | 256K | Text / Image / Video | Low-latency local use |
| 31B Dense | 31B | 31B | 256K | Text / Image / Video | Max quality, fine-tuning |

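As a rough aid for matching a variant to hardware, the sketch below estimates the memory needed just to hold the weights, using the total parameter counts from the specification table. The bytes-per-parameter values and the `weight_memory_gib` helper are illustrative assumptions, not published figures. The estimate ignores the KV cache, activations, and runtime overhead, and it assumes every parameter is resident in memory (for 26B MoE, all experts are counted even though only about 3.8B parameters are active per token).

```python
# Total parameter counts (in billions), copied from the specification table.
TOTAL_PARAMS_B = {"E2B": 5.1, "E4B": 8.0, "26B MoE": 26.0, "31B Dense": 31.0}

# Assumed storage cost per parameter for common precisions (illustrative only).
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(model: str, precision: str) -> float:
    """Estimate GiB needed for the weights alone (no KV cache or activations)."""
    params = TOTAL_PARAMS_B[model] * 1e9
    return params * BYTES_PER_PARAM[precision] / 2**30


if __name__ == "__main__":
    for model in TOTAL_PARAMS_B:
        row = ", ".join(
            f"{precision}: {weight_memory_gib(model, precision):.1f} GiB"
            for precision in BYTES_PER_PARAM
        )
        print(f"{model:10s} {row}")
```

Treat the printed numbers as a lower bound: real deployments need extra headroom that grows with context length and batch size.
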
Key Capabilities

Built for everything

From embedded devices to research environments, Gemma 4 delivers state-of-the-art capabilities across the entire model family.

Advanced Reasoning

State-of-the-art chain-of-thought reasoning for complex problem-solving. Tackle multi-step math problems, logical puzzles, and intricate analytical tasks with unprecedented accuracy.

Agentic Workflows

Built-in function calling, structured JSON output, and robust system instruction following. Deploy reliable AI agents that can interact with external tools and APIs seamlessly.
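To make the function-calling idea concrete, here is a minimal sketch of the application side: it parses a JSON tool call and dispatches it to a local Python function. The `{"name": ..., "arguments": {...}}` shape, the `get_weather` tool, and the `dispatch_tool_call` helper are all hypothetical illustrations, not a documented Gemma 4 format.

```python
import json
from typing import Any, Callable


def get_weather(city: str) -> str:
    """Hypothetical tool; a real one would call a weather service."""
    return f"Weather lookup for {city} is not implemented in this sketch."


# Registry of local functions the model is allowed to call.
TOOLS: dict[str, Callable[..., Any]] = {"get_weather": get_weather}


def dispatch_tool_call(model_output: str) -> Any:
    """Parse a JSON tool call and run the matching local function.

    Assumes the model emitted {"name": ..., "arguments": {...}}.
    """
    try:
        call = json.loads(model_output)
    except json.JSONDecodeError as exc:
        raise ValueError(f"Model output is not valid JSON: {exc}") from exc

    if not isinstance(call, dict):
        raise ValueError("Tool call must be a JSON object")

    name = call.get("name")
    args = call.get("arguments", {})
    if not isinstance(name, str) or name not in TOOLS:
        raise ValueError(f"Unknown tool: {name!r}")
    if not isinstance(args, dict):
        raise ValueError("'arguments' must be a JSON object")

    return TOOLS[name](**args)
```

Validating the name against an explicit registry keeps the model from invoking anything the application did not intentionally expose.
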

Code Generation

Expert-level code generation across a wide range of programming languages. From quick scripts to complex applications, Gemma 4 serves as your capable local code assistant.

Vision & Audio

Native multimodal understanding processes images on every variant, audio on E2B and E4B, and video on 26B MoE and 31B Dense, all alongside text. Analyze diagrams, transcribe speech, and understand visual context in real time.

Long Context

Process up to 256K tokens in a single context window on 26B MoE and 31B Dense (128K on E2B and E4B). Summarize entire codebases, analyze lengthy documents, or maintain coherent conversations over vast amounts of information.
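Before sending a long document, a quick pre-flight check can tell you whether it is likely to fit. The sketch below uses the context windows from the specification table and a crude four-characters-per-token heuristic. The heuristic, the reading of "K" as 1,000 tokens, and the helper names are assumptions; an accurate count requires the model's own tokenizer.

```python
# Context windows from the specification table, in tokens.
# Assumption: "K" is read as 1,000 tokens, the more conservative reading.
CONTEXT_TOKENS = {
    "E2B": 128_000,
    "E4B": 128_000,
    "26B MoE": 256_000,
    "31B Dense": 256_000,
}

CHARS_PER_TOKEN = 4  # crude heuristic, not Gemma 4's actual tokenizer


def estimate_tokens(text: str) -> int:
    """Rough token estimate using ceiling division on the character count."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(text: str, model: str, reserved_for_output: int = 2_000) -> bool:
    """Check whether the prompt plus an output budget fits the model's window."""
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_TOKENS[model]
```

For example, a 600,000-character document is estimated at 150,000 tokens, which passes the check for the 256K variants but not for E2B or E4B.
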

140+ Languages

Trained on an exceptionally diverse multilingual corpus, Gemma 4 communicates fluently across a vast array of languages, from English and Mandarin to Swahili and Gaelic.

Multimodal Deep Dive

Image, video, and audio by model tier

Gemma 4 capability is not uniform across all four models. This breakdown helps choose the right variant before deployment or fine-tuning.

E2B / E4B / 26B / 31B

Image Understanding

  • Variable resolution and aspect ratio input support
  • Strong OCR, chart analysis, and document parsing
  • Useful for receipts, PDFs, and screenshot workflows
Source: Google AI for Developers - Gemma 4 Model Card

26B / 31B

Video Processing

  • Supports video clips (up to around 60 seconds in published examples)
  • Practical for meeting recap and screen-recording summarization
  • Best quality on larger models with longer context
Source: Hugging Face Blog - Gemma 4

E2B / E4B

Audio Input

  • Audio-enabled edge variants for speech-centric workloads
  • Good fit for lightweight transcription and voice QA
  • Can run fully local for privacy-sensitive pipelines
Source: Google AI for Developers - Gemma 4 Model Card
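
The tier breakdown above can be encoded as a small lookup so that a deployment script picks only variants that accept every input type it needs. The `MODALITIES` table mirrors this page; the `models_supporting` helper is a hypothetical name used for illustration.

```python
# Modality support per variant, as listed in the breakdown on this page.
MODALITIES = {
    "E2B": {"text", "image", "audio"},
    "E4B": {"text", "image", "audio"},
    "26B MoE": {"text", "image", "video"},
    "31B Dense": {"text", "image", "video"},
}


def models_supporting(required: set[str]) -> list[str]:
    """Return the variants whose modalities include every required one."""
    return [model for model, caps in MODALITIES.items() if required <= caps]


if __name__ == "__main__":
    print(models_supporting({"image", "audio"}))
    print(models_supporting({"audio", "video"}))
```

Per this breakdown, no single variant covers both audio and video, so a pipeline needing both would have to route requests to two different models.
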

Prompt format example

Multimodal Prompt
<|turn|>user
Describe this image and summarize what is happening:
<|image|>

Then transcribe and translate this short audio clip:
<|audio|>
<|turn|>
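
A prompt in this shape can also be assembled programmatically. The sketch below copies the `<|turn|>`, `<|image|>`, and `<|audio|>` markers exactly as they appear in the example above. Whether they match the tokenizer's real special tokens is an assumption, and `build_user_turn` is a hypothetical helper; in practice, the chat template shipped with the model is the safer source of truth.

```python
# Markers copied verbatim from the prompt format example on this page.
TURN = "<|turn|>"
PLACEHOLDERS = {"image": "<|image|>", "audio": "<|audio|>"}


def build_user_turn(parts: list[tuple[str, str]]) -> str:
    """Assemble one user turn from ("text", content) or (media_kind, "") parts.

    Media parts only emit a placeholder; the actual image or audio data is
    passed to the model separately.
    """
    lines = [f"{TURN}user"]
    for kind, value in parts:
        if kind == "text":
            lines.append(value)
        elif kind in PLACEHOLDERS:
            lines.append(PLACEHOLDERS[kind])
        else:
            raise ValueError(f"Unsupported part kind: {kind!r}")
    lines.append(TURN)
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_user_turn([
        ("text", "Describe this image and summarize what is happening:"),
        ("image", ""),
        ("text", "Then transcribe and translate this short audio clip:"),
        ("audio", ""),
    ]))
```

Because audio input is listed only for E2B and E4B, a prompt that combines image and audio placeholders like this one targets those two variants.
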

Apache 2.0 License

Truly Open, No Strings Attached

  • Commercial use allowed
  • No royalties or fees
  • Modification permitted
  • Distribution allowed
  • Redistribution must keep the license text and existing attribution notices

Use It Your Way

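Unlike models with restrictive licenses, Gemma 4's Apache 2.0 license lets you build commercial products, modify the model weights, and distribute them however you like, all without paying royalties. When you redistribute, include a copy of the license, retain any existing copyright and attribution notices, and mark the files you changed.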

This makes Gemma 4 ideal for startups, researchers, enterprises, and hobbyists alike. Build a paid SaaS product, create a local code assistant, or fine-tune it for your specific domain — the choice is yours.

Evolution

Gemma 4 vs Gemma 3

A significant leap forward in capability, efficiency, and scope. See how Gemma 4 compares to its predecessor.

| Feature | Gemma 3 | Gemma 4 |
| --- | --- | --- |
| Context Window | 32K - 128K | 128K - 256K |
| Parameters | Up to 27B | Up to 31B |
| Architecture | Dense | Dense + MoE |
| Modalities | Text / Image | Text / Image / Audio / Video |
| Languages | 40+ | 140+ |
| Arena AI Rank (date-scoped) | #8 (historical) | Apr 1: #3/#6 open, Apr 7: #4/#7 open |
| Performance/Param | Good | Best-in-class |

Ready to get started?

Follow our quickstart guide to run Gemma 4 locally in minutes. No complex setup required.