Gemma 4 Benchmarks
with source transparency
This page separates official and community evidence for each score. Unknown values stay explicitly marked as pending instead of being backfilled with assumptions.
Core benchmark table
Primary rows are sourced from official model documentation where available.
| Benchmark | Description | Gemma 4 31B | 26B A4B | E4B | E2B | Gemma 3 27B |
|---|---|---|---|---|---|---|
| AIME 2026 | Competition math reasoning | Pending official publication | Pending official publication | Pending official publication | ||
| tau2-bench | Agentic tool-use accuracy | Pending official publication | Pending official publication | Pending official publication | ||
| Arena AI ELO | General conversation quality | Pending official publication | Pending official publication | Pending official publication | Pending official publication | |
| OmniDocBench 1.5 | Document OCR/edit distance (lower is better) | 0.131 Community · confidence medium Media/community summary; validate against official updates when published. | Pending official publication | Pending official publication | Pending official publication |
Cross-generation uplift
AIME 2026
+328.8% uplift
tau2-bench
+1209.1% uplift
Arena ELO competitor context
Competitor rows below are community/media references and should be treated as indicative.
Sources used on this page
Google AI for Developers - Gemma 4 Model Card
official · checked 2026-04-08
Google DeepMind - Gemma 4
official · checked 2026-04-08
Google AI for Developers - Prompt Formatting
official · checked 2026-04-08
Google AI for Developers - Function Calling
official · checked 2026-04-08
vLLM Docs - Gemma 4
official · checked 2026-04-08
Unsloth Docs - Gemma 4 Fine-tuning
official · checked 2026-04-08
Hugging Face Blog - Gemma 4
media · checked 2026-04-08
VentureBeat - Gemma 4 coverage
media · checked 2026-04-08
DEV Community - Gemma 4 deployment guide
community · checked 2026-04-08
Spheron Network - Gemma 4 deployment
community · checked 2026-04-08
Hacker News - Gemma 4 hardware reports
community · checked 2026-04-08
AMD Day-0 support for Gemma 4
official · checked 2026-04-08