Gemma 4 31B

Apache 2.0

Google's flagship dense 31B-parameter model with a 256K-token context window. Near-frontier quality and a top open-source performer on code and reasoning benchmarks. Arena Elo: ~1452.

Provider

Google

Parameters

31B

Context

262,144 tokens (256K)

Released

2026-04-08

VRAM Requirements by Quantization

Method	Disk Size	VRAM Required	Compatible GPU Models
Q8_0	31 GB	34 GB	2 GPUs
Q4_K_M	17 GB	19 GB	7 GPUs
Q4_0	15.5 GB	17.5 GB	7 GPUs
Q2_K	9.5 GB	11 GB	14 GPUs
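The table can be turned into a small lookup for choosing a quantization. The sketch below is illustrative only: the figures are copied from the rows above, while the function names `best_quantization` and `bits_per_weight` are hypothetical helpers, and the bits-per-weight estimate assumes 1 GB = 1e9 bytes and exactly 31 billion parameters.

```python
# Rows copied from the table above: (method, disk size in GB, VRAM required in GB).
QUANTIZATIONS = [
    ("Q8_0", 31.0, 34.0),
    ("Q4_K_M", 17.0, 19.0),
    ("Q4_0", 15.5, 17.5),
    ("Q2_K", 9.5, 11.0),
]


def best_quantization(available_vram_gb):
    """Return the least-compressed method whose VRAM requirement fits, or None."""
    fitting = [row for row in QUANTIZATIONS if row[2] <= available_vram_gb]
    if not fitting:
        return None
    # The largest VRAM requirement that still fits is the least-compressed option.
    return max(fitting, key=lambda row: row[2])[0]


def bits_per_weight(disk_gb, params_billions=31.0):
    """Approximate average bits stored per parameter (assumes 1 GB = 1e9 bytes)."""
    return disk_gb * 8 / params_billions
```

For example, `best_quantization(24)` selects Q4_K_M for a 24 GB card, and any budget below 11 GB returns `None`. Actual memory use also grows with context length, so treat the table values as a floor rather than a guarantee.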

Run in terminal:

ollama pull gemma4:31b

Minimum 11 GB VRAM required (at Q2_K quantization). Install Ollama from ollama.com.
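Once the model is pulled, it can also be queried programmatically. The following is a minimal sketch that assumes an Ollama server is running locally on its default port (11434) and that the `gemma4:31b` tag from the command above is available. The helper names `build_request` and `generate` are hypothetical; only the standard library is used.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="gemma4:31b"):
    """Build a non-streaming POST request for Ollama's generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model="gemma4:31b", timeout=300):
    """Send the prompt and return the generated text from the JSON reply."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Explain quantization in one sentence.")` returns the completion as a string. The generous timeout is a guess to allow for the first call, which has to load the weights into memory.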

MMLU: 89%

HumanEval: 82%

Scores are approximate and may vary by quantization level.

HuggingFace

google/gemma-4-31b-it