GLM 5.1
MoE, GLM
Zhipu AI's 754B-parameter flagship MoE model. Available via API at $1.4 per million tokens; strong on agentic coding benchmarks. It cannot be run locally on consumer hardware and is included for completeness alongside GLM-5.
Provider: Zhipu AI
Parameters: 754B
Context: 128K
Released: 2026-04-08
VRAM Requirements by Quantization
| Method | Disk Size | VRAM Required | Fits GPUs |
|---|---|---|---|
| Q4_K_M | 380 GB | 410 GB | 0 GPUs |
| Q2_K | 200 GB | 220 GB | 0 GPUs |
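The arithmetic behind the table can be sketched as follows. This is a rough illustration, not the site's actual method: it derives the gap between disk size and VRAM, the effective bits per weight implied by the listed disk sizes, and a lower bound on how many GPUs would be needed if memory pooled perfectly across cards. The 24 GB per-GPU figure, the assumption that 1 GB = 10^9 bytes, and the helper names are all assumptions made for the example; real multi-GPU setups need extra memory for KV cache and per-device buffers.

```python
import math

# Rows copied from the table above: (method, disk size in GB, VRAM required in GB).
ROWS = [
    ("Q4_K_M", 380, 410),
    ("Q2_K", 200, 220),
]

PARAMS_BILLIONS = 754  # parameter count listed on this page


def effective_bits_per_weight(disk_gb: float, params_billions: float) -> float:
    """Average bits stored per parameter, assuming 1 GB = 10**9 bytes."""
    return disk_gb * 8 / params_billions


def min_gpu_count(vram_required_gb: float, per_gpu_gb: float) -> int:
    """Lower bound on GPU count, assuming memory pools perfectly across cards."""
    return math.ceil(vram_required_gb / per_gpu_gb)


def summarize(rows=ROWS, per_gpu_gb: float = 24):
    """Return one dict per quantization row; per_gpu_gb is a hypothetical card size."""
    summary = []
    for method, disk_gb, vram_gb in rows:
        summary.append({
            "method": method,
            "overhead_gb": vram_gb - disk_gb,
            "bits_per_weight": round(
                effective_bits_per_weight(disk_gb, PARAMS_BILLIONS), 2
            ),
            "min_gpus": min_gpu_count(vram_gb, per_gpu_gb),
        })
    return summary


if __name__ == "__main__":
    for row in summarize():
        print(row)
```

Under these assumptions, even the Q2_K build would need at least 10 cards of 24 GB each, which is consistent with the page's statement that the model is not runnable on consumer hardware.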
Benchmark Scores
MMLU: 87.2%
HumanEval: 91.5%
Scores are approximate and may vary by quantization level.
HuggingFace: zai-org/GLM-5.1