runlocal.devCheck My GPU →

GLM 5

MoEGLM

Zhipu's 744B frontier MoE. $1.0/M tokens via API. Cluster-scale deployment only; expect 200GB+ VRAM even at aggressive quantization.

Provider

Zhipu AI

Parameters

744B

Context

128K

Released

2026-04-08

VRAM Requirements by Quantization

MethodDisk SizeVRAM RequiredFits GPUs
Q4_K_M370 GB400 GB0 GPUs
Q2_K195 GB215 GB0 GPUs

Benchmark Scores

mmlu87%
humaneval89.5%

Scores are approximate and may vary by quantization level.

HuggingFace

zai-org/GLM-5

View on HF →