Zyphra ZAYA1-8B

MoE · Apache 2.0

Compact, reasoning-tuned MoE model trained on AMD GPUs and optimized for intelligence density per parameter. Fits comfortably on a single 12 GB card at Q4 quantization.

Provider

Zyphra AI

Parameters

8B (MoE)

Context

65,536 tokens (64K)

Released

2026-05-04

VRAM Requirements by Quantization

Method	Disk Size	VRAM Required	Fits GPUs
Q8_0	8.5 GB	9.5 GB	18 GPUs
Q4_K_M	4.8 GB	6 GB	19 GPUs
Q4_0	4.5 GB	5.6 GB	19 GPUs
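
As a rough illustration of how to read the table above, the sketch below picks the largest quantization whose listed VRAM requirement fits a given card. The figures are copied from the table; the function name `best_quant` and the `headroom_gb` parameter are hypothetical, introduced here only for illustration, and real memory use will also depend on context length and runtime overhead.

```python
# VRAM figures copied from the table above:
# (method, disk size in GB, VRAM required in GB).
QUANTS = [
    ("Q8_0", 8.5, 9.5),
    ("Q4_K_M", 4.8, 6.0),
    ("Q4_0", 4.5, 5.6),
]


def best_quant(vram_gb, headroom_gb=0.0):
    """Return the most demanding quantization that still fits, or None.

    headroom_gb is a hypothetical safety margin reserved for other
    processes; it is an assumption, not something the table specifies.
    """
    usable = vram_gb - headroom_gb
    fitting = [q for q in QUANTS if q[2] <= usable]
    if not fitting:
        return None
    # Prefer the variant with the highest VRAM requirement, on the
    # assumption that it is the least aggressively quantized.
    return max(fitting, key=lambda q: q[2])[0]
```

Under these assumptions, `best_quant(12)` selects Q8_0, `best_quant(8)` selects Q4_K_M, and a card below 5.6 GB yields `None`.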

Run in terminal:

ollama pull zaya1:8b

A minimum of 5.6 GB of VRAM is required (Q4_0). Install Ollama from ollama.com.
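
Once the model is pulled, it can also be queried from a script through Ollama's local HTTP API. The following is a minimal sketch, assuming a default Ollama install listening on `http://localhost:11434` and that the `zaya1:8b` tag from the command above is available; the helper names `build_generate_request` and `generate` are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama's default local address; change it if your server differs.
OLLAMA_HOST = "http://localhost:11434"


def build_generate_request(prompt, model="zaya1:8b", host=OLLAMA_HOST):
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url=f"{host}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model="zaya1:8b", host=OLLAMA_HOST, timeout=300):
    """Send the prompt to a running Ollama server and return the reply text."""
    request = build_generate_request(prompt, model=model, host=host)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    # With "stream": False the server returns a single JSON object whose
    # "response" field holds the generated text.
    return body["response"]
```

Calling `generate("Summarize mixture-of-experts routing in one sentence.")` requires the Ollama server to be running locally with the model already pulled.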

MMLU: 73.5%

HumanEval: 68%

Scores are approximate and may vary by quantization level.

Hugging Face

Zyphra/ZAYA1-8B