NVIDIA RTX 5090
Blackwell · 2025
VRAM
32 GB
Bandwidth
1792 GB/s
MSRP
$1,999
Models
22 compatible
Compatible Models (22)
Qwen 3.5 27B
Alibaba · 27B
Q8_0
✓ Full qualityQwen 3.5 3B
Alibaba · 3B
Q8_0
✓ Full qualityQwen 3.5 9B
Alibaba · 9B
Q8_0
✓ Full qualityQwen 3.6 27B
Alibaba · 27B
Q8_0
✓ Full qualityQwen 3.6 35B-A3B
Alibaba · 3B active / 35B total (MoE)
Q4_K_M
✓ Full qualityDeepSeek R1 7B
DeepSeek · 7B
Q8_0
✓ Full qualityGemma 4 27B
Google · 27B (4B active MoE)
Q8_0
✓ Full qualityGemma 4 31B
Google · 31B
Q4_K_M
✓ Full qualityGemma 4 E2B
Google · 2.3B active / 5B total
Q8_0
✓ Full qualityGemma 4 E4B
Google · 4B active (MoE)
Q8_0
✓ Full qualityGemma 4 26B-A4B
Google DeepMind · 26B (4B active MoE)
Q8_0
✓ Full qualitymicro-kiki v3
L'Électron Rare · 3B active / 35B total (MoE + 35 LoRAs)
Q4_K_M
✓ Full qualityPhi-4
Microsoft · 14B
Q8_0
✓ Full qualityMistral Small 3.2
Mistral AI · 22B
Q8_0
✓ Full qualityNemotron-3 Nano Omni 30B-A3B
NVIDIA · 3.5B active / 30B total (Mamba-2 + MoE + Attention)
Q4_K_M
✓ Full qualityMiniCPM 4.6
OpenBMB · 1.2B
Q8_0
✓ Full qualityPoolside Laguna XS.2
Poolside AI · 3B active / 33B total (MoE)
Q4_K_M
✓ Full qualityTernary Bonsai 1.7B
PrismML · 1.7B
1.58-bit (MLX)
✓ Full qualityTernary Bonsai 4B
PrismML · 4B
1.58-bit (MLX)
✓ Full qualityTernary Bonsai 8B
PrismML · 8B
1.58-bit (MLX)
✓ Full qualityZyphra ZAYA1-8B
Zyphra AI · 8B (MoE)
Q8_0
✓ Full qualityQwen 3.5 72B
Alibaba · 72B
Q2_K
⚡ LimitedRequires more VRAM (5)
Llama 4 Scout
Meta
Needs 35GB+
MiniMax M2.7
MiniMax
Needs 31GB+
GLM 4.6
Zhipu AI
Needs 105GB+
GLM 5
Zhipu AI
Needs 215GB+
GLM 5.1
Zhipu AI
Needs 220GB+