Qwen 3.5 3B
Qwen
Ultra-compact 3B model for edge devices and low VRAM setups. Runs on 4GB VRAM.
Provider
Alibaba
Parameters
3B
Context
32.768K
Released
2025-09-01
VRAM Requirements by Quantization
| Method | Disk Size | VRAM Required | Fits GPUs |
|---|---|---|---|
| Q8_0 | 3.1 GB | 4 GB | 15 GPUs |
| Q4_K_M | 1.8 GB | 2.5 GB | 15 GPUs |
Install with Ollama
Benchmark Scores
mmlu68.2%
humaneval62.1%
Scores are approximate and may vary by quantization level.
Compatible GPUs (15)
AMD RX 7900 GRE (16GB)AMD RX 7900 XTX (24GB)Apple M4 Pro (24GB) (24GB)Apple M3 Max (36GB) (36GB)Apple M4 Max (48GB) (48GB)NVIDIA RTX 4060 (8GB)NVIDIA RTX 4070 SUPER (12GB)NVIDIA RTX 3080 12GB (12GB)NVIDIA RTX 4080 SUPER (16GB)NVIDIA RTX 4060 Ti 16GB (16GB)NVIDIA RTX 4070 Ti SUPER (16GB)NVIDIA RTX 5080 (16GB)NVIDIA RTX 3090 (24GB)NVIDIA RTX 4090 (24GB)NVIDIA RTX 5090 (32GB)
HuggingFace
Qwen/Qwen3.5-3B-Instruct