Ternary Bonsai 1.7B
Apache 2.0
Edge-class ternary model — runs on phones and small embedded devices. 1.58-bit quantization at 1.7B parameters. MLX packed only today; llama.cpp / vLLM ports in progress.
Provider
PrismML
Parameters
1.7B
Context
32.768K
Released
2026-04-17
VRAM Requirements by Quantization
| Method | Disk Size | VRAM Required | Fits GPUs |
|---|---|---|---|
| 1.58-bit (MLX) | 0.5 GB | 0.6 GB | 15 GPUs |
Benchmark Scores
mmlu54%
humaneval35%
Scores are approximate and may vary by quantization level.
Compatible GPUs (15)
AMD RX 7900 GRE (16GB)AMD RX 7900 XTX (24GB)Apple M4 Pro (24GB) (24GB)Apple M3 Max (36GB) (36GB)Apple M4 Max (48GB) (48GB)NVIDIA RTX 4060 (8GB)NVIDIA RTX 4070 SUPER (12GB)NVIDIA RTX 3080 12GB (12GB)NVIDIA RTX 4080 SUPER (16GB)NVIDIA RTX 4060 Ti 16GB (16GB)NVIDIA RTX 4070 Ti SUPER (16GB)NVIDIA RTX 5080 (16GB)NVIDIA RTX 3090 (24GB)NVIDIA RTX 4090 (24GB)NVIDIA RTX 5090 (32GB)
HuggingFace
PrismML/ternary-bonsai-1.7b