Check My GPU →

Ternary Bonsai 1.7B

Apache 2.0

Edge-class ternary model — runs on phones and small embedded devices. 1.58-bit quantization at 1.7B parameters. MLX packed only today; llama.cpp / vLLM ports in progress.

Provider

PrismML

Parameters

1.7B

Context

32.768K

Released

2026-04-17

VRAM Requirements by Quantization

Method	Disk Size	VRAM Required	Fits GPUs
1.58-bit (MLX)	0.5 GB	0.6 GB	19 GPUs

Benchmark Scores

mmlu54%

humaneval35%

Scores are approximate and may vary by quantization level.

Compatible GPUs (19)

AMD RX 9070 XT (16GB)AMD RX 7900 GRE (16GB)AMD RX 7900 XTX (24GB)AMD Ryzen AI Max+ 395 (unified memory) (64GB)Apple M4 Pro (24GB) (24GB)Apple M3 Max (36GB) (36GB)Apple M4 Max (48GB) (48GB)Apple M4 Ultra (64GB) (64GB)NVIDIA RTX 4060 (8GB)NVIDIA RTX 3080 12GB (12GB)NVIDIA RTX 4070 SUPER (12GB)NVIDIA RTX 4070 Ti SUPER (16GB)NVIDIA RTX 4080 SUPER (16GB)NVIDIA RTX 5070 Ti (16GB)NVIDIA RTX 4060 Ti 16GB (16GB)NVIDIA RTX 5080 (16GB)NVIDIA RTX 4090 (24GB)NVIDIA RTX 3090 (24GB)NVIDIA RTX 5090 (32GB)

HuggingFace

PrismML/ternary-bonsai-1.7b