
Ternary Bonsai 8B

Apache 2.0

1.58-bit ternary quantization: every weight is one of {-1, 0, +1}, which carries log2(3) ≈ 1.58 bits of information. The memory footprint is roughly 1/9 that of FP16 at the same parameter count. Currently available in MLX 2-bit packed format only; other backends coming soon.
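To make the "2-bit packed" idea concrete, here is a minimal sketch of storing ternary weights as 2-bit codes, four per byte. The code mapping (-1 → 0, 0 → 1, +1 → 2) and the function names `pack_ternary` and `unpack_ternary` are assumptions chosen for illustration; the page does not describe the actual MLX layout, which may differ.

```python
import math

# Hypothetical 2-bit encoding; the real MLX layout is not specified on this page.
ENCODE = {-1: 0, 0: 1, 1: 2}
DECODE = {code: weight for weight, code in ENCODE.items()}

# Information content of one ternary weight, in bits (about 1.585).
BITS_PER_TERNARY_WEIGHT = math.log2(3)


def pack_ternary(weights):
    """Pack ternary weights into bytes, four 2-bit codes per byte."""
    packed = bytearray()
    for start in range(0, len(weights), 4):
        byte = 0
        for offset, w in enumerate(weights[start:start + 4]):
            byte |= ENCODE[w] << (2 * offset)
        packed.append(byte)
    return bytes(packed)


def unpack_ternary(packed, count):
    """Recover the first `count` ternary weights from packed bytes."""
    weights = []
    for byte in packed:
        for offset in range(4):
            if len(weights) == count:
                return weights
            weights.append(DECODE[(byte >> (2 * offset)) & 0b11])
    return weights


weights = [-1, 0, 1, 1, 0, -1]
packed = pack_ternary(weights)
restored = unpack_ternary(packed, len(weights))
```

Six weights fit in two bytes here. Storing each weight in a full 2-bit slot is simpler to decode than a denser base-3 packing, at the cost of using 2 bits where about 1.58 would suffice in theory.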

Provider

PrismML

Parameters

8B

Context

32,768 tokens

Released

2026-04-17

VRAM Requirements by Quantization

Method          Disk Size   VRAM Required   Fits GPUs
1.58-bit (MLX)  1.6 GB      2 GB            15 GPUs
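The sizes above can be sanity-checked with back-of-the-envelope arithmetic. The sketch below assumes exactly 8e9 parameters and decimal gigabytes (1 GB = 1e9 bytes), and it counts weight storage only: quantization scales, any layers kept at higher precision, the KV cache, and activations are all ignored. The helper names `weight_size_gb` and `fits` are hypothetical.

```python
import math

NUM_PARAMS = 8e9  # assumption: "8B" taken as exactly 8 billion parameters


def weight_size_gb(num_params, bits_per_weight):
    """Weight storage in decimal GB for a given bits-per-weight."""
    return num_params * bits_per_weight / 8 / 1e9


fp16_gb = weight_size_gb(NUM_PARAMS, 16)                     # 16-bit baseline
ternary_ideal_gb = weight_size_gb(NUM_PARAMS, math.log2(3))  # ~1.58 bits/weight
packed_2bit_gb = weight_size_gb(NUM_PARAMS, 2)               # 2 bits/weight


def fits(vram_gb, required_gb=2.0):
    """True if a GPU with `vram_gb` meets the listed VRAM requirement."""
    return vram_gb >= required_gb


print(f"FP16:          {fp16_gb:.2f} GB")
print(f"Ternary ideal: {ternary_ideal_gb:.2f} GB")
print(f"2-bit packed:  {packed_2bit_gb:.2f} GB")
```

Under these assumptions the ideal ternary figure comes out near the 1.6 GB disk size listed in the table, and the 2-bit figure matches the 2 GB VRAM requirement. Real-world usage will be higher once the context cache and runtime overhead are included.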

Benchmark Scores

MMLU: 68%
HumanEval: 58.5%

Scores are approximate and may vary by quantization level.

Compatible GPUs (15)

HuggingFace

PrismML/ternary-bonsai-8b
