runlocal.devCheck My GPU →

micro-kiki v3

MoEApache 2.0

Qwen 3.5 35B-A3B base with 35 domain LoRA experts and an automatic router. Embedded engineering specialist with Aeon long-term memory. Built on FineFab; fully open-source LoRA routing reference.

Provider

L'Électron Rare

Parameters

3B active / 35B total (MoE + 35 LoRAs)

Context

131.072K

Released

2026-04-18

VRAM Requirements by Quantization

MethodDisk SizeVRAM RequiredFits GPUs
Q4_K_M19.5 GB21 GB9 GPUs
Q4_018.5 GB20 GB9 GPUs
Q2_K11.5 GB13 GB16 GPUs

Benchmark Scores

mmlu83%
humaneval87%

Scores are approximate and may vary by quantization level.

Compatible GPUs (16)

HuggingFace

LElectronRare/micro-kiki-v3

View on HF →