mlx-optiq

Quantize

A four-step wizard around the OptIQ convert pipeline.

Paste a Hugging Face (HF) model id → pick a target bits-per-weight (BPW) and a reference mode → watch live progress through the sensitivity, knapsack, and convert phases → save locally and, optionally, push to your HF account with one click. The wizard detects supported architectures (Qwen3.5/3.6, Gemma-4) and warns on untested ones.

[Screenshot: Quantize wizard]

The per-layer sensitivity pass and multi-tier bit allocation are identical to those run by the CLI command optiq convert; the Lab only adds a progress UI and a save/push button.
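To make the idea behind the sensitivity and knapsack phases concrete, here is a minimal sketch of sensitivity-driven bit allocation under a BPW budget. It is not OptIQ's actual implementation: the Layer class, the allocate_bits function, the bit tiers (2, 4, 8), and the error numbers are all hypothetical, and a simple greedy heuristic stands in for whatever knapsack solver the pipeline really uses.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    name: str
    n_params: int
    # Hypothetical sensitivity data: estimated output error at each bit width.
    error: dict


def allocate_bits(layers, target_bpw, tiers=(2, 4, 8)):
    """Greedy multi-tier allocation (illustrative stand-in for a knapsack solve).

    Every layer starts at the lowest tier. The layer whose next upgrade buys
    the largest error reduction per extra bit is upgraded repeatedly, as long
    as the total stays within target_bpw * total_params.
    """
    tiers = sorted(tiers)
    total_params = sum(layer.n_params for layer in layers)
    budget = target_bpw * total_params
    used = tiers[0] * total_params
    if used > budget:
        raise ValueError("target BPW is below the lowest available tier")

    choice = {layer.name: 0 for layer in layers}  # index into tiers
    while True:
        best = None
        for layer in layers:
            i = choice[layer.name]
            if i + 1 >= len(tiers):
                continue  # already at the highest tier
            extra = (tiers[i + 1] - tiers[i]) * layer.n_params
            if used + extra > budget:
                continue  # this upgrade would exceed the budget
            gain = (layer.error[tiers[i]] - layer.error[tiers[i + 1]]) / extra
            if gain > 0 and (best is None or gain > best[0]):
                best = (gain, layer, extra)
        if best is None:
            break  # no affordable, beneficial upgrade remains
        _, layer, extra = best
        choice[layer.name] += 1
        used += extra

    bits = {name: tiers[i] for name, i in choice.items()}
    return bits, used / total_params


# Toy model with made-up sizes and sensitivities.
layers = [
    Layer("embed", 1000, {2: 9.0, 4: 3.0, 8: 0.5}),
    Layer("attn", 2000, {2: 4.0, 4: 1.0, 8: 0.2}),
    Layer("mlp", 4000, {2: 2.0, 4: 0.8, 8: 0.3}),
]
bits, achieved_bpw = allocate_bits(layers, target_bpw=4.0)
print(bits, round(achieved_bpw, 3))
```

In this toy run the small, highly sensitive layer ends up at the highest tier while the large, tolerant one stays at the lowest, which is the general behaviour a sensitivity-weighted allocation is meant to produce. A greedy pass can leave part of the budget unused; an exact knapsack formulation would spend it more tightly.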