BitsMoE: Spectral Energy-Guided Bit Allocation for MoE LLM Quantization (+27.83pp over GPTQ at 2-bit)

SVD-based mixed-precision quantization framework for MoE models: separates shared basis from expert-specific factors, allocates bits via integer linear programming. +27.83pp accuracy over GPTQ under 2-bit Qwen3-30B-A3B, 12.3x decoding acceleration. Code public.

SVD-based mixed-precision quantization framework for MoE models: separates shared basis from expert-specific factors, allocates bits via integer linear programming. +27.83pp accuracy over GPTQ under 2-bit Qwen3-30B-A3B, 12.3x decoding acceleration. Code public.

Read the original article ↗

Reading context

publishedJune 2, 2026

sourcearxiv.org

tagsquantization, compression

Jump back into the feed

Latest edition

Return to the ranked homepage rail and continue scanning the current edition.

Original source

Open the upstream article in a new tab for the full source context.