
Open Source·
Cloudflare open-sourced a lossless LLM compressor that shaves 22% off model weights
Unweight is Cloudflare Research's new BF16 weight compressor. 22% smaller bundles, 13% smaller inference footprint, 30-40% throughput overhead, BSD license.

