Qwen3.5-0.8B
Fast proof point for compression behavior and tier quality.
Supported Models
The first public model surface is centered on Qwen3.5 because it is easier to believe a strong compression story when the model line, the tiers, and the setup path all stay coherent.
Fast proof point for compression behavior and tier quality.
First model where smaller and still useful becomes a real product story.
Mainstream pressure test for the launch surface.
Where memory ceilings and multi-GPU costs start to bite hard.
Base
Original fp16. This is the reference point everything else has to justify.
Parity
The safety-first tier for people who want aggressive size reduction without giving up the behavior they trust.
Frontier High / Low
The compression-first tiers for people chasing much smaller artifacts and willing to choose how hard they want to push.