Same 17-prompt suite as the Qwopus3.6-27B v1-preview eval — 5 agentic + 1 nothink rerun, 5 web-design, 6 canvas/WebGL. Q5_K_M on a single RTX 5090 via llama.cpp. Headline: 162 tok/s avg, 129k tokens generated, 13.3 min runtime. The MoE is 2.6× faster than the dense 27B preview at a larger quant. The web-design outputs are some of the best one-shot HTML I've seen out of any open model in this size class — verbose in a good way, the pages feel complete on the first attempt where most models land surface-level.
3 of 6 creative-canvas runs shipped clean and are below. The Mandelbulb shader, soft-body physics sandbox, and audio-reactive visualizer didn't render correctly on first attempt — these are prompts most one-shot models in this size class fail on, and they're typically the kind of brief that needs a second turn to fix shader errors or collision math. Excluded here for the headline; raw outputs are still in the repo if you want to inspect.