How it works

One prompt. Two models. You judge the pixels, not the price tag.

Every page in the test is generated cold from a single prompt - one take, n=1, no edits, no cherry-picking. GLM 5.2 (open, on Together AI) and Claude Opus 4.8 (closed) each get the exact same brief, and the page you see is the raw output. The only thing hidden during the test is the bill.

$0.66GLM 5.2 · all 11
$4.31Opus 4.8 · all 11
6.6×cheaper, in total
  1. 1
    Same brief, both models. A short, identical prompt goes to GLM 5.2 and Opus 4.8 - no design kit, no tools, no planning step.
  2. 2
    One raw take each. Whatever the model returns is the page. No retries, no human edits, costed at real API token rates.
  3. 3
    You guess blind. Two unlabelled pages, side by side. Pick the one that cost more - then we show the receipts.

The exact prompt

Identical for both models. Just a system line and a one-paragraph brief - the rest is the model.

System
You are a senior product designer and front-end
engineer. Build a complete, polished landing page
for the brief below as a single, self-contained
HTML file: all CSS and any JS inline, no
frameworks, no build step, no placeholder text.
Make deliberate choices about layout, type,
colour, spacing and motion. Output exactly one
final ```html block.
Brief · per page
Brief: Nimbus - an edge compute platform
that runs your code in 300+ cities with zero
config and instant rollbacks. Audience: backend
and platform engineers. Register: dark mode,
technical, confident, fast.

Build it now. Output one final ```html block.
One of 11 briefs. The category, audience and register change; the shape stays the same.

What each page actually cost

Hover a row to see both pages. Prices are the real API cost of that one generation.

PageGLM 5.2Opus 4.8CheaperRelative cost
NimbusEdge compute platform$0.096$0.414.3×open GLM ↗open Opus ↗
FathomSleep & meditation app$0.060$0.36open GLM ↗open Opus ↗
ForgeStreetwear drop$0.054$0.27open GLM ↗open Opus ↗
Olive & AshFarm-to-table restaurant$0.064$0.385.9×open GLM ↗open Opus ↗
QuantaAI research lab$0.037$0.246.4×open GLM ↗open Opus ↗
VersoDesign magazine$0.048$0.24open GLM ↗open Opus ↗
HalcyonPremium headphones$0.054$0.468.5×open GLM ↗open Opus ↗
MeridianLogistics SaaS$0.068$0.558.1×open GLM ↗open Opus ↗
YonderGuided hiking trips$0.070$0.547.7×open GLM ↗open Opus ↗
SiftEmail client$0.043$0.388.7×open GLM ↗open Opus ↗
BloomPlant subscription$0.062$0.497.9×open GLM ↗open Opus ↗

Methodology

One take per page, n=1, no cherry-picking; both models built from the same prompt and the page you see is the raw output. GLM 5.2 is zai-org/GLM-5.2 on Together AI; Opus is claude-opus-4.8. Prices are the real cost of each generation at list token rates. The typical gap is about (widest: Sift at 8.7×; narrowest: Nimbus at 4.3×).

Token + time detail
NimbusGLM 22.0k tok · 97sOpus 16.7k tok · 167s
FathomGLM 13.8k tok · 57sOpus 14.7k tok · 140s
ForgeGLM 12.5k tok · 58sOpus 11.2k tok · 104s
Olive & AshGLM 14.7k tok · 53sOpus 15.4k tok · 148s
QuantaGLM 8.7k tok · 31sOpus 9.9k tok · 89s
VersoGLM 11.1k tok · 38sOpus 10.0k tok · 92s
HalcyonGLM 12.6k tok · 40sOpus 31.7k tok · 140s
MeridianGLM 15.6k tok · 72sOpus 39.8k tok · 205s
YonderGLM 16.0k tok · 56sOpus 59.4k tok · 193s
SiftGLM 10.0k tok · 51sOpus 32.7k tok · 139s
BloomGLM 14.3k tok · 58sOpus 37.8k tok · 182s