Coding Arena: The Glass
task vglass-fill-1.0.0
One coding prompt, given to every model word for word. Each card runs that model's answer, live and sandboxed. No edits, no fixes. What it shipped is what you click.
The task
Build a 2D game in a single self-contained HTML file (HTML + CSS + JS, no external libraries, no network). On screen is an empty glass. Each click pours water in; the glass fills up over repeated clicks, with a visible animated water level. Include a way to empty/reset it. Keep it to one file that runs by just opening it in a browser.
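To make the prompt concrete, here is a minimal sketch of the state logic it asks for. This is not the reference and not any model's entry. The names `MAX_STEPS`, `pour`, `reset`, `fillFraction`, and `stepDisplay`, and the choice of ten clicks to fill the glass, are assumptions made for illustration only. Drawing and click handling are left out so the logic can run anywhere.

```javascript
// Assumption: ten clicks fill the glass. The task does not fix this number.
const MAX_STEPS = 10;

// One click adds one step of water; a full glass stays full.
function pour(steps) {
  return Math.min(MAX_STEPS, steps + 1);
}

// Emptying the glass returns it to zero steps.
function reset() {
  return 0;
}

// Fill level as a fraction from 0 (empty) to 1 (full).
function fillFraction(steps) {
  return steps / MAX_STEPS;
}

// One animation frame: move the drawn level a fixed share of the way
// toward the target level, and snap once the gap is negligible.
function stepDisplay(displayed, target, ease = 0.2) {
  const next = displayed + (target - displayed) * ease;
  return Math.abs(target - next) < 0.001 ? target : next;
}
```

In a page, a click handler would call `pour`, a reset button would call `reset`, and a `requestAnimationFrame` loop would call `stepDisplay` each frame before drawing the water. Counting whole clicks rather than adding 0.1 per click avoids floating-point drift at the full mark.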
Reference
The target, written as the spec for this task, not a model entry. It sets the bar the models are measured against.
gpt-5.4-nano
Auto-recorded. Grade chips by hand.
gpt-5-nano
Auto-recorded. Grade chips by hand.
gpt-4o
Auto-recorded. Grade chips by hand.
claude-opus-4-7
Auto-recorded. Grade chips by hand.
GLM-5.2
gpt-5.5
claude-opus-4-6
GLM-4.6
GLM-4.5
gpt-5-mini
gpt-5.2
claude-opus-4-8
claude-sonnet-4-6
claude-haiku-4-5
Each game runs in a sandboxed iframe (scripts only, no network, no access to this page). The numbers were measured when the game was recorded: tokens (input→output), cost (usage × the fixed price table), time (how long the model took), and code (raw line count).
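As a rough illustration of how the cost and code figures could be derived, the sketch below multiplies token usage by a price table and counts raw lines. The table `PRICES`, its per-million-token USD unit, the model name `example-model`, and the function names are all hypothetical. The page's actual price table and counting rules are not shown here.

```javascript
// Hypothetical price table: USD per one million tokens.
const PRICES = {
  "example-model": { input: 1.0, output: 4.0 },
};

// cost = usage × price, summed over input and output tokens.
function costUsd(model, usage, prices = PRICES) {
  const price = prices[model];
  if (!price) throw new Error(`no price entry for ${model}`);
  return (usage.input * price.input + usage.output * price.output) / 1e6;
}

// Raw line count: every line counts, including blank lines and comments.
function rawLineCount(source) {
  if (source === "") return 0;
  const lines = source.split(/\r?\n/);
  // A trailing newline ends the last line; it does not start a new one.
  if (lines[lines.length - 1] === "") lines.pop();
  return lines.length;
}
```

Under these assumed prices, a run with 1,000,000 input tokens and 500,000 output tokens would cost 3 USD, and a file containing `a`, a blank line, and `b` would count as three lines.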