TAG
#puzzlescript
0 reviews · 4 essays
Related essays
Inside Stephen Lavelle's Philosophy — Making the Refusal to Explain Into the Work
"Nah I don't feel like it." That was how Stephen Lavelle (Increpare) answered a player who asked him to explain his puzzle design. The prolific maker of 500-plus freeware games, the author of the free engine PuzzleScript and of Stephen's Sausage Roll. I read his philosophy, obsessions, failures, dilemmas and influences using only words he himself wrote.
Where 'Solvable' and 'Fun' Diverge — PuzzleJAX Hands 500+ PuzzleScript Games to the Machines (arXiv, Aug 2025)
One article today: "PuzzleJAX: A Benchmark for Reasoning and Learning" (arXiv preprint, August 2025) by researchers at NYU, the University of Malta, the University of the Witwatersrand (South Africa), and Microsoft (Sam Earle, Graham Todd, Ahmed Khalifa, Julian Togelius and others). They reimplement PuzzleScript — Stephen Lavelle's (increpare) 2013 puzzle-authoring language — on the GPU and hand 500+ human-authored games to tree search, reinforcement learning, and large language models. Read as a designer, the core is one observation: 'solvable by a machine' and 'interesting to a human' are not the same thing. Tree search brute-forces simple games but stalls the moment they get richer; LLMs score 0% on most. The authors even note PuzzleScript's own creator hesitating to embed an auto-solver into the IDE, a caution about measuring difficulty by search.
No prizes, clear constraints, real community — the design wisdom behind Thinky Puzzle Game Jam 6
One article today. We cover Corey Hardt’s May 15 announcement of the 6th annual Thinky Puzzle Game Jam (running June 20–28). The jam’s deliberate “no prizes” policy, 48-hour working limit, and PuzzleScript-friendly approach distinguish it from commercially-oriented jams, creating a space for pure design experimentation. Over 150 participants have already joined.
Can AI Build a Whole Puzzle Game? ScriptDoctor and Its Generate-Playtest-Repair Loop
ScriptDoctor has a large language model write an entire puzzle game — rules, sprites, levels — then lets a compiler and a search-based agent inspect the result and demand revisions. The testbed is PuzzleScript, a language indie developers know well. I walk through the paper in five parts — problem, method, findings, where you can use it, limitations — covering why human-authored examples boost success rates, why reasoning models win, and the distance between 'solvable' and 'fun'.