ESSAY · 2026-07-05
When the Control Scheme Decides the Difficulty — From Grid Movement to Drag-and-Arrange
The single move you hand to the fingers shapes the very unit of thought
Introduction
When we design a puzzle game we talk endlessly about rules and boards. But what the player actually touches is not a rule. It is a fingertip. One press of an arrow key, one drag of a tile, one line traced across the screen — that single input decides the very unit of thought. I think the control scheme is the most underrated variable in puzzle design. The same board becomes an entirely different problem depending on whether you push it, trace it, or drag it.
This essay follows how the control scheme shapes difficulty through how specific games feel in the hand: from Sokoban's grid movement in 1982 to The Witness's line drawing, the dragging of Gorogoa and A Little to the Left, and the cursor of Return of the Obra Dinn. In the subtractive-design essay I counted verbs; here I treat the layer one step earlier — how a verb is input — because input decides the shape of thought before the verb does.
Grid Movement as Discretization — The Power of Countable Moves
The oldest control scheme in thinking puzzles is discrete movement on a grid. In 1982, Thinking Rabbit's Sokoban had you push a crate one cell at a time. That discreteness was decisive. Because a move is clearly countable, the board's states divide into a finite set, and you can look ahead to how many moves until a deadlock. Continuous control allows no such exact foresight. Grid movement is an input that translates a puzzle into something computable.
Baba Is You (2019) and Snakebird (2015) both build on this discrete movement. In Baba, each move shifts a word block and rewrites a rule; in Snakebird, each move grows the body one cell and gravity acts. Because the unit of a move is clear, the player can rewind and try another. As I argued in The Ethics of Undo, undo works because input is discrete. Rewinding a continuous action raises the ambiguity of how far to go back.
The strength of grid movement is that player and designer speak of the board in one language. Right, right, up, push can be shared as a solution outright. Discrete input converts a solution into something you can put into words. It meshes with the one-screen essay: when the board is surveyable and moves are countable, a puzzle reaches its most legible difficulty. I take this pairing as the reference point of the genre.
Tracing a Line — The Witness and the Verb of Drawing
In 2016, Jonathan Blow's The Witness built an entire game from one input: tracing a line across panels all over the island. Draw from start to end — that is all. But this tracing demands a different kind of thought from grid movement. The player must design the path itself, verifying as they draw whether each branch satisfies the board's symbols: black and white dots, colored blocks, tetromino pieces. Because the input is a continuous trajectory, the solution is not a list of points but the shape of a single line.
The cleverness of tracing is that failure is instantly visible. A wrong line is rejected in red the moment you finish it. The player externalizes a hypothesis mid-input and sees the error. This is continuous with the observation-as-play lineage. In The Witness, the act of drawing is itself the instrument of observation: you look, you trace, and the board answers whether the path is right. It is a rare design where input and observation fuse.
What intrigues me is that The Witness's line keeps an essentially grid-like discreteness. Because it runs along the panel's lattice, the path decomposes into finite options. You trace continuously, yet solve discretely inside. That duality gives tracing its peculiar feel: the finger moves smoothly while the mind solves a combinatorial join of point to point. Continuity of input and discreteness of solution coexist without contradiction here.
Drag and Arrange — The Continuous Hand of Gorogoa and A Little to the Left
When input leaves the grid, a puzzle's character changes greatly. In December 2017, Jason Roberts' Gorogoa had you grab, move, overlap, and rearrange tiles of a quartered picture. The player operates not a discrete move but the continuous relation between images. The instant one tile's window frame aligns with another's corridor, a path connects. Here the input is not where to place but how to overlap, and the solution rises as a spatial coincidence.
In November 2022, Max Inferno's A Little to the Left had you drag scattered objects into order: pencils by length, plates nested together. What makes it distinctive is that the answer is not always singular. Dragging, a continuous input, lacks the unique procedure of grid movement. The player feels out a configuration that looks tidy with the fingertips. The greater freedom tilts the solution from logic toward aesthetics. The same act of tidying uses a wholly different brain than Sokoban's push.
The design difficulty of drag input lies in undo and answer-checking: how far to rewind a freely movable object, which arrangement to accept as right. A Little to the Left turned this ambiguity to advantage by allowing multiple solutions, but many drag puzzles stumble here. Even a track-drawing input like Cosmic Express clarifies checking by snapping to a discrete lattice in the end. If you adopt continuous input, you need some device to recover discreteness. That is homework the designer must take on.
Input Without an Avatar — Observation, Time, and Cards
The control schemes so far all had something to move: a crate, a snake, a line, a tile. But some puzzles make the object of input the player's own attention. In 2018, Lucas Pope's Return of the Obra Dinn has you move a cursor and a memory. You walk the ship, observe moments of death, and fill the crew ledger's blanks by deduction. Most input is look and record; there is no piece to move on a board. A move here is not physical travel but one increment of certainty. Lorelei and the Laser Eyes likewise centers its input on notes and cross-checking.
In September 2018, The Voxel Agents' The Gardens Between made the object of input time itself. You do not move the characters directly; you rewind time back and forth. Push the stick and time advances, pull it and it reverses. Replacing the verb of movement with control over the timeline makes causality itself the material of the puzzle. That same year, Golf Peaks (November 2018) turned movement into cards. Only by choosing a card from your hand does the ball hop a fixed distance. Bundling input into discrete cards turns a continuous golf shot into move-by-move planning like Sokoban.
What these share is prying the object of input away from moving an avatar. A cursor, time, a card. Change what you move and the focus of thought moves too: toward observation, toward causality, toward resource management. The works in the observation-as-play series generate deep focus precisely because they move little; the object of input sits outside the board, in the player's own reasoning. Input design is also the design of where to point the player's attention.
Why Input Decides Difficulty — Discreteness, Undo Cost, and Affordance
The first reason input decides difficulty is the degree of discreteness. The more discrete the input, as in grid movement, the more the player can count moves, look ahead, and put a solution into words. The more continuous, as in dragging, the more the solution turns sensory, leaning on trial and error over logic. The designer must first decide where on this axis to place the work. There is a middle answer too, like The Witness: input continuously while keeping the interior discrete. Discrete or continuous defines the very quality of difficulty.
The second reason is the cost of undo. As I argued in The Ethics of Undo, whether you permit or punish trial transforms the experience, but whether undo is even possible depends on input design. A discrete move rewinds exactly. A continuous drag or time control raises a new question of how far to go back. The Gardens Between making time itself the object of input was a clever answer that fuses undo and input: rewinding is itself the play.
Third, input defines affordance. The player reads what seems possible from the shape of the input. Arrow keys imply movement, dragging implies arrangement, a cursor implies selection. As I wrote in Shaping the Learning Curve, a good puzzle teaches its controls without words. That is possible because the input itself speaks of the available acts. Get input design wrong and the player stumbles before what to do. Before the rules, the question is whether the shape is one the fingers can understand.
Closing
From Sokoban's grid movement to The Witness's line, the dragging of Gorogoa and A Little to the Left, Obra Dinn's cursor, The Gardens Between's time, and Golf Peaks's cards, the control scheme looks like a puzzle's surface, yet it decides the very unit of thought. Discrete or continuous, whether undo works, what the input affords. The same board yields a different difficulty when the input changes. Before counting verbs, you design how the verb is handed to the fingers. That is control-scheme design.
If I built a puzzle next, I would settle the input before the rules. Whether the first move the player's fingers touch is discrete or continuous. Whether it can be rewound or not. That single point later governs the board, the rules, and the quality of difficulty alike. I leave the reader a question: the puzzle you love most right now, by what single move of the fingers does it run? When you can put that feel into words, the core of its design comes into view.
Reactions (no login)
Anonymous • one of each per visitor per day