Parametric Lunar Lander: A Controlled-Dynamics Testbed

Lunar Lander is a small control environment: a policy uses main and side thrust to bring a vehicle down safely between two flags. In the stock environment, the physics stays the same every episode. Gravity stays the same. Engine powers stay the same. Density stays the same. The wind regime stays the same. That is exactly what a fixed control environment is for: hold the dynamics constant so the policy is the thing being measured. ...

May 8, 2026 · vignesh saptarishi

When the Repo Isn't the Right Boundary

The useful version of a coding agent is the one you do not have to babysit. The risky version is the one with broad permissions pointed at your normal working tree. That tension gets sharper once the task stops fitting inside one repo. A lot of the work I care about spans multiple repos, often code and docs, plus local datasets and sometimes a prepared environment with GPU access. A typical task is not “edit repo A.” It is “change code in one repo, update notes in another, and run against local data in the same attempt.” The repo is often the wrong unit of isolation. ...

March 17, 2026 · vignesh saptarishi

The Layer That Document Systems Keep Rebuilding

Documents are text arranged in space, and the arrangement carries meaning that the text alone does not. A heading is a heading because of where it sits and how it is set on the page. A label and its value live in a directional relationship before they live in a semantic one. A multi-column page has a reading order that the page geometry imposes on the language, not the other way around. ...

February 26, 2026 · vignesh saptarishi