Parametric Lunar Lander: A Controlled-Dynamics Testbed
Lunar Lander is a small control environment: a policy uses main and side thrust to bring a vehicle down safely between two flags. In the stock environment, the physics stays the same every episode. Gravity stays the same. Engine powers stay the same. Density stays the same. The wind regime stays the same. That is exactly what a fixed control environment is for: hold the dynamics constant so the policy is the thing being measured. ...