Describe your RL environment

Plain English in, runnable Gymnasium code out. Complete with training script and Dockerfile.