Something broke → I investigated → I fixed → I learned. Engineering thinking, not tutorials.
An agent that never reached the goal — hypothesis-driven debugging from rewards to environment reset.