What did Jesse Robbins build at Amazon?
Jesse Robbins joined Amazon in 2001 as “Master of Disaster” — his actual business card title — responsible for the availability of every property bearing the Amazon brand. He created Amazon’s Incident Management program and the GameDay practice of deliberate failure injection, which established the operational discipline that allowed Amazon’s web services to scale reliably.
His approach was shaped by his background in emergency services. The incident-command structure used by fire departments — clear roles, practiced procedures, calm under pressure — became the template for how Amazon managed outages. But the turning point was watching a junior engineer physically shaking during an outage, terrified of being blamed. That moment convinced Robbins that the punitive culture around failure was the real reliability problem. He shifted Amazon’s approach from blame to learning — making it safe to experiment, safe to fail, and safe to report problems honestly.
GameDay took this further by deliberately creating crises so teams could practice before real failures hit. The programs Robbins built at Amazon directly influenced the emergence of site reliability engineering as a discipline and laid the groundwork for the DevOps movement he would later help start.