John Allspaw

SE Radio 284: John Allspaw on System Failures: Preventing, Responding, and Learning From

Venue: Internet
John Allspaw, CTO of Etsy, speaks with Robert Blumen about systemic failures and outages. Topics include: how systems are defended against outages; why they fail anyway; why failures are not entirely preventable; why outages involve multiple failures; the time that Etsy identified its own office as a potential source of fraud; the human as part of the system; whether human error is an important component of failure; understanding human action during failures; what we can learn from outages; effective post-mortems; testing as a way of preventing failure; the limitations of testing; and testing in production.

