Maintaining production systems can be a burden, especially for those who haven’t been doing it for long. Knowledge is lost when people leave, practices are put in place only when a need arises, and onboarding new people can take a long time. In this talk, we will go through production readiness principles and cover the recent technical and organizational shifts in how organizations push new components to production. Production readiness sometimes requires collaboration across several teams, and it influences decisions from design through to late-stage operational practices. This talk will examine practices from design and development, configuration management, observability, release management, capacity planning, security, and compliance. After this talk, you will have better practical skills to keep your production systems happy and your engineers less stressed when they deploy production services.
This talk will explain how to achieve production maturity through manual and automated checks, knowledge sharing, production readiness reviews (PRRs), and organizational rethinking. After motivating why production readiness is important, it will present Google's PRR system, with a concrete example from YouTube.