Mean-time to recovery shows how long it takes for an application in the production stage to recover from failure
Mean-time to production shows how long it takes when new code is committed into code repository
Average lead-time shows how long it takes for a new requirement to be delivered and deployed
Deployment speed shows how fast you can deploy a new version of the application between staging, test and production
Deployment frequency shows how often you can deploy a new release into production environment and testing / QA
Production failure rate shows how often software fails in production
DEVOPS SUCCESS FACTORS
Risk Management Case Study
Chaos Monkey, developed by Netflix in 2011, is a tool used for resilience testing of their IT infrastructure. It deliberately disables computers in their production network to observe how the remaining systems respond to the outage. Part of the Simian Army suite, it simulates and tests various system failures and edge cases. Embracing the principles of DevOps, Chaos Monkey contributes to continuous testing and supports the “Design for failure” pattern, ensuring computer applications can handle the failure of underlying components.
Learn more about this story here .