Eoin Woods on Bringing Systems into Production and Keeping them there
Eoin Woods and Sven Johann discuss what developers need to know to bring systems into production successfully and keep them there. They cover production environments, what goes wrong in production, architectural requirements for operations, the cost of very high availability, stability and capacity, communicating operational concerns, observability, learning from incidents, chaos engineering, and operational models (SRE, "you build it, you run it", classic).
Show Notes
- Eoin’s book on software and systems architecture
- Understanding quality attributes
- Calculus of service availability
- Release It
- arc42 template for communicating software architecture
- C4 model
- Operational, the forgotten architectural view
- Google Dapper, a large scale distributed tracing infrastructure
- Learning from incidents
- Organizing teams for flow
- Chaos Engineering (Casey Rosenthal, Nora Jones)
- Chaos Engineering (Russ Miles)
- Site Reliability Engineering
- Seeking SRE
- Adrian Cockcroft on resiliency - learning from other disciplines
- Cindy Sridharan, Distributed Systems Observability
- Esther Derby, Diana Larsen, Agile Retrospectives
- Aino Corry, Retrospectives Antipatterns