Ultimate lesson learned
Incident overview
Impact summary
Outage start time / Outage end time
Timeline of the issue
Root cause
Software component:
Team:
Contact person
Manager of contact person
Detection
Incident response
Could the change be rolled back easily?
Monitoring
Was issue automatically detected?
Prevention and Improvement
What to do in order to prevent the issue?
Others
How many issues your team had per quarter?
How you measure impact of the issue?
Last updated
Was this helpful?