Postmortem Template
Date | Version | Changes |
---|---|---|
2022-04-12 | v1.0.0 | |
Incident Summary
Write a summary of the incident in a few sentences, including:
What happened
Why it happened
The severity of the incident
How long the impact lasted
Key Performance Indicators
KPI | Time | Comment |
---|---|---|
Time to Repair | | Time from when the incident started until normal operation was restored |
Time to Recover | | Time from when the incident started until it was resolved |
Time to Respond | | Time from when the incident was discovered until normal operation was restored |
Time to Acknowledge | | Time from the first indication of the incident until work started to resolve it |
Time Since Last Failure | | Time from the end of the previous incident to the start of this one |
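The KPI definitions above are all differences between pairs of timestamps, so they can be derived mechanically once the timeline is known. The following is a minimal sketch, not part of the template itself. The class `IncidentTimestamps`, the function `compute_kpis`, and every field name are hypothetical, and the sketch assumes that "first indication" and "discovered" are recorded as separate timestamps. The example values are made up for illustration only.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class IncidentTimestamps:
    """Hypothetical container for the timestamps the KPI table refers to."""
    previous_incident_end: datetime  # end of the previous incident
    started: datetime                # when the incident started
    first_indication: datetime       # first sign that something was wrong
    discovered: datetime             # when the incident was discovered
    work_started: datetime           # when work to resolve it began
    restored: datetime               # when normal operation was restored
    resolved: datetime               # when the incident was resolved


def compute_kpis(ts: IncidentTimestamps) -> dict[str, timedelta]:
    """Compute each KPI as the difference described in the table."""
    return {
        "Time to Repair": ts.restored - ts.started,
        "Time to Recover": ts.resolved - ts.started,
        "Time to Respond": ts.restored - ts.discovered,
        "Time to Acknowledge": ts.work_started - ts.first_indication,
        "Time Since Last Failure": ts.started - ts.previous_incident_end,
    }


# Illustrative values only, not taken from a real incident.
example = IncidentTimestamps(
    previous_incident_end=datetime(2022, 4, 1, 12, 0),
    started=datetime(2022, 4, 12, 9, 0),
    first_indication=datetime(2022, 4, 12, 9, 5),
    discovered=datetime(2022, 4, 12, 9, 10),
    work_started=datetime(2022, 4, 12, 9, 20),
    restored=datetime(2022, 4, 12, 10, 0),
    resolved=datetime(2022, 4, 12, 11, 30),
)

kpis = compute_kpis(example)
for name, value in kpis.items():
    print(f"{name}: {value}")
```

The printed values can be copied into the Time column of the KPI table. Keeping the raw timestamps in one place also makes it easier to check the KPIs against the Timeline section.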
Leadup
Describe the sequence of events that led to the incident, for example:
Previous changes that introduced bugs that had not yet been detected
Fault
Describe how the change that was implemented didn’t work as expected. If available, attach screenshots of relevant data visualisations that illustrate the fault.
Impact
Detection
Response
Recovery
Timeline
Date/Time | Incident Activity |
---|---|
| | |
| | |
| | |
| | |
Root Cause Analysis
Backlog Check
Recurrence
Lessons Learned
Corrective Actions
Action | Responsible | Deadline | Issue Tracking |
---|---|---|---|
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |