incident management
STPA (System Theoretic Process Analysis) -- Teaching a new way to prevent outages at Google
https://sre.google/stpa/teaching/
Read how Google is using System Theoretic Process Analysis (STPA) to analyze pure software systems and discover risks.
Added 10 hours ago
Summary of the Amazon DynamoDB Service Disruption in Northern Virginia (US-EAST-1) Region
https://aws.amazon.com/message/101925/
Added 6 days ago
Major AWS Outage Happening
https://old.reddit.com/r/aws/comments/1obd3lx/dynamodb_down_useast1/
Well, looks like we have a dumpster fire on DynamoDB in us-east-1 again.
Added 1 week ago
Root cause analysis? You're doing it wrong
https://entropicthoughts.com/root-cause-analysis-youre-doing-it-wrong
Added 2 weeks ago
New AWS Security Incident Response helps organizations respond to and recover from security events |
https://aws.amazon.com/blogs/aws/new-aws-security-incident-response-helps-organizations-respond-to-and-recover-from-security-events/
AWS introduces a new service to streamline security event response, providing automated triage, coordinated communication, and expert guidance to recover from cybersecurity threats.
Added 5 months ago
https://www.crowdstrike.com/wp-content/uploads/2024/08/Channel-File-291-Incident-Root-Cause-Analysis-08.06.2024.pdf
https://www.crowdstrike.com/wp-content/uploads/2024/08/Channel-File-291-Incident-Root-Cause-Analysis-08.06.2024.pdf
Added 5 months ago
https://docs.dissect.tools/en/latest/overview/index.html?s=09
https://docs.dissect.tools/en/latest/overview/index.html?s=09
Added 5 months ago