Search Results
Showing results for "postmortem"
No image available
Production Incident Learner: Postmortem to Patches
Create an agent that ingests postmortems and incident tickets, extracts recurring causes, and proposes preventative code/monitoring changes. Require human review.
Tags:
incidents,
postmortem,
prevention,
monitoring,
ops
Author: Assistant
Category: safe-self-improving-ai | Model: gpt-5.2
No image available
Incident Response Plan for AI Failures
Create an incident response plan specific to AI: detection, containment, user comms, rollback, forensic logging, and post-incident retraining rules. Include severity levels and example incidents.
Tags:
incident-response,
rollback,
postmortem,
ops,
safety
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Post-Mortem Template for AI Regressions
Create a post-mortem template tailored to AI regressions: data/prompt/model diffs, evaluation gaps, monitoring misses, and remediation tasks. Include a ‘lessons to tests’ section.
Tags:
postmortem,
regression,
ops,
testing,
remediation
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
Change Control for AI Systems (RFC Process)
Create an RFC-style change control process tailored to recursive AI: what must be documented, reviewers, rollout plan, rollback triggers, and postmortem requirements. Provide a reusable RFC template.
Tags:
change-control,
RFC,
governance,
rollout,
rollback,
safety
Author: Assistant
Category: recursive-ai-safety | Model: GPT-5.2
No image available
AI Incident Response: Runbooks for Model Failures
Write incident response runbooks for AI failures: hallucination spike, data leakage, tool misuse, latency blow-ups, and agent runaway. Include severity levels, comms templates, and postmortem format.
Tags:
incident-response,
runbooks,
agents,
security,
reliability
Author: Assistant
Category: ai-strategy-2026 | Model: gpt-4o
No image available
Incident Postmortem Synthesizer
Collect logs/incidents; ChatGPT drafts a blameless postmortem; Cursor queries log/trace snippets and links to code; Antigravity reconstructs a timeline and verifies action items land in code/config. P...
Tags:
SRE,
incident,
postmortem,
observability,
ChatGPT,
Cursor,
Antigravity
Author: Assistant
Category: reliability-ops | Model: gpt-4o
No image available
Incident Learning Loop
Create incident severities, comms templates, and blameless postmortem format. Propose a 30 minute weekly learning review.
Tags:
incidents,
SRE,
postmortem,
learning,
managers
Author: Assistant
Category: reliability-ops | Model: gpt-4o
No image available
Decision Review Cadence
Define a cadence for reversible vs irreversible decisions. Add premortem, postmortem, and decision log templates. Output a 30 60 90 rollout plan.
Tags:
decision-making,
cadence,
managers,
templates
Author: Assistant
Category: management-ops | Model: gpt-4o
No image available
SaaS SLO & Incident Playbook
Create SLO targets per region, on-call rotations spanning TW/US, incident severity ladder, customer comms templates, and postmortem format.
Tags:
software,
SRE,
incidents,
SLA,
SLO,
on-call
Author: Assistant
Category: reliability-ops | Model: gpt-4o
No image available
SRE Incident Drill Pack
As an SRE lead, prepare an incident drill pack: 3 realistic failure scenarios, runbook steps, on-call rotation, comms templates, status page samples, and a postmortem format with action owners and dea...
Tags:
SRE,
incidents,
runbooks,
on-call,
postmortem
Author: tsubasa
Category: engineering | Model: gpt-4o
No image available
Observability Minimum Viable Platform
As a platform engineer, design an observability MVP: log, metric, trace standards; correlation IDs; dashboards for latency, errors, saturation; SLOs and burn-rate alerts; incident response runbook; an...
Tags:
observability,
SRE,
SLO,
alerts,
runbooks
Author: tsubasa
Category: engineering | Model: gpt-4o
No image available
Real-Time Anomaly Detection & Alerting
Goal: deploy real-time anomaly detection and alerting. Data: streaming events and KPIs. Steps: 1) Forecast baselines with ETS/Prophet; 2) Residual monitoring with robust z-scores; 3) Alert playbooks; ...
Tags:
anomaly;alerting;streaming
Author: Tsubasa Kato
Category: Web Analytics | Model: GPT-5 Thinking
No image available
Incident Postmortem Generator
Create a blameless postmortem for incident <id>: timeline, customer impact, 5 Whys, contributing factors, detection gaps, and corrective actions. Propose guardrails, SLO/SLA adjustments, runbooks, and...
Tags:
"CTO;SRE;incident;postmortem;SLA"
Author: ChatGPT
Category: CTO | Model: GPT-5 Thinking
Back to Home