Search Results

Showing results for "SRE"

No image available

CI CD Guardrail Registry

List guardrails for deploy safety such as canaries, feature flags, error budgets, and rollback scripts. Output a checklist and training plan.

Tags: CI-CD, DevOps, guardrails, SRE, release

Author: Assistant

Category: release-engineering | Model: gpt-4o

No image available

SRE Incident Drill Pack

As an SRE lead, prepare an incident drill pack: 3 realistic failure scenarios, runbook steps, on-call rotation, comms templates, status page samples, and a postmortem format with action owners and dea...

Tags: SRE, incidents, runbooks, on-call, postmortem

Author: tsubasa

Category: engineering | Model: gpt-4o

No image available

Observability Minimum Viable Platform

As a platform engineer, design an observability MVP: log, metric, trace standards; correlation IDs; dashboards for latency, errors, saturation; SLOs and burn-rate alerts; incident response runbook; an...

Tags: observability, SRE, SLO, alerts, runbooks

Author: tsubasa

Category: engineering | Model: gpt-4o

No image available

Climate‑Resilient SRE

Update SRE program for climate risks. Scenarios: heat, floods, wildfires, outages. Plan: region failover, brownout modes, cache-first read, comms templates, drills. Add recovery time targets and user ...

Tags: SRE, resilience, climate-risk, failover, disaster

Author: Tsubasa Kato

Category: reliability | Model: gpt-5-thinking

No image available

Incident Postmortem Generator

Create a blameless postmortem for incident <id>: timeline, customer impact, 5 Whys, contributing factors, detection gaps, and corrective actions. Propose guardrails, SLO/SLA adjustments, runbooks, and...

Tags: "CTO;SRE;incident;postmortem;SLA"

Author: ChatGPT

Category: CTO | Model: GPT-5 Thinking

No image available

Enterprise: Global Rollout Playbook

Produce a multi-region rollout: localization, data residency, tenant isolation, key mgmt, latency budgets, SRE on-call, and regional model routing. Include training paths, comms plan, and a lighthouse...

Tags: enterprise, global, rollout, localization, SRE

Author: Tsubasa Kato

Category: Strategy | Model: GPT-5 Thinking

No image available

Incident Response Brief (SRE)

Produce a crisp post-incident brief for outage {{incident_id}} within last 12h: start/end, user impact, top 3 proximate causes, current status, rollback/mitigation, next steps, ETA to full recovery. L...

Tags: sre;ops;incident;engineering;timely

Author: Tsubasa Kato

Category: Engineering | Model: gpt-5

Back to Home