Explore Prompts

Page 86 of 360 · 4318 prompts

Architecture Guardrails: No Unapproved Major Refactors

Define guardrails that block large architectural changes unless an explicit human-approved RFC exists. Include heuristics to detect “too big” diffs.
Tags: architecture, guardrails, RFC, change-size, governance
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Scoped Repo Understanding Agent: Map the System

Create an agent that builds a “system map”: modules, ownership, dependencies, and critical paths. Use it to constrain edits to low-risk areas first.
Tags: codebase-map, dependencies, ownership, risk-reduction
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Multi-Agent Consensus on Risky Changes

Design a consensus mechanism for risky changes: verifier override, structured voting, and escalation to human. Include disagreement logs and rationale storage.
Tags: consensus, multi-agent, risk, escalation, logging
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Evaluation Harness: Deterministic Replays

Build an eval harness for self-edits: deterministic tool mocks, seeded randomness, replayable runs, and stored artifacts for auditing decisions.
Tags: evals, reproducibility, mocks, replay, audit
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Self-Improving Prompt Library With Versioning

Design a prompt library that the agent can improve safely: semantic versioning, eval gates, canary prompts, and rollback. Include prompt linting rules.
Tags: prompts, versioning, evals, canary, rollback
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Sandboxed Tool Execution for Code Changes

Create a sandbox spec: network isolation, filesystem constraints, resource limits, and safe temp dirs. Include how MCP tool wrappers enforce the sandbox.
Tags: sandbox, containment, resource-limits, security, MCP
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

No-Secrets Guarantee: Redaction and Leak Tests

Design secret-handling: vault integration, redaction in logs, and automated leak scans on diffs, logs, and artifacts. Include fail-closed behavior.
Tags: secrets, redaction, vault, leak-prevention, security
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Self-Edit Safety Case Template

Create a safety case template per capability: hazards, mitigations, tests, monitoring, rollback, and residual risk. Require evidence links and sign-off rules.
Tags: safety-case, hazards, mitigations, monitoring, signoff
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Human-in-the-Loop Review UX

Design a reviewer UX: diff summaries, risk flags, test results, and “approve with conditions.” Include prompts that help humans spot hidden risks quickly.
Tags: human-review, UX, diff-summary, risk-flags, workflow
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Safe Autonomy Levels: Manual→Assisted→Automatic

Define autonomy tiers for the self-improver. Specify which actions are allowed at each tier and what evidence is required to graduate tiers safely.
Tags: autonomy, tiers, governance, safety-evidence, controls
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Regression Detective Agent Using Golden Tests

Design golden tests for key outputs and an agent that compares before/after behavior. Include tolerance rules and how to prevent “golden drift” over time.
Tags: golden-tests, regression, behavioral-testing, drift
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings:

Observability-First Self-Improvement

Create a tracing and logging spec for the self-edit loop: prompts, tool calls, diffs, test results, deploy events, and user impact metrics.
Tags: observability, tracing, logging, diffs, metrics
Author: Assistant
Created at: 2026-02-20 00:00:00
Average Rating:
Total Ratings: