Explore Prompts

Page 110 of 360 · 4318 prompts

Safety Culture: Team Habits That Prevent Accidents

Design safety culture practices: blameless reporting, review norms, escalation comfort, and training. Include weekly rituals and how to measure culture health.
Tags: safety-culture, team, process, training, ops
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Ethical Review: High-Impact Use Cases

Create an ethical review process for high-impact use: fairness, accessibility, harm minimization, and user consent. Include a checklist and escalation rules.
Tags: ethics, review, fairness, consent, governance
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Capability Containment: Limit Scope of Actions

Design containment by limiting action scope: allowlisted domains, read-only modes, rate limits, and staged privileges. Include how to measure whether containment is effective.
Tags: containment, scope-control, allowlist, rate-limits, safety
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safety-Weighted Product Roadmap (Quarterly)

Create a quarterly roadmap for recursive improvement: initiatives, milestones, safety deliverables, and staffing. Include an ‘if risk increases’ contingency branch.
Tags: roadmap, planning, governance, safety, milestones
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

User Feedback Loop: High Signal Without Gaming

Design a feedback system that’s hard to game: structured feedback types, sampling, and weighting. Include how to prevent brigading and ensure minority failure modes are captured.
Tags: feedback, robustness, anti-gaming, quality, governance
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safety Review Checklist Before Production Release

Create a production readiness review checklist: evaluation results, red-team status, privacy review, rollback tested, monitoring live, and owner sign-offs. Output a go/no-go rubric.
Tags: release-readiness, checklist, governance, monitoring, rollback
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safe Logging: What to Log and What Not to Log

Define a safe logging policy: what to log for debugging, what to redact, retention windows, access controls, and anonymization. Include examples and anti-examples.
Tags: logging, privacy, redaction, retention, security
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Access to Secrets: Design for Zero Trust

Design a zero-trust approach: secrets management, short-lived credentials, environment segregation, and auditing. Include how to test that the AI cannot access restricted secrets.
Tags: zero-trust, secrets-management, security, audit, containment
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Non-Deceptive Behavior Requirements

Write explicit requirements to prevent deceptive behavior: transparency rules, auditability, and logging. Include tests to detect hidden policy violations or manipulation attempts.
Tags: non-deception, transparency, audit, tests, governance
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safe Self-Reflection: How to Use Critique Without Loops

Design a safe self-reflection procedure: bounded critique, avoiding infinite loops, and ensuring critique is grounded in evidence. Provide stop rules and performance safeguards.
Tags: self-critique, bounded, stop-rules, quality, safety
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Cross-Validation: Two Models, One Decision

Create a cross-validation pattern: primary model proposes, secondary model critiques/verifies, and a policy gate decides. Include how to handle disagreements and measure error reduction.
Tags: verification, dual-model, critique, policy-gate, quality
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safety in Multi-Agent Systems: Containment and Roles

Design a multi-agent architecture with safety: role separation, constrained tools, message filtering, and cross-checks. Include failure modes and an evaluation plan.
Tags: multi-agent, architecture, containment, cross-checks, safety
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings: