Explore Prompts

Page 112 of 360 · 4318 prompts

Governance Model: Roles and Responsibilities

Define a governance model: safety owner, release manager, red-team lead, privacy officer, and incident commander. Include RACI matrix and meeting cadence.
Tags: governance, RACI, roles, process, accountability
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Policy Engine Design: Rules + ML Hybrid

Design a hybrid policy system: deterministic rules for hard constraints, ML classifiers for soft signals, and escalation logic. Provide architecture, failure modes, and monitoring plan.
Tags: policy-engine, guardrails, hybrid, architecture, monitoring
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safety-Focused Prompting Style Guide

Create a prompting style guide for internal prompts: structure, tool usage rules, refusal patterns, and safety reminders. Include examples and a checklist reviewers can apply.
Tags: prompting, style-guide, internal, guardrails, review
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Hallucination Reduction Plan (RAG + Verification)

Design a hallucination reduction plan: retrieval augmentation, answer verification steps, consistency checks, and refusal behaviors. Include evaluation metrics and regression tests.
Tags: hallucination, RAG, verification, consistency, testing
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Truthfulness & Citation Policy for Research Outputs

Create a truthfulness policy: source requirements, citation rules, and how to label speculation. Provide a checklist for editors/analysts and an automated linting concept.
Tags: truthfulness, citations, policy, research, safety
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Uncertainty Calibration: When to Say ‘I’m Not Sure’

Design a calibration approach: confidence estimation, abstention policies, escalation to human, and how to test calibration. Include UI patterns that communicate uncertainty responsibly.
Tags: uncertainty, calibration, abstention, HITL, UX
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Metrics That Matter: Safety + Utility Balanced Scorecard

Create a balanced scorecard: utility metrics (task success), safety metrics (policy adherence), reliability (latency, uptime), and user trust (complaints). Include leading indicators and dashboards.
Tags: metrics, scorecard, safety, utility, reliability, ops
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Eval Design: Avoiding Overfitting to the Test Suite

Design an evaluation strategy that avoids overfitting: holdouts, rotating test sets, adversarial sets, and blind evaluation. Include rules for when to refresh benchmarks.
Tags: evaluation, overfitting, benchmarks, holdout, testing
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Model Update Policy: When to Retrain vs Prompt-Tune

Create decision criteria for retraining vs prompt tuning vs retrieval updates. Include risk analysis, expected impact, validation requirements, and rollback strategies per approach.
Tags: retraining, prompting, RAG, model-updates, governance
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Secure-by-Default Tooling: Safe Defaults Checklist

Create a safe-defaults checklist for the tool layer: deny-by-default, explicit allowlists, safe parameter validation, output filtering, and timeouts. Include common failure modes.
Tags: secure-defaults, tooling, validation, timeouts, guardrails
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Tool Permission Model (Least Privilege)

Create a least-privilege permission model for tools: scopes, rate limits, time bounds, and audit logs. Provide an authorization matrix and guidelines for granting elevated access.
Tags: least-privilege, authorization, tools, audit, security
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Prompt Injection Defense Plan (Tool-Using Agents)

Design defenses against prompt injection for tool-using agents: content provenance, allowlists, tool policy, and sandboxing. Include a suite of adversarial prompts for regression testing.
Tags: prompt-injection, agents, tooling, security, testing
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings: