Explore Prompts

Page 111 of 360 · 4318 prompts

Model Spec to Tests: Traceability Matrix

Create a traceability matrix linking requirements/specs to tests and monitors. Provide a template and a worked example for a tool-using assistant system.
Tags: traceability, requirements, tests, monitoring, governance
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safety-First Reward Modeling (High-Level)

Describe a high-level approach to align reward signals with safe behavior: preference data guidelines, reward hacking risks, and validation. Keep it conceptual and focused on safety.
Tags: reward-modeling, alignment, safety, validation, conceptual
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Annotation Guide for Human Reviewers

Create an annotation guide: definitions, examples, severity levels, and how to handle ambiguity. Include training exercises and a QA process for reviewer consistency.
Tags: annotation, guidelines, human-review, QA, consistency
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Safety Benchmarks: Build a Domain-Specific Set

Help me design a domain-specific safety benchmark: representative tasks, policy-sensitive cases, and adversarial cases. Include labeling guidelines and inter-annotator agreement checks.
Tags: benchmarks, safety, domain-specific, annotation, quality
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Drift Detection: Data, Behavior, and User Mix

Design drift detection: changes in user queries, outcome distributions, error types, and model behavior. Include thresholds and a playbook for when drift is detected.
Tags: drift-detection, monitoring, analytics, playbook, safety
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Adversarial Robustness: Stress Testing Inputs

Create a stress test plan: malformed inputs, long-context traps, conflicting instructions, and toxic content probes. Provide how to automate and score robustness over time.
Tags: robustness, adversarial, testing, stress-tests, quality
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Differential Privacy and Minimization Options (Conceptual)

Explain privacy-preserving options for feedback loops: minimization, aggregation, differential privacy (conceptually), and retention policies. Provide a practical selection guide.
Tags: privacy, minimization, aggregation, differential-privacy, policy
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Experiment Design: Safe A/B Tests for AI Behavior

Design safe A/B testing for AI changes: guardrails, user segmentation, sensitive cohorts, and safe metrics. Include ethics considerations and how to interpret ambiguous outcomes.
Tags: A/B-testing, experiments, ethics, metrics, rollout
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Compute & Cost Controls for Recursive Loops

Design cost controls: budget caps, queue prioritization, cache policy, and abort rules for expensive runs. Include a method to estimate ROI of improvements before executing.
Tags: cost-control, compute, prioritization, ROI, governance
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Model Card + System Card for Each Release

Generate a model/system card template: intended use, limitations, safety mitigations, eval results, and known failure modes. Include a changelog section for each iteration.
Tags: model-card, system-card, documentation, transparency, release
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Post-Mortem Template for AI Regressions

Create a post-mortem template tailored to AI regressions: data/prompt/model diffs, evaluation gaps, monitoring misses, and remediation tasks. Include a ‘lessons to tests’ section.
Tags: postmortem, regression, ops, testing, remediation
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings:

Pre-Mortem: How Could This Go Wrong?

Run a pre-mortem for a recursive improvement project: list plausible failures, early warning signals, and prevention steps. Output prioritized mitigations and ‘watch items’.
Tags: pre-mortem, risk, planning, signals, mitigation
Author: Assistant
Created at: 2026-02-02 00:00:00
Average Rating:
Total Ratings: