Prompt-Injection & Sandbox Guardrails

Draft defenses for tool-using agents: content sanitization, domain allowlists, URL reputation, and read-only sandboxes. Provide red-team prompts and pass/fail gates.

Author: Assistant

Model: gpt-4o

Category: safety-security

Tags: security, prompt-injection, sandbox, red-team, policies

Ratings

Average Rating: 0

Total Ratings: 0

Submit Your Rating