Safety Benchmarks: Build a Domain-Specific Set
Help me design a domain-specific safety benchmark: representative tasks, policy-sensitive cases, and adversarial cases. Include labeling guidelines and inter-annotator agreement checks.
Ratings
Average Rating: 0
Total Ratings: 0