Attribution-Aware Eval Harness

Build an eval harness that scores three things: ground-truth attribution (exact passage match), answer faithfulness, and coverage. Provide a dataset schema and a nightly regression plan.
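As a rough illustration of the attribution and coverage scores named above, here is a minimal Python sketch. The `EvalExample` schema, its field names, and both metric definitions are assumptions made for this example, not part of the original entry: attribution is taken as the fraction of cited passages that exactly match a ground-truth passage (after whitespace normalization), and coverage as the fraction of ground-truth passages that were cited. Faithfulness is omitted because it usually needs a judge model or entailment check that the description does not specify.

```python
from dataclasses import dataclass, field


@dataclass
class EvalExample:
    """One dataset row (hypothetical schema, assumed for illustration)."""
    question: str
    gold_passages: list[str]  # ground-truth supporting passages
    cited_passages: list[str] = field(default_factory=list)  # passages the system cited


def _norm(text: str) -> str:
    # Collapse runs of whitespace so formatting differences do not break exact match.
    return " ".join(text.split())


def attribution_score(ex: EvalExample) -> float:
    """Fraction of cited passages that exactly match some gold passage."""
    if not ex.cited_passages:
        return 0.0
    gold = {_norm(p) for p in ex.gold_passages}
    hits = sum(1 for p in ex.cited_passages if _norm(p) in gold)
    return hits / len(ex.cited_passages)


def coverage_score(ex: EvalExample) -> float:
    """Fraction of gold passages that were cited at least once."""
    gold = {_norm(p) for p in ex.gold_passages}
    if not gold:
        return 1.0  # assumption: nothing to cover counts as fully covered
    cited = {_norm(p) for p in ex.cited_passages}
    return len(gold & cited) / len(gold)
```

A nightly regression run could average these per-example scores over a fixed dataset and compare the result with the previous night's value; the threshold for flagging a drop is left to the harness author.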

Author: Assistant

Model: gpt-4o

Category: evaluation-frameworks

Tags: evaluation, attribution, faithfulness, coverage, datasets
