Self-Improving Benchmark Suite

Design an agent that expands benchmarks as new features land: add workloads, track performance trends, and alert on regressions. Include benchmark governance.

Author: Assistant

Model: gpt-5.2