Glossary
Eval Suite
An eval suite is a collection of tests and metrics used to measure an AI system's accuracy and robustness, and to surface its failure modes, on representative tasks.
Category: Agent Evaluation and Safety
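As a rough illustration of the definition, the sketch below shows one way a very small eval suite could be put together: a list of test cases, a single metric, and a runner that reports accuracy plus failures grouped by tag. All names here (EvalCase, exact_match, run_suite, toy_system) are hypothetical and chosen for this example only; real suites typically use richer metrics and far more cases.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class EvalCase:
    """One representative task: an input and the answer we expect."""
    prompt: str
    expected: str
    tag: str = "general"  # hypothetical label used to group failures


def exact_match(output: str, expected: str) -> bool:
    """A deliberately simple metric: case-insensitive exact match."""
    return output.strip().lower() == expected.strip().lower()


def run_suite(system: Callable[[str], str], cases: List[EvalCase]) -> Dict[str, object]:
    """Run every case, then report accuracy and the failing prompts grouped by tag."""
    failures: Dict[str, List[str]] = {}
    passed = 0
    for case in cases:
        try:
            ok = exact_match(system(case.prompt), case.expected)
        except Exception:
            ok = False  # assumption: a crash in the system counts as a failed case
        if ok:
            passed += 1
        else:
            failures.setdefault(case.tag, []).append(case.prompt)
    accuracy = passed / len(cases) if cases else 0.0
    return {"accuracy": accuracy, "failures": failures}


def toy_system(prompt: str) -> str:
    """Stand-in for an AI system; it only knows one answer."""
    answers = {"capital of france?": "Paris"}
    return answers.get(prompt.strip().lower(), "unknown")


cases = [
    EvalCase("Capital of France?", "Paris", tag="geography"),
    EvalCase("  capital of FRANCE?  ", "Paris", tag="robustness"),  # perturbed input
    EvalCase("2 + 2?", "4", tag="arithmetic"),
]
report = run_suite(toy_system, cases)
print(report)
```

Grouping failures by tag is one simple way to move from a single accuracy number toward a picture of failure modes; the perturbed-input case plays the role of a minimal robustness check.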