Glossary

Eval Suite

An eval suite is a collection of tests and metrics used to measure an AI system's accuracy, robustness, and failure modes on representative tasks.

Short definition: a set of reliability tests for an AI system.
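
As a rough illustration of the definition above, an eval suite can be sketched as a list of test cases plus a metric computed over them. The names here (`EvalCase`, `run_suite`, `toy_system`) are hypothetical and not taken from any particular framework, and the sketch covers only exact-match accuracy and a list of failing cases; a real suite would typically add robustness checks such as perturbed inputs.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class EvalCase:
    """One test: an input and the output we expect (hypothetical structure)."""
    prompt: str
    expected: str


def run_suite(system: Callable[[str], str], cases: List[EvalCase]) -> Dict[str, object]:
    """Run every case through the system and report accuracy and failing cases."""
    # A case fails when the system's output does not exactly match the expectation.
    failures = [c for c in cases if system(c.prompt).strip() != c.expected]
    # Accuracy is the fraction of cases that passed; an empty suite scores 0.0.
    accuracy = 1 - len(failures) / len(cases) if cases else 0.0
    return {"accuracy": accuracy, "failures": failures}


def toy_system(prompt: str) -> str:
    """Stand-in for the AI system under test (assumption: it just upper-cases text)."""
    return prompt.upper()


cases = [
    EvalCase("abc", "ABC"),
    EvalCase("Hi", "HI"),
    EvalCase("ok", "no"),  # deliberately failing case, to show failure reporting
]
report = run_suite(toy_system, cases)
```

Returning the failing cases alongside the aggregate score keeps the "failure modes" part of the definition visible, rather than reducing the suite to a single number.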