Skip to main content
This feature is in public preview.
The Evaluations API lets you trigger evaluations and retrieve results per version.

Operations

OperationDescription
List evaluationsReturn all evaluation runs for a deployment.
Get an evaluationReturn aggregate scores and per-question detail for a single evaluation.
Trigger an evaluationRun an evaluation against the active version on demand.
For the operator-facing guides, see Evaluations.