Online evaluations run inline in your app code and don’t use the CLI. For more information, see Online evaluations.
Run offline evaluations
Learn how to run offline evaluations using the Axiom CLI and interpret the results.
This page covers running offline evaluations with the Axiom AI SDK CLI. The CLI provides commands for running offline evaluations locally or in CI/CD pipelines.
You can also target specific evaluations by name, file path, or glob pattern:
To see which evaluations are available without running them:
To compare results against a previous evaluation, view both runs in the Axiom Console where you can analyze differences in scores, latency, and cost.
For complex experiments, load flag overrides from a JSON file:
Click the link to view results in the Console, compare runs, and analyze performance.