ZeroEval Documentation home page
Search...
⌘K
Ask AI
Support
Console
Console
Search...
Navigation
Tuning
Models
Guides
Tracing
Quickstart
SDKs
Advanced
Reference
Autotune
Introduction
Setup
Tuning
Models
Prompts
Reference
Calibrated Judges
Introduction
Setup
LLM Gateway
Introduction
Tuning
Models
Copy page
Evaluate your agent’s performance across multiple models
Copy page
ZeroEval lets you evaluate real production traces of specific agent tasks across different models, then ranking them over time. This helps you pick the best model for each part of your agent.
Was this page helpful?
Yes
No
Setup
Prompts
Assistant
Responses are generated using AI and may contain mistakes.