Your agent produces thousands of outputs, but without feedback you can’t tell which ones are good. Feedback closes the loop — it connects real-world quality judgments to the traces, spans, and completions your agent generates. ZeroEval supports two kinds of feedback:
  • Human feedback — thumbs-up/down, star ratings, corrections, and expected outputs submitted by users or reviewers
  • AI feedback — automated evaluations from calibrated judges that score outputs against criteria you define
Both feed into the same system. Feedback attached to completions powers prompt optimization. You can also retrieve unified feedback — combining human reviews and judge evaluations — for any span, trace, or session via the Feedback API.
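Retrieving unified feedback could look like the sketch below. The base URL, endpoint path, and response shape are assumptions for illustration, not the documented ZeroEval API; check the Feedback API reference for the real endpoints.

```python
import json
import urllib.request

API_BASE = "https://api.zeroeval.com"  # hypothetical base URL


def feedback_url(entity_type: str, entity_id: str) -> str:
    """Build a (hypothetical) unified-feedback endpoint for a span, trace, or session."""
    if entity_type not in {"span", "trace", "session"}:
        raise ValueError(f"unknown entity type: {entity_type}")
    return f"{API_BASE}/v1/{entity_type}s/{entity_id}/feedback"


def get_unified_feedback(entity_type: str, entity_id: str, api_key: str) -> dict:
    """Fetch combined human and judge feedback for one span, trace, or session."""
    req = urllib.request.Request(
        feedback_url(entity_type, entity_id),
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

The same pattern applies to any of the three entity types: one call returns human reviews and judge evaluations together, so downstream code never needs to merge two feeds.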

How feedback flows

1. Agent produces output. Your agent runs and ZeroEval captures the full trace: inputs, outputs, model, and prompt version.
2. Feedback is attached. Humans review outputs in the dashboard or your app submits feedback programmatically, and judges evaluate outputs automatically based on your criteria.
3. Quality becomes measurable. Feedback appears on spans, traces, and completions in the console. Filter by thumbs-up rate, judge scores, or tags to find patterns.
4. Improvements are driven by data. Use feedback to optimize prompts, compare models, calibrate judges, and catch regressions before users do.
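Once feedback records are in hand, metrics like the thumbs-up rate in step 3 reduce to simple aggregation. This is a local sketch over an assumed record shape; the `rating` and `judge_score` fields are illustrative, not ZeroEval's actual schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FeedbackRecord:
    """Assumed shape of one feedback entry (illustrative, not the real schema)."""
    span_id: str
    rating: Optional[str]         # "up" / "down" from a human reviewer, or None
    judge_score: Optional[float]  # 0.0-1.0 from an AI judge, or None


def thumbs_up_rate(records: list[FeedbackRecord]) -> Optional[float]:
    """Fraction of human-rated records marked thumbs-up; None if nothing was rated."""
    rated = [r for r in records if r.rating is not None]
    if not rated:
        return None
    return sum(r.rating == "up" for r in rated) / len(rated)


def low_scoring(records: list[FeedbackRecord], threshold: float = 0.5) -> list[str]:
    """Span IDs whose judge score falls below the threshold: candidates for review."""
    return [
        r.span_id
        for r in records
        if r.judge_score is not None and r.judge_score < threshold
    ]
```

Filtering like this is what turns raw feedback into the patterns step 4 acts on, whether you compute it locally or via the console's filters.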

Get started