Evaluation measures how well your agents perform, helping you improve over time.
Quick Start
Evaluator Types
| Evaluator | Measures |
|---|---|
| AccuracyEvaluator | Correctness vs expected |
| CriteriaEvaluator | Multiple custom criteria |
| PerformanceEvaluator | Speed and efficiency |
| Judge | LLM-as-judge scoring |
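
To make the first and third rows concrete, here is a minimal, library-free sketch of what "correctness vs expected" and "speed" measurements might look like. It does not use the PraisonAI API: `toy_agent`, `accuracy_score`, and `average_latency` are hypothetical names invented for illustration, and exact string matching is only one assumed way of judging correctness.

```python
import time
from statistics import mean
from typing import Callable, Sequence, Tuple


def toy_agent(prompt: str) -> str:
    """Hypothetical stand-in for a real agent call (not a PraisonAI API)."""
    answers = {"What is 2 + 2?": "4", "Capital of France?": "Paris"}
    return answers.get(prompt, "I don't know")


def accuracy_score(agent: Callable[[str], str],
                   cases: Sequence[Tuple[str, str]]) -> float:
    """Fraction of cases whose output matches the expected answer.

    Assumption: a case-insensitive exact match counts as correct.
    """
    hits = [agent(prompt).strip().lower() == expected.strip().lower()
            for prompt, expected in cases]
    return sum(hits) / len(hits)


def average_latency(agent: Callable[[str], str],
                    prompts: Sequence[str], runs: int = 3) -> float:
    """Mean wall-clock seconds per call, over `runs` repetitions per prompt."""
    durations = []
    for prompt in prompts:
        for _ in range(runs):
            start = time.perf_counter()
            agent(prompt)
            durations.append(time.perf_counter() - start)
    return mean(durations)


cases = [
    ("What is 2 + 2?", "4"),
    ("Capital of France?", "Paris"),
    ("Capital of Peru?", "Lima"),  # the toy agent does not know this one
]
accuracy = accuracy_score(toy_agent, cases)
latency = average_latency(toy_agent, [prompt for prompt, _ in cases])
print(f"accuracy={accuracy:.2f}, mean latency={latency:.6f}s")
```

In practice, the callable passed in would wrap a real agent, and an LLM-as-judge or criteria-based check would replace the string comparison when answers are open-ended.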
Best Practices
Test on diverse examples
Use varied test cases so that scores reflect how the agent behaves across different inputs, not only on a few similar ones.
Iterate based on scores
Low scores indicate where to improve prompts or tools.
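
One way to act on low scores is to rank test cases by score and revisit the weakest ones first. The sketch below assumes each case has already been scored on a 0 to 10 scale. `CaseResult`, `weakest_cases`, the case names, and the threshold of 7.0 are all hypothetical choices for illustration, not part of any library.

```python
from typing import List, NamedTuple


class CaseResult(NamedTuple):
    """Hypothetical record of one evaluated test case."""
    name: str
    score: float  # assumed 0-10 scale


def weakest_cases(results: List[CaseResult],
                  threshold: float = 7.0) -> List[CaseResult]:
    """Return cases scoring below `threshold`, lowest score first."""
    failing = [r for r in results if r.score < threshold]
    return sorted(failing, key=lambda r: r.score)


results = [
    CaseResult("math", 9.0),
    CaseResult("tool_lookup", 4.5),
    CaseResult("summarize", 6.0),
]
to_fix = weakest_cases(results)
for case in to_fix:
    print(f"{case.name}: {case.score}")
```

Re-running the same cases after each prompt or tool change shows whether the weakest cases actually improved.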
Related
Optimizer
Auto-improve agents
Tracing
Performance tracing

