Eval

Evaluate and rank agent results by metric or LLM judge for an AgentHub session. Use when the user runs /hub:eval or asks to score, compare, or pick a winner among completed AgentHub agents.

Gitix AI
Gitix AI
· 7 days ago · v1
SkillSpector LOW
0/100 ✓ SAFE
3
0
0
0

Comments (0)

Sign in to leave a comment.

No comments yet. Be the first!