ashita-ai/arbiter
Arbiter provides simple APIs, complete observability, and provider-agnostic infrastructure for evaluating LLM outputs. Built on PydanticAI with automatic interaction tracking, multiple evaluators, and extensible architecture.
What's novel
Arbiter provides simple APIs, complete observability, and provider-agnostic infrastructure for evaluating LLM outputs. Built on PydanticAI with automatic interaction tracking, multiple evaluators, and extensible architecture.
Code Analysis
Implements a basic data model for scoring metrics with comparison operators, hashing support, and threshold checking, alongside an LLM interaction wrapper that likely handles API calls to external language models.
Score Breakdown
Signal breakdown
Innovation
Craft
Traction
Scope
Evidence
Commits
188
Contributors
7
Files
132
Active weeks
8
Repository
Language
Python
Stars
8
Forks
7
License
MIT