Agent Benchmarking

Attested

A head-to-head comparison tool for coding agents. It uses YAML task definitions with judge criteria, isolates each agent run in its own git worktree, reports pass rate, cost, time, and consistency metrics, and supports reproducible benchmarking across Claude Code, Aider, Codex, and other agents.
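
As a rough illustration of how the four metrics could be aggregated per agent, here is a minimal sketch. It is not the tool's actual implementation: the `RunResult` record, its field names, and the `summarize` function are hypothetical, and "consistency" is assumed here to mean the fraction of tasks on which all repeated runs of an agent reach the same pass/fail outcome.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class RunResult:
    """One agent run on one task (hypothetical record shape)."""
    agent: str
    task: str
    passed: bool
    cost_usd: float
    seconds: float


def summarize(runs):
    """Aggregate pass rate, mean cost, mean time, and consistency per agent."""
    by_agent = {}
    for r in runs:
        by_agent.setdefault(r.agent, []).append(r)

    summary = {}
    for agent, rs in by_agent.items():
        # Group pass/fail outcomes by task so repeated runs can be compared.
        per_task = {}
        for r in rs:
            per_task.setdefault(r.task, []).append(r.passed)
        # Assumed definition: a task is consistent if every repeat agrees.
        consistent = sum(1 for o in per_task.values() if len(set(o)) == 1)
        summary[agent] = {
            "pass_rate": sum(r.passed for r in rs) / len(rs),
            "mean_cost_usd": mean(r.cost_usd for r in rs),
            "mean_seconds": mean(r.seconds for r in rs),
            "consistency": consistent / len(per_task),
        }
    return summary
```

Calling `summarize` on a list of `RunResult` records returns a dictionary keyed by agent name, which makes side-by-side comparison straightforward. Other definitions of consistency (for example, variance of pass rate across repeats) would fit the same structure.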

Governance Receipt

Signer: sovereign-claw-ed25519
Signed At: 3/9/2026
Risk Tier: low
Receipt Hash: e11a21ba
Manifest Hash: 4b623ffac222cb21
Merkle Root: 1032a8a8
Signature: 4f78ac66
Root Public Key: MCowBQYD

Skill Details

Gate Verdict: Attested
Publication State: published
Risk Tier: low
Manifest Hash: 4b623ffa
