Why Measure Performance?
Without measurement, you can’t answer:- Which agents deliver value?
- Where are bottlenecks?
- Is quality improving over time?
- What’s the ROI of agent investment?
Git-Based Metrics
Git provides a natural audit trail for agent work. Every commit, PR, and change is tracked.Core Metrics
| Metric | Description | Calculation |
|---|---|---|
| Commits/day | Agent activity level | git log --author="agent" --since="1 day" |
| PR merge rate | Quality of output | Merged PRs / Total PRs |
| Time to merge | Review efficiency | PR created → PR merged |
| Lines changed | Scope of work | Additions + Deletions |
| Revert rate | Error frequency | Reverts / Total commits |
Tracking Agent Commits
Tag agent commits with consistent metadata:Query Agent Performance
Performance Dashboard
Using squads CLI
Custom Metrics Script
Quality Metrics
Code Review Scores
Track review feedback on agent PRs:Automated Quality Checks
Benchmarking
Task Completion Benchmarks
Track how long standard tasks take:| Task Type | Target | Current Avg |
|---|---|---|
| Bug fix | < 30 min | 25 min |
| Feature (small) | < 2 hours | 1.5 hours |
| Feature (medium) | < 1 day | 6 hours |
| Refactor | < 4 hours | 3 hours |
A/B Testing Agents
Compare different agent configurations:Monitoring & Alerts
Performance Thresholds
Alerting
Feedback Loop
Recording Feedback
Using Feedback
Continuous Improvement
Best Practices
- Tag all agent commits with consistent metadata
- Track metrics weekly, review monthly
- Set quality thresholds and alert on breaches
- A/B test prompt and configuration changes
- Record human feedback after task completion
- Use metrics to guide agent improvements