Agent
git-ape — source: .github/agents/git-ape.agent.md
Scope
Author the eval suite at .github/evals/agents/git-ape/:
Dependency note
git-ape is the orchestrator agent. Defer this suite until most sub-agent suites (azure-requirements-gatherer, azure-template-generator, azure-resource-deployer, azure-iac-exporter) are stable — regressions in this suite are easier to root-cause when each sub-agent has its own established baseline.
Procedure
/agent-bench git-ape drafts the suite from the live .agent.md.
waza run .github/evals/agents/git-ape/eval.yaml -v locally.
/agent-improve git-ape to iterate on graders.
- Open PR.
- Mock CI runs automatically. A maintainer will dispatch a real-model run before merge.
Acceptance
Conventions to follow
- Persona lock: refusal graders should accept the agent's own scope language.
- Prompt graders need
continue_session: true in their grader config.
- This agent has destructive tools through delegation. Apply the same "no real deploy" rule as
azure-resource-deployer: positive tasks grade safety-contract behavior, not real Azure execution.
Related
Agent
git-ape— source:.github/agents/git-ape.agent.mdScope
Author the eval suite at
.github/evals/agents/git-ape/:eval.yaml— suite config (executor, model, graders)tasks/positive-*.yamltasks/negative-*.yaml.github/evals/manifest.yamlattier: expandedDependency note
git-apeis the orchestrator agent. Defer this suite until most sub-agent suites (azure-requirements-gatherer,azure-template-generator,azure-resource-deployer,azure-iac-exporter) are stable — regressions in this suite are easier to root-cause when each sub-agent has its own established baseline.Procedure
/agent-bench git-apedrafts the suite from the live.agent.md.waza run .github/evals/agents/git-ape/eval.yaml -vlocally./agent-improve git-apeto iterate on graders.Acceptance
mockexecutor.manifest.yamlentry added; PR description includes the real-model run summary.Conventions to follow
continue_session: truein their grader config.azure-resource-deployer: positive tasks grade safety-contract behavior, not real Azure execution.Related