Boundary-respect evaluation framework for autonomous LLM agents — from the pilot where Claude refused to confabulate $20K
ai-safety ai-evaluation llm-agents claude-code agent-evaluation anthropic-mcp alignment-evaluation openai-operator
-
Updated
May 24, 2026 - Python