Repository for "RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity" (Findings of ACL 2026)
agent simulation evaluation story bias social-bias llm social-dilemma social-llm contextual-sensitivity
-
Updated
Jan 3, 2026 - Python