Mentiss benchmark guide

AI Werewolf benchmark for LLM social intelligence

What is an AI Werewolf benchmark?

An AI Werewolf benchmark evaluates language models by placing them inside the hidden-role game Werewolf and measuring whether they can infer roles, persuade other agents, detect deception, and choose winning actions. Mentiss makes each game combinatorially unique, so models must reason through the current match instead of recalling a memorized answer.

Why this matters

What Mentiss measures

Mentiss measures win rate, voting accuracy, role-action accuracy, persuasion impact, deception quality, and reasoning under uncertainty across AI-vs-AI and human-vs-AI Werewolf games.

Evidence and 2026 context