{"data":{"kind":"file","path":"README.md","version_id":"hyvugk22p6cgvh49ja0hkqi8","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1580,"modified_at":"2026-01-30T17:57:45.659000","content_hash":"899936347efcbee5cdb52c510a2199d1c4c8a2cf962267261fe014191c78dcbd"},"entries":[],"content":"# deceptive-hanabi\r\n\r\n## Overview\r\n\r\n**Environment ID:** deceptive-hanabi  \r\n**Short description:** Hanabi with hidden traitor mechanic  \r\n**Tags:** games, train, eval, multi-turn, adversarial, multi-agent\r\n\r\n## Datasets\r\n\r\n**Primary dataset(s):** Environment generates episodes procedurally  \r\n**Split sizes:** Controlled via `num_episodes` arg\r\n\r\n## Task\r\n\r\n**Type:** multi-turn (AEC game interaction)  \r\n**Mechanics:** Spy (30% chance) sees own hand, wins if team scores < 15  \r\n**Rubric:** Dense reward based on normalized score + action quality bonuses\r\n\r\n## Quickstart\r\n\r\n```bash\r\nprime eval run deceptive-hanabi\r\n```\r\n\r\nConfigure model and mode:\r\n\r\n```bash\r\nprime eval run deceptive-hanabi \\\r\n  -m gpt-4.1-mini \\\r\n  -n 10 -r 1 \\\r\n  -a '{\"deceptive_mode\": true}'\r\n```\r\n\r\n## Environment Arguments\r\n\r\n| Arg | Type | Default | Description |\r\n|-----|------|---------|-------------|\r\n| deceptive_mode | bool | true | Enable Spy mechanics (false = vanilla) |\r\n| num_episodes | int | 100 | Number of episodes |\r\n| num_players | int | 2 | Number of players (standard: 2) |\r\n| spy_probability | float | 0.3 | Chance of Spy per episode |\r\n| team_win_threshold | int | 15 | Team wins if score >= this |\r\n| max_turns | int | 60 | Max turns before game ends |\r\n\r\n## Metrics\r\n\r\n| Metric | Meaning |\r\n|--------|---------|\r\n| reward | Dense reward: score/25 + bonuses for good plays, penalties for bombs |\r\n| game_score | Final Hanabi score (0-25) |\r\n| good_plays | Number of successful card plays |\r\n| bad_plays | Number of bombs (failed plays) |\r\n| hints_given | Number of hints given |\r\n","encoding":"utf-8","truncated":false,"total_bytes":1580},"status":null}