{"data":{"kind":"file","path":"README.md","version_id":"znsqzarkhk0lfz21gfb4hwwe","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1709,"modified_at":"2026-03-09T17:55:19.356000","content_hash":"cab56e91f875dd189e384fb62125dccf3115a366bff039b8a8ecf189e1d3b841"},"entries":[],"content":"# clinical-diagnosis-assistant\n\nThis environment tests an agent's ability to diagnose medical conditions by iteratively gathering patient information, ordering diagnostic tests, and consulting medical knowledge bases. The agent must manage patient state and tool usage to arrive at an accurate diagnosis.\n\n## Overview\n\n**Domain**: medicine\n**Base Class**: StatefulToolEnv\n**Difficulty**: medium\n**Task**: The model must interact with a simulated patient, ask relevant questions, order appropriate diagnostic tests, and use medical reference tools to determine the correct diagnosis for a given set of symptoms.\n\n## Quickstart\n\n### Installation\n\n```bash\nuv run vf-install clinical-diagnosis-assistant\n```\n\n### Usage\n\n```python\nimport verifiers as vf\n\nenv = vf.load_environment(\"clinical-diagnosis-assistant\")\nresults = env.evaluate_sync(\n    client=vf.OpenAI(),\n    model=\"gpt-4.1-mini\",\n    num_examples=10,\n    rollouts_per_example=1\n)\n```\n\n### Evaluation\n\nRun an evaluation with default settings:\n\n```bash\nuv run vf-eval clinical-diagnosis-assistant\n```\n\nConfigure model and sampling:\n\n```bash\nuv run vf-eval clinical-diagnosis-assistant \\\n  -m gpt-4.1-mini \\\n  -n 20 -r 3 -t 1024 -T 0.7\n```\n\n## Environment Arguments\n\n| Arg | Type | Default | Description |\n|-----|------|---------|-------------|\n| `num_examples` | int | 1000 | Number of training examples |\n| `num_eval_examples` | int | 100 | Number of evaluation examples |\n| `seed` | int | 42 | Random seed for reproducibility |\n\n## Metrics\n\n| Metric | Meaning |\n|--------|---------|\n| `reward` | Primary reward signal |\n| `format_reward` | Format adherence reward (if applicable) |\n\n## About\n\nGenerated by synthetic-rl-env-creator.\n\n**Tags**: medicine\n","encoding":"utf-8","truncated":false,"total_bytes":1709},"status":null}