{"data":{"kind":"file","path":"README.md","version_id":"cww7in5rhahmrapd23anfdox","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":731,"modified_at":"2025-10-01T17:22:51.927000","content_hash":"ce04704510e6101648582852ebf6d69f3b6c3ca2e9126dbb13795f2fc6b484fa"},"entries":[],"content":"# medcasereasoning\n\n### Overview\n- **Environment ID**: `medcasereasoning`\n- **Short description**: MedCaseReasoning dataset from Stanford (Wu et al. 2025)\n- **Tags**: \n\n### Datasets\n- **Primary dataset(s)**: <name(s) and brief description>\n- **Source links**: https://huggingface.co/datasets/zou-lab/MedCaseReasoning\n- **Split sizes**: 13.1k (train) / 500 (val) / 897 (test)\n\n### Task\n- **Type**: single-turn\n- **Parser**: JudgeRubric\n- **Rubric overview**: <briefly list reward functions and key metrics>\n\n### Quickstart\nRun an evaluation with default settings:\n\n```bash\nuv run vf-eval medcasereasoning\n```\n\nConfigure model and sampling:\n\n```bash\nuv run vf-eval medcasereasoning   -m gpt-4.1-mini   -n 20 -r 3 -t 1024 -T 0.7\n```\n\n","encoding":"utf-8","truncated":false,"total_bytes":731},"status":null}