{"data":{"kind":"file","path":"README.md","version_id":"xlxr3toq5t3tdllbbcykwvbk","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1371,"modified_at":"2026-02-08T18:43:18.840000","content_hash":"1e1541956bda32b1f990b3bfcff5f7759bf81c49e99880f628c9996d937ce80d"},"entries":[],"content":"# OpenMed MedKnowledge\n\nComprehensive medical knowledge MCQ environment using MMLU medical subsets.\n\n## Task\n\nGiven a medical knowledge question with 4 answer choices spanning anatomy, clinical knowledge, college medicine, medical genetics, and professional medicine, select the correct answer.\n\n## Dataset\n\n- **Source**: [cais/mmlu](https://huggingface.co/datasets/cais/mmlu) (medical subsets)\n- **Subsets**: anatomy, clinical_knowledge, college_biology, college_medicine, medical_genetics, professional_medicine\n- **Size**: ~3K questions across 6 medical domains\n- **Format**: MCQ with 4 choices (A, B, C, D)\n- **License**: MIT\n\n## Reward Structure\n\n| Reward Function | Weight | Description |\n|----------------|--------|-------------|\n| accuracy_reward | 45% | Exact letter match |\n| partial_match_reward | 20% | Credit for valid letter selection |\n| thinking_reward | 20% | Quality of medical reasoning |\n| format_reward | 15% | Proper \\\\boxed{} or \\|answer\\| format |\n\n## Example\n\n**Input**: \"Which of the following is the primary function of hemoglobin?\\nA. Oxygen transport\\nB. Immune defense\\nC. Blood clotting\\nD. Nutrient transport\"\n\n**Expected Output**: `\\boxed{A}`\n\n## Citation\n\n```bibtex\n@article{hendrycks2021measuring,\n  title={Measuring Massive Multitask Language Understanding},\n  author={Hendrycks, Dan and others},\n  journal={ICLR},\n  year={2021}\n}\n```\n","encoding":"utf-8","truncated":false,"total_bytes":1371},"status":null}