{"data":{"kind":"file","path":"README.md","version_id":"su56r4d3ovse966jaaw5p48b","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1731,"modified_at":"2026-02-23T14:07:37.019000","content_hash":"8fe39d3e521240d8310a42e6163b163b0be9fd6451d1d7ea670ba7b5ad3ea8c3"},"entries":[],"content":"# pubmedqa-yesno\n\nBiomedical yes/no/maybe question answering environment using [PubMedQA](https://huggingface.co/datasets/qiaojin/PubMedQA).\n\nThe model is given a biomedical research question and supporting context passages, and must answer with exactly **yes**, **no**, or **maybe** inside `<answer>` tags.\n\n## Task\n\n- **Dataset:** `qiaojin/PubMedQA` (`pqa_artificial` config)\n- **Split:** `train` split, 90/10 train/eval split with seed 42\n- **Input:** Research question + context passages\n- **Output:** `<answer>yes|no|maybe</answer>`\n- **Reward:** 1.0 for correct label, 0.0 otherwise (+ 0.2 format bonus)\n\n## Example\n\n**Prompt:**\n```\nQuestion: Does high-dose vitamin C supplementation improve outcomes in sepsis patients?\n\nContext:\nMETHODS: We conducted a randomized controlled trial of 200 sepsis patients...\nRESULTS: No significant difference was observed in 28-day mortality...\n\nAnswer with yes, no, or maybe.\n```\n\n**Expected response:**\n```\nThe study found no significant difference in outcomes, suggesting high-dose vitamin C\ndoes not improve sepsis outcomes based on current evidence.\n<answer>no</answer>\n```\n\n## Reward\n\n| Component | Weight | Description |\n|-----------|--------|-------------|\n| `correct_label` | 1.0 | Exact match of yes/no/maybe after normalization |\n| `format_reward` | 0.2 | Model used `<answer>` tags correctly |\n\n## Usage\n\n```python\nfrom pubmedqa_yesno import load_environment\n\nenv = load_environment()                          # full train split\nenv = load_environment(num_examples=100)          # subset for testing\nenv = load_environment(config=\"pqa_labeled\")      # different PubMedQA config\n```\n\n## Environment Variables\n\nNone required. Dataset is loaded from HuggingFace Hub automatically.\n","encoding":"utf-8","truncated":false,"total_bytes":1731},"status":null}