{"data":{"kind":"file","path":"README.md","version_id":"yakhr7zecx179pdzvqthn69x","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1552,"modified_at":"2026-02-08T18:27:39.226000","content_hash":"71525e7faa051a7e0e253c15c4c95a976bfa723db25b6e1782a4cb7e75177eea"},"entries":[],"content":"# OpenMed Dreaddit\n\nStress detection in social media posts for RL fine-tuning.\n\n## Task\n\nGiven a Reddit post, classify whether the author is experiencing significant stress or distress.\n\n## Dataset\n\n- **Source**: `asmaab/dreaddit`\n- **Size**: 3,555 labeled posts (train: 2,270, val: 568, test: 715)\n- **Labels**: Binary (stressed / not stressed)\n- **Subreddits**: anxiety, PTSD, depression, domesticviolence, relationships, etc.\n- **Paper**: [Dreaddit: A Reddit Dataset for Stress Analysis](https://arxiv.org/abs/1911.00133)\n\n## Reward Structure\n\n| Reward | Weight | Description |\n|--------|--------|-------------|\n| Accuracy | 45% | Exact label match |\n| Partial Match | 20% | Credit for valid prediction |\n| Thinking | 20% | Quality of stress assessment reasoning in `<think>` tags |\n| Format | 15% | Proper `\\boxed{}` or `|answer|` format |\n\n## Example\n\n**Input**: \"I can't sleep anymore. Every night I lie awake thinking about everything that went wrong. My heart races and I can't stop the thoughts.\"\n\n**Expected Output**:\n```\n<think>\nThe author describes insomnia, racing thoughts, and physical symptoms (racing heart).\nThese are classic indicators of anxiety and stress. The inability to control thoughts\nand the sleep disruption suggest significant distress.\n</think>\n\\boxed{stressed}\n```\n\n## Citation\n\n```\n@inproceedings{turcan2019dreaddit,\n  title={Dreaddit: A Reddit Dataset for Stress Analysis in Social Media},\n  author={Turcan, Elsbeth and McKeown, Kathleen},\n  booktitle={EMNLP Workshop on NLP for Internet Freedom},\n  year={2019}\n}\n```\n","encoding":"utf-8","truncated":false,"total_bytes":1552},"status":null}