{"data":{"kind":"file","path":"README.md","version_id":"zmo0bemklkq3cy46fmc5hwhl","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1724,"modified_at":"2026-03-09T17:58:21.271000","content_hash":"9d5006dbb739eceb59c6c4c79f374187d091f0679dfbafaa7477cc0eb842b7e4"},"entries":[],"content":"# scientific-literature-review\n\nThis environment tests an agent's ability to perform a basic scientific literature review, identifying key papers, extracting relevant information, and synthesizing findings. It evaluates the agent's proficiency in navigating scientific databases and understanding research abstracts.\n\n## Overview\n\n**Domain**: science\n**Base Class**: ToolEnv\n**Difficulty**: medium\n**Task**: The model must identify relevant scientific papers based on a given research question, extract specific data points (e.g., methodology, key findings), and summarize the collective insights from the selected literature.\n\n## Quickstart\n\n### Installation\n\n```bash\nuv run vf-install scientific-literature-review\n```\n\n### Usage\n\n```python\nimport verifiers as vf\n\nenv = vf.load_environment(\"scientific-literature-review\")\nresults = env.evaluate_sync(\n    client=vf.OpenAI(),\n    model=\"gpt-4.1-mini\",\n    num_examples=10,\n    rollouts_per_example=1\n)\n```\n\n### Evaluation\n\nRun an evaluation with default settings:\n\n```bash\nuv run vf-eval scientific-literature-review\n```\n\nConfigure model and sampling:\n\n```bash\nuv run vf-eval scientific-literature-review \\\n  -m gpt-4.1-mini \\\n  -n 20 -r 3 -t 1024 -T 0.7\n```\n\n## Environment Arguments\n\n| Arg | Type | Default | Description |\n|-----|------|---------|-------------|\n| `num_examples` | int | 1000 | Number of training examples |\n| `num_eval_examples` | int | 100 | Number of evaluation examples |\n| `seed` | int | 42 | Random seed for reproducibility |\n\n## Metrics\n\n| Metric | Meaning |\n|--------|---------|\n| `reward` | Primary reward signal |\n| `format_reward` | Format adherence reward (if applicable) |\n\n## About\n\nGenerated by synthetic-rl-env-creator.\n\n**Tags**: science\n","encoding":"utf-8","truncated":false,"total_bytes":1724},"status":null}