{"data":{"kind":"file","path":"README.md","version_id":"gt5ty3hmxwgjf6y1r2hhm3gh","entry":{"name":"README.md","path":"README.md","is_directory":false,"size":1490,"modified_at":"2026-01-27T21:17:42.854000","content_hash":"a94600cf72240eef7388e545c35930b0d079e6a74ae5d233d3671305339b03c2"},"entries":[],"content":"# my-agent-task\r\n\r\n> Replace the placeholders below, then remove this callout.\r\n\r\n### Overview\r\n- **Environment ID**: `my-agent-task`\r\n- **Short description**: <one-sentence description>\r\n- **Tags**: <comma-separated tags>\r\n\r\n### Datasets\r\n- **Primary dataset(s)**: <name(s) and brief description>\r\n- **Source links**: <links>\r\n- **Split sizes**: <train/eval counts>\r\n\r\n### Task\r\n- **Type**: <single-turn | multi-turn | tool use>\r\n- **Parser**: <e.g., ThinkParser, XMLParser, custom>\r\n- **Rubric overview**: <briefly list reward functions and key metrics>\r\n\r\n### Quickstart\r\nRun an evaluation with default settings:\r\n\r\n```bash\r\nprime eval run my-agent-task\r\n```\r\n\r\nConfigure model and sampling:\r\n\r\n```bash\r\nprime eval run my-agent-task   -m gpt-4.1-mini   -n 20 -r 3 -t 1024 -T 0.7   -a '{\"key\": \"value\"}'  # env-specific args as JSON\r\n```\r\n\r\nNotes:\r\n- Use `-a` / `--env-args` to pass environment-specific configuration as a JSON object.\r\n\r\n### Environment Arguments\r\nDocument any supported environment arguments and their meaning. Example:\r\n\r\n| Arg | Type | Default | Description |\r\n| --- | ---- | ------- | ----------- |\r\n| `foo` | str | `\"bar\"` | What this controls |\r\n| `max_examples` | int | `-1` | Limit on dataset size (use -1 for all) |\r\n\r\n### Metrics\r\nSummarize key metrics your rubric emits and how they’re interpreted.\r\n\r\n| Metric | Meaning |\r\n| ------ | ------- |\r\n| `reward` | Main scalar reward (weighted sum of criteria) |\r\n| `accuracy` | Exact match on target answer |\r\n\r\n","encoding":"utf-8","truncated":false,"total_bytes":1490},"status":null}