Tasks format
HUD tasksets can be provided in two primary formats (both supported):- A single JSON file containing a list of task objects (recommended)
basic-2048.json
and run:
- JSONL file with one task object per line
- prompt: instruction for the agent
- mcp_config: where to run the environment (local docker or remote MCP)
- setup_tool (optional): a tool call to prepare the environment
- evaluate_tool: a tool call to compute reward
- system_prompt (optional): extra guidance for the agent
Minimal JSONL example
basic-2048.jsonl
and run:
Hosting on HuggingFace
You can host tasksets on the Hub and fetch them with:Tips
- Keep tasks self-contained; use
setup_tool
to open apps or load data - Ensure
evaluate_tool
returns a numeric reward per episode - Use small task counts to iterate quickly; scale up once stable