Skip to main content
Build MCP environments that wrap any software for agent interaction. Think of it in three phases: Phase 1: Environment - Wrap software in MCP tools Phase 2: Tasks - Define evaluation scenarios Phase 3: Agents - Run evaluations and training

Phase 1 · Create a project (2 min)

# Pick a template: blank, deep-research, browser
hud init my-env
cd my-env
Start development servers:
# Terminal 1 - Environment backend
cd environment && uv run uvicorn server:app --reload

# Terminal 2 - MCP server  
cd server && uv run hud dev

Edit-save-test flow

  1. Open server/tools.py, add or tweak a tool.
  2. Save – the mcp restarts instantly.
  3. Visit http://localhost:8765/docs to test tools/

Phase 2 · Write Tasks (2 min)

Build your environment image first (in the global folder):
hud build
Create tasks.json using docker run:
{
  "prompt": "Complete task",
  "mcp_config": {
    "local": {
      "command": "docker",
      "args": ["run", "--rm", "-i", "my-env:0.1.0"]
    }
  },
  ...your setup and evaluation tools
}
See Task System or the hud init README for details.

Phase 3: Run Agents

# Test with agents
hud eval tasks.json

# Deploy to registry
hud push

# Train agents on your tasks
hud rl tasks.json

Cheatsheet

ActionCommand
Create envhud init my-env -p blank
Hot-reload devhud dev --build
Interactive testhud dev --interactive
Troubleshoothud debug my-env:dev
Build imagehud build
Push to registryhud push
RL traininghud rl tasks.json

Learn more →

Have fun – and remember: stderr for logs, stdout for MCP!
I