hud rl
The hud rl command trains an agent with GRPO on tasks, either locally or via the HUD remote service.
Usage
Arguments
Path to tasks JSON/JSONL file or HuggingFace dataset name. If omitted, looks for a tasks file in the current directory.
Model to train (default: interactive selection)
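To make the two accepted file formats concrete, here is a minimal sketch of reading a tasks file in either JSON or JSONL form. The function name `load_tasks` is hypothetical and is not part of the HUD API; the sketch assumes a `.json` file holds one object or an array of objects and a `.jsonl` file holds one object per line. It says nothing about the fields a task must contain.

```python
import json
from pathlib import Path


def load_tasks(path):
    """Load task records from a .json or .jsonl file (illustrative helper, not HUD's loader)."""
    path = Path(path)
    text = path.read_text(encoding="utf-8")
    if path.suffix.lower() == ".jsonl":
        # JSONL: each non-empty line is parsed as its own JSON object.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    data = json.loads(text)
    # A plain JSON file may hold a single object or a list of objects.
    return data if isinstance(data, list) else [data]
```

Calling `load_tasks("tasks.jsonl")` returns a list of dictionaries, one per task, whichever format the file uses.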
Options
Path to existing configuration file. Short: -c
Output directory for checkpoints. Short: -o
Restart the vLLM server before training
Enable verbose output. Short: -v
Disable DistributedDataParallel (even with multiple GPUs)
Specific GPUs for DDP (e.g., 0,1,2,3)
Specific GPU for vLLM server
Run training locally instead of the remote HUD server
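The GPU list option takes a comma-separated value such as 0,1,2,3. As a rough illustration of how such a value can be validated before use, the sketch below turns it into a list of integers. The helper `parse_gpu_ids` is hypothetical, and this is not necessarily how hud parses the option.

```python
def parse_gpu_ids(value):
    """Turn a string such as "0,1,2,3" into [0, 1, 2, 3] (illustrative helper)."""
    ids = []
    for part in value.split(","):
        part = part.strip()
        # Accept only non-negative integers written with ASCII digits.
        if not (part.isascii() and part.isdigit()):
            raise ValueError(f"invalid GPU id: {part!r}")
        ids.append(int(part))
    # The same device listed twice is almost certainly a typo.
    if len(set(ids)) != len(ids):
        raise ValueError("duplicate GPU id")
    return ids
```

Rejecting empty, negative, or duplicate entries early gives a clearer error than a failure later during process-group setup.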
Behavior
- If no tasks file is provided, an interactive picker helps locate one.
- Remote mode (default) converts tasks to remote MCP automatically (build/push as needed) and launches remote training.
- Local mode runs training on your machine (delegated to local_runner).
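The behavior listed above can be summarized as a small decision flow. The following is a sketch only: `find_tasks_file` and `plan_run` are hypothetical names, the rule "first .json or .jsonl file in the directory" is an assumption about discovery, and the returned step descriptions simply restate the bullets rather than call any HUD code.

```python
from pathlib import Path


def find_tasks_file(directory="."):
    """Return the first .json/.jsonl file in `directory`, or None (assumed discovery rule)."""
    candidates = sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() in {".json", ".jsonl"}
    )
    return candidates[0] if candidates else None


def plan_run(tasks_file=None, local=False, directory="."):
    """Describe what a run would do, without performing any training."""
    tasks = Path(tasks_file) if tasks_file else find_tasks_file(directory)
    if tasks is None:
        # Nothing supplied and nothing found: fall back to an interactive picker.
        return {"mode": None, "tasks": None,
                "steps": ["ask the user to pick a tasks file"]}
    if local:
        steps = ["run training on this machine"]
    else:
        # Remote is the default mode.
        steps = ["convert tasks to remote MCP (build/push as needed)",
                 "launch remote training"]
    return {"mode": "local" if local else "remote",
            "tasks": str(tasks), "steps": steps}
```

For example, `plan_run("tasks.jsonl")` describes a remote run, while `plan_run("tasks.jsonl", local=True)` describes a local one.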