Trajectory

A Trajectory represents the detailed recording of a single run of an Agent within an Environment for a specific Task.

Overview

Trajectories capture the step-by-step history of an agent’s interaction, useful for analysis, debugging, and visualization.

They are automatically generated and associated with a Job when env.close() is called on a linked environment.

Accessing Trajectories

The primary way to access trajectories is through a Job object using job.load_trajectories():

from hud import load_job

async def analyze_job_trajectories(job_id: str):
    job = await load_job(job_id)
    trajectories = await job.load_trajectories()

    for i, traj in enumerate(trajectories):
        print(f"--- Trajectory {i+1} (ID: {traj.id}) ---")
        print(f"  Reward: {traj.reward}")
        print(f"  Number of steps: {len(traj.trajectory)}") # Access the list of steps
        if traj.error:
            print(f"  Error: {traj.error}")

        # You can iterate through individual steps if needed
        # for step_index, step_data in enumerate(traj.trajectory):
        #    print(f"    Step {step_index}: Actions: {step_data.actions}")
        #    print(f"    Step {step_index}: Obs Text: {step_data.observation_text}")
        #    print(f"    Step {step_index}: Obs Image URL: {step_data.observation_url}")

Key Properties

A Trajectory object contains:

id (str): Unique ID for this run.
reward (float | None): The final evaluation score from the Task’s evaluate logic.
logs (str | None): Captured logs.
error (str | None): Error message if the run failed.
trajectory (list[TrajectoryStep]): List of individual steps.

Each TrajectoryStep contains:

observation_url (str | None): URL to the step’s screenshot.
observation_text (str | None): Text observed in the step.
actions (list[dict]): Agent action(s) leading to this step’s observation.
start_timestamp / end_timestamp (str | None): Step timing.

Visualization

HUD Platform: The Jobs page offers the best visualization, including videos.
Jupyter: The trajectory.display() method provides basic step-by-step rendering.

# In Jupyter:
# traj.display()

Job: How trajectories are grouped and accessed.
Environment: Generates the trajectory data during a run.
Task: Defines the scenario and evaluation logic recorded.

Getting Started

Examples

Features

Concepts

Environments

Trajectory

Trajectory

Overview

Accessing Trajectories

Key Properties

Visualization

Getting Started

Examples

Features

Concepts

Environments

​Trajectory

​Overview

​Accessing Trajectories

​Key Properties

​Visualization

​Related Concepts

Trajectory

Overview

Accessing Trajectories

Key Properties

Visualization

Related Concepts