Skip to content

Commit 22981ce

Browse files
authored
Merge pull request hud-evals#184 from hud-evals/l/dev-log-updates
L/dev log updates
2 parents fcd2488 + ee01b61 commit 22981ce

35 files changed

+3428
-439
lines changed

docs/core-concepts/architecture.mdx

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ HUD is built as a thin orchestration layer on top of MCP, providing structure fo
1111
```mermaid
1212
graph TB
1313
subgraph "Remote"
14-
Dashboard["📊 Dashboard<br/>(hud.so)"]
14+
Dashboard["📊 Dashboard<br/>(hud.ai)"]
1515
API["🔌 MCP Orchestrator<br/>(mcp.hud.so)"]
1616
end
1717
@@ -158,7 +158,7 @@ import hud
158158

159159
with hud.trace("my-evaluation"):
160160
result = await agent.run(task)
161-
# View at hud.so/traces/{trace_id}
161+
# View at hud.ai/traces/{trace_id}
162162
```
163163

164164
## Execution Flow

docs/docs.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@
2424
],
2525
"topbarCtaButton": {
2626
"name": "Dashboard",
27-
"url": "https://hud.so"
27+
"url": "https://hud.ai"
2828
},
2929
"navigation": {
3030
"versions": [

docs/index.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ graph LR
4545
## Why HUD?
4646

4747
- **🔌 MCP-native**: Any agent can connect to any environment
48-
- **📡 Live telemetry**: Debug every tool call at [hud.so](https://hud.so)
48+
- **📡 Live telemetry**: Debug every tool call at [hud.ai](https://hud.ai)
4949
- **🚀 Production-ready**: From local Docker to cloud scale
5050
- **🎯 Built-in benchmarks**: OSWorld-Verified, SheetBench-50, and more
5151
- **🔧 CLI tools**: Create, develop, run, and train with `hud init`, `hud dev`, `hud run`, `hud eval`, `hud rl`

docs/quickstart.mdx

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ uv tool install hud-python
4646
Set your API keys as environment variables:
4747

4848
```bash
49-
export HUD_API_KEY="sk-hud-..." # Get from hud.so
49+
export HUD_API_KEY="sk-hud-..." # Get from hud.ai
5050
export ANTHROPIC_API_KEY="sk-ant-..." # For Claude agents
5151
export OPENAI_API_KEY="sk-..." # For OpenAI agents
5252
```
@@ -67,7 +67,7 @@ from hud.agents import ClaudeAgent
6767

6868
async def main():
6969
# The trace context captures ALL agent interactions within a "task run"
70-
# Everything inside this context shows up as one trace on hud.so
70+
# Everything inside this context shows up as one trace on hud.ai
7171
with hud.trace("quickstart-2048"):
7272
# Define task with remote MCP environment
7373
task = Task(

docs/train-agents/quickstart.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ icon: "graduation-cap"
88
- HUD API key: Remote training requires authentication. Set `HUD_API_KEY` before running:
99

1010
```bash
11-
export HUD_API_KEY="sk-hud-..." # get one at https://hud.so
11+
export HUD_API_KEY="sk-hud-..." # get one at https://hud.ai
1212
# Or persist it locally:
1313
hud set HUD_API_KEY=sk-hud-...
1414
```

environments/README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -495,7 +495,7 @@ from hud.agents import ClaudeAgent
495495
from hud.clients import MCPClient
496496

497497
async def main():
498-
# `trace` captures *everything* that happens and sends it to hud.so
498+
# `trace` captures *everything* that happens and sends it to hud.ai
499499
with hud.trace("local_test"):
500500
task = Task(
501501
prompt="Complete the task",
@@ -524,15 +524,15 @@ async def main():
524524
asyncio.run(main())
525525
```
526526

527-
The `trace` context manager sends a full timeline of agent actions, tool calls, and rewards to hud.so – perfect for debugging.
527+
The `trace` context manager sends a full timeline of agent actions, tool calls, and rewards to hud.ai – perfect for debugging.
528528

529529
See `examples/01_hello_2048.py` and `examples/task_with_setup_eval.py` for larger end-to-end demos.
530530

531531
---
532532

533533
## Phase 4 – Remote Deployment & HUD Runner
534534

535-
**Goal →** the exact same image runs in parallel on hundreds of instances, and exposes more telemetry so the hud.so can visualise the whole lifecycle.
535+
**Goal →** the exact same image runs in parallel on hundreds of instances, and exposes more telemetry so the hud.ai can visualise the whole lifecycle.
536536

537537
### 1. Publish your image
538538

@@ -595,11 +595,11 @@ async def initialize_environment(session=None, progress_token=None):
595595
await send(100, "ready")
596596
```
597597

598-
Those messages are displayed live on hud.so alongside resource graphs – perfect feedback while you wait.
598+
Those messages are displayed live on hud.ai alongside resource graphs – perfect feedback while you wait.
599599

600600
### 4. Live telemetry (`telemetry://live`) (Optional)
601601

602-
Expose a resource named `telemetry://live` exactly like in `environments/browser/src/hud_controller/server.py` to return live url to be displayed on hud.so.
602+
Expose a resource named `telemetry://live` exactly like in `environments/browser/src/hud_controller/server.py` to return live url to be displayed on hud.ai.
603603

604604
Once all of the above works you can unleash *hundreds* of concurrent agents on your new environment.
605605

0 commit comments

Comments
 (0)