feat(engine): add RemoteAgentFlowEngine for remote agent runtimes by luyuzhe111 · Pull Request #441 · rllm-org/rllm

luyuzhe111 · 2026-03-13T20:24:29Z

Summary

This PR adds the integration of AWS Bedrock AgentCore Runtime as a sandboxed remote agent runtime backend where agents and environments collocate. AgentCore Runtime handles isolation and auto-scaling to allow secure and parallel rollouts.

Type of change

What changed

Introduce RemoteAgentFlowEngine and RemoteAgentRuntime protocol for "agent-in-sandbox" runtimes where agent and environment colocate in a remote container.
Include AgentCore Runtime adapter using ART's RolloutClient, auto-detection of routable gateway IP for cross-host access, and shared compute_step_metrics utility.
Gateway now binds to 0.0.0.0 while advertising the routable IP to remote agents to allow access
Add per-session sampling parameters support in gateway server (different sampling params for train vs val). GatewayManager registers the parameters and selects the right one based on the is_validation flag.
In pyproject.toml: add agentcore as an extra dependency, add rllm-model-gateway as a uv source to install in editable mode, and enable ruff line check (default is 88 chars, which results in numerous cosmetic wraps; changed to 120 to align with modern convention).

Validation

pre-commit run --all-files
Targeted tests: pytest ...
Manual validation performed
Not run (reason below)

It seems there are a bunch of pre-commit errors from existing code. might need a separate PR to clean them up.

Validation details:

uv run pytest tests/engine/test_remote_runtime.py -v passed.
uv run pytest tests/integration/ -v -s passed locally with AGENTCORE_AGENT_ARN, AGENTCORE_S3_BUCKET, AGENTCORE_BASE_URL, and AGENTCORE_MODEL_ID set.
e2e training with bash examples/agentcore_math/train_agentcore_math_tinker.sh. The reward curves from rllm-ui are attached below:

Train:

Val:

Docs / examples

Not needed
Updated docs
Updated examples
Follow-up docs needed

…lm-model-gateway in pyproject.toml

luyuzhe111 added 7 commits March 23, 2026 08:23

feat(engine): add RemoteAgentFlowEngine for remote agent runtimes

64c7b80

chore: add agentcore extras group + [tool.uv.sources] for editable rl…

2bf15ba

…lm-model-gateway in pyproject.toml

feat(gateway): support per-session sampling parameters

1b8aa78

feat(engine): refine remote runtime protocol and improve error

d590232

fix: add top_k sampling parameter support for tinker adapter

d1ba2dc

feat(config): add remote_runtime configuration section

a2f05b9

feat(examples): add GSM8K math agent training with AgentCore

31255f6

luyuzhe111 force-pushed the agentcore_integ branch from e52b185 to 31255f6 Compare March 24, 2026 18:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(engine): add RemoteAgentFlowEngine for remote agent runtimes#441

feat(engine): add RemoteAgentFlowEngine for remote agent runtimes#441
luyuzhe111 wants to merge 7 commits intorllm-org:mainfrom
luyuzhe111:agentcore_integ

luyuzhe111 commented Mar 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

luyuzhe111 commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Type of change

What changed

Validation

Docs / examples

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

luyuzhe111 commented Mar 13, 2026 •

edited

Loading