ExecutionAgent Documentation

ExecutionAgent is a class that enables AI-powered code execution, writing, and editing. It uses a state machine architecture to safely execute commands, write code files, and search for information.

Basic Usage

from ursa.agents import ExecutionAgent

# Initialize the agent
agent = ExecutionAgent()

# Run a prompt
result = agent("Write and execute a python script to print the first 10 integers.")

# Access the final response
print(result["messages"][-1].text)

Parameters

When initializing ExecutionAgent, you can customize its behavior with these parameters:

Parameter	Type	Default	Description
`llm`	`BaseChatModel`	`init_chat_model("openai:gpt-5-mini")`	The LLM model to use
`extra_tools`	`Optional[list[Callable[..., Any]]]`	`None`	Additional tools for the execution agent

Features

Code Execution

The agent can safely execute shell commands in a controlled environment:

result = agent("Install numpy and create a script that uses it to calculate the mean of [1, 2, 3, 4, 5]")

Code Writing

The agent can write code files to a workspace directory:

result = agent("Create a Flask web application that displays 'Hello World'")

Advanced Usage

Customizing the Workspace

The agent creates a workspace folder with a randomly generated name for each run. You can access this workspace path from the agent:

result = agent("Create a Python script"))
workspace_path = agent.workspace
print(f"Files were created in: {workspace_path}")

Setting a Recursion Limit

For complex tasks, you may need to adjust the recursion limit:

result = agent.invoke(
    "Create a complex project with multiple files and tests",
    recursion_limit=2000
)

Safety Features

The agent includes built-in safety checks for shell commands:

Commands are evaluated for safety before execution
Unsafe commands are blocked with explanations
The agent suggests safer alternatives when appropriate

How It Works

State Machine: The agent uses a directed graph to manage its workflow:

agent node: Processes user requests and generates responses

action node: Executes tools (run_cmd, write_code, edit_code, search)

extra tools can be provided to the agent as follows:

from langchain.tools import tool

@tool
def do_magic(a: int, b: int) -> float:
    """Do magic with integers a and b.

    Args:
        a: first integer
        b: second integer
    """
    return sqrt(a**2 + b**2)

agent = ExecutionAgent(extra_tools=[do_magic])

summarize node: Creates a final summary when complete

Tools:
- run_cmd: Executes shell commands in the workspace directory
- write_code: Creates new code files with syntax highlighting
- edit_code: Modifies existing code files with diff preview
- search_tool: Performs web searches via DuckDuckGo
Visualization:
- Code changes are displayed with syntax highlighting
- File edits show detailed diffs
- Command execution shows stdout and stderr

Notes

The agent creates a new workspace directory for each run
Files are written to and executed from this workspace
Shell commands have a 60000-second timeout by default
The agent can handle keyboard interrupts during command execution

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ExecutionAgent Documentation

Basic Usage

Parameters

Features

Code Execution

Code Writing

Advanced Usage

Customizing the Workspace

Setting a Recursion Limit

Safety Features

How It Works

Notes

FilesExpand file tree

execution_agent.md

Latest commit

History

execution_agent.md

File metadata and controls

ExecutionAgent Documentation

Basic Usage

Parameters

Features

Code Execution

Code Writing

Advanced Usage

Customizing the Workspace

Setting a Recursion Limit

Safety Features

How It Works

Notes