Implement a git and go native parallel coding agent based on the plan/execute example by alexlovelltroy · Pull Request #196 · lanl/ursa

alexlovelltroy · 2026-03-06T13:36:40Z

This pull request introduces a new GitGoAgent for autonomous development in git-managed Go repositories, adds comprehensive documentation and example usage, and improves path validation and agent modularity. The most important changes are grouped below:

New Agent and Modular Architecture

Added GitAgent and GitGoAgent classes to provide git-aware agents with language-specific extensions, allowing easy specialization for Go projects and future extensibility for other languages. (src/ursa/agents/git_agent.py, src/ursa/agents/git_go_agent.py) [1] [2]
Introduced prompt composition utilities for git and Go, enabling modular agent instructions and clearer separation of git and language-specific behaviors. (src/ursa/prompt_library/git_prompts.py, src/ursa/prompt_library/go_prompts.py, src/ursa/prompt_library/git_go_prompts.py) [1] [2] [3]

Documentation and Examples

Added detailed documentation for GitGoAgent, including usage instructions, available tools, configuration, and common workflows. (docs/git_go_agent.md)
Provided a runnable example demonstrating GitGoAgent usage for git status and Go file summarization. (examples/single_agent_examples/git_go_agent/git_go_agent_example.py)
Registered the new agent in the documentation navigation. (mkdocs.yml)

Path Safety and Tool Improvements

Improved file path validation for code writing tools, ensuring files are written only within the workspace and optionally within the repository boundary, preventing path traversal and unauthorized writes. (src/ursa/tools/write_code_tool.py) [1] [2] [3]

Multi-Repo Planning Example

Added a YAML configuration example for orchestrating documentation generation across two repositories (boot-service and openchami.org), showcasing agent planning and execution capabilities in multi-repo scenarios. (examples/two_agent_examples/plan_execute/openchami_boot_docs_example.yaml)

Minor Improvements

Standardized error reporting in several example scripts for improved debugging clarity. (examples/two_agent_examples/plan_execute/city_10_vowels.py, examples/two_agent_examples/plan_execute/quantum_Rabi_QuTiP.py, examples/two_agent_examples/plan_execute/scrabble.py) [1] [2] [3]

…lidation for code writing - Implement shared utilities in `plan_execute_utils.py` for YAML config loading, dictionary merging, plan hashing, secret masking, LLM setup, workspace management, and SQLite snapshotting. - Create tests for `GitGoAgent` and `GitAgent` to verify tool availability and functionality. - Add tests for Go tooling functions including `go_build`, `go_test`, `go_vet`, and `golangci_lint` with error handling. - Enhance `PlanningAgent` tests with a fake chat model for structured plan creation. - Introduce path validation tests for `write_code` and `edit_code` functions to ensure security against path traversal and enforce workspace boundaries. Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

…ution scripts - Introduced a new YAML configuration for generating boot-service documentation for openchami.org. - Updated `plan_execute_from_yaml.py` to include new utility functions and imports for enhanced functionality. - Modified `plan_execute_multi_repo.py` to support asynchronous checkpointing and improved token tracking during execution. - Refactored `git_agent.py` for cleaner language handling logic. - Enhanced `parse.py` with additional file type support and improved error handling. Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

alexlovelltroy · 2026-03-09T15:32:46Z

The tests are passing on my dev host with python 3.12.10 and pytest 9.0.2. Is there a matrix of versions we're targeting for tool versions that I should be referencing?

mikegros · 2026-03-09T15:51:50Z

Reviewing this now - I think you forgot to commit a couple tools files in the tools folder. Ran into some test failures because of it:

    from ursa.tools.git_tools import GIT_TOOLS
E   ModuleNotFoundError: No module named 'ursa.tools.git_tools'

Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

alexlovelltroy · 2026-03-10T11:42:46Z

Thanks @mikegros I think it is fixed now. It looks like I had overlapping python environments that masked the missing file.

mikegros

Not finished reviewing but wanted to get a couple comments up so @luiarthur and @ndebard might be able to chime in.

src/ursa/tools/write_code_tool.py

examples/two_agent_examples/plan_execute/plan_execute_from_yaml.py

…d merging via command line.

…oading of agents Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

alexlovelltroy · 2026-03-11T18:52:21Z

I had more than one ursa workspace on my laptop and some of my tests were using the wrong one. I just added a few more commits to further rely on common utils for plan_execute loop scripts in the utils directory.

randomname is back everywhere!

I added simple test yaml(s) for both plan_execute and plan_execute_multi_repo

python plan_execute_multi_repo.py --config example_multi_repo.yaml

python plan_execute_from_yaml.py --config example_from_yaml.yaml

mikegros · 2026-03-16T18:16:27Z

I'll try to get this merged today after our team meeting. My only real holdup is the added path validation. I want to make sure that wont break anything existing workflows people know of.

We had talked before about whether or not we should add the hard restriction to stay in the workspace or if that should be an optional argument of the agent (have a flag to turn on that hard boundary).

I'd really like to clear this off today though. The basic PR is definitely worth working in. Its just some of the knock-on effects that this that have slowed it down (well that, along with the project appraisal).

luiarthur

I reviewed only write_code_tool.py and a handful of small sections. My main feedback is to permit writing outside of repo/workspace but default to permitting writes only within repo/workspace.

src/ursa/util/diff_renderer.py

src/ursa/tools/write_code_tool.py

Co-authored-by: Arthur Lui <5297817+luiarthur@users.noreply.github.com>

…able Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

- Hardened read_path errors that could lead to crash - Added explicit error handling for existence/type checking for directory paths - Enhanced errors that are bubbled up with context Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

alexlovelltroy · 2026-03-17T14:12:34Z

Updated write_code_tool.py to allow for unsafe writes. Documented in my git_go_agent docs. I didn't see a better place for it.

Path validation is enabled by default. For trusted sandbox/container usage, you can opt in to unsafe writes by setting:

export URSA_ALLOW_UNSAFE_WRITES=1

When enabled, workspace and repository boundary checks are bypassed for write_code and edit_code.

I also ran a bug analyzer and found a few places where write_code_tool could fail without good feedback to the user so I updated error handling to address it and added tests for all this behavior.

src/ursa/tools/write_code_tool.py

mikegros

Sorry I thought I submitted the other comment yesterday but noticed I put the comments in, but didnt submit the review

src/ursa/tools/write_code_tool.py

…ated functions, using environment variable instead Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

alexlovelltroy · 2026-03-26T13:19:34Z

@mikegros the check for file location is now only in _validate_file_path which is called from both write_code and edit_code. It follows the environment variable which means that the agent can't influence the safety when calling the tool. Is this what you were hoping to see?

mikegros · 2026-03-26T15:52:47Z

@mikegros the check for file location is now only in _validate_file_path which is called from both write_code and edit_code. It follows the environment variable which means that the agent can't influence the safety when calling the tool. Is this what you were hoping to see?

I appreciate the update on the validation, but the real blocking thing is the extra arg to write_code (line 89 of the write_code tool file).

I might just make a second write_code tool called something like write_code_with_repo or something and update your cases to call that tool instead of the baseline write_code tool. Because I dont want to keep blocking this PR, but I also don't want to make such a fundamental change to a core tool for an edge case (in addition to the other concerns I addressed in the comment).

Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

mikegros · 2026-03-27T15:37:31Z

I think this looks good. I will do some testing and make sure there are no usage gotchas. If i dont run into any problems I will merge.

Thanks for your effort and patience on this.

This was addressed but I dont want to bug you on your weekend to clear this off so I'll just do it.

mikegros · 2026-03-28T18:11:37Z

Thank you so much for your effort and patience. It's all set!

alexlovelltroy added 2 commits March 2, 2026 12:25

alexlovelltroy mentioned this pull request Mar 6, 2026

Multi-repo Coding Agent based on Plan/Execute example #190

Closed

alexlovelltroy changed the title ~~Alovelltroy/coding agent~~ Implement a git and go native parallel coding agent based on the plan/execute example Mar 6, 2026

Add Git and Go tooling modules with corresponding tests

91912f6

Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

Lazy load new agents in this branch.

2435ac0

mikegros reviewed Mar 10, 2026

View reviewed changes

src/ursa/tools/write_code_tool.py Show resolved Hide resolved

examples/two_agent_examples/plan_execute/plan_execute_from_yaml.py Show resolved Hide resolved

alexlovelltroy added 6 commits March 11, 2026 13:30

Retry commit of CSV utility functions after formatting.

cc3cf48

Document dependencies for CSV processing in requirements.txt.

7569449

Add documentation with usage examples for CSV utility functions.

07857e1

Integrated utility functions into CLI tool, allowing CSV filtering an…

dc4ae16

…d merging via command line.

Add example YAML configurations for plan execution and enhance lazy l…

feb6ace

…oading of agents Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

Remove CSV utility functions and example usage scripts

d0eb149

Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

Merge branch 'lanl:main' into alovelltroy/coding-agent

76d57dd

luiarthur previously requested changes Mar 16, 2026

View reviewed changes

src/ursa/util/diff_renderer.py Outdated Show resolved Hide resolved

src/ursa/tools/write_code_tool.py Show resolved Hide resolved

src/ursa/tools/write_code_tool.py Show resolved Hide resolved

src/ursa/tools/write_code_tool.py Outdated Show resolved Hide resolved

alexlovelltroy and others added 3 commits March 17, 2026 09:28

Apply suggestions from code review

04e6c98

Co-authored-by: Arthur Lui <5297817+luiarthur@users.noreply.github.com>

Enhance path validation to support unsafe writes via environment vari…

783fa58

…able Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

Addressed several bug code paths

5ab40db

- Hardened read_path errors that could lead to crash - Added explicit error handling for existence/type checking for directory paths - Enhanced errors that are bubbled up with context Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

mikegros reviewed Mar 17, 2026

View reviewed changes

src/ursa/tools/write_code_tool.py Outdated Show resolved Hide resolved

mikegros reviewed Mar 17, 2026

View reviewed changes

src/ursa/tools/write_code_tool.py Outdated Show resolved Hide resolved

Remove allow_unsafe_writes parameter from _validate_file_path and rel…

0bc8bb5

…ated functions, using environment variable instead Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

alexlovelltroy requested a review from luiarthur March 26, 2026 13:19

alexlovelltroy and others added 2 commits March 26, 2026 20:27

Merge branch 'lanl:main' into alovelltroy/coding-agent

17c18b7

feat: add write_code_with_repo function to enforce repository boundaries

76e7b3e

Signed-off-by: Alex Lovell-Troy <alovelltroy@lanl.gov>

mikegros merged commit a16cfd7 into lanl:main Mar 28, 2026
1 of 2 checks passed

Conversation

alexlovelltroy commented Mar 6, 2026

Uh oh!

alexlovelltroy commented Mar 9, 2026

Uh oh!

mikegros commented Mar 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexlovelltroy commented Mar 10, 2026

Uh oh!

mikegros left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

alexlovelltroy commented Mar 11, 2026

Uh oh!

mikegros commented Mar 16, 2026

Uh oh!

luiarthur left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexlovelltroy commented Mar 17, 2026

Uh oh!

Uh oh!

mikegros left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alexlovelltroy commented Mar 26, 2026

Uh oh!

mikegros commented Mar 26, 2026

Uh oh!

mikegros commented Mar 27, 2026

Uh oh!

Uh oh!

mikegros commented Mar 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mikegros commented Mar 9, 2026 •

edited

Loading