@adityasoni9998 adityasoni9998 commented Dec 8, 2025

Benchmarking code to evaluate how well open-source LLMs can localize the source files that need to be edited to fix a given GitHub issue.

  • Note: DockerWorkspace doesn't work for me on the latest benchmarks repo, although it did work on an older version. LocalWorkspace works fine. I am using v1.4.1 of the software agent SDK with the ghcr.io/openhands/agent-server:latest-python Docker image, so there were probably some breaking changes in a recent Docker image?

@neubig neubig left a comment

@adityasoni9998 please fix the failing CI checks and re-request review.

3 participants