Skip to content

Pull requests: OpenHands/benchmarks

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Open agent safety
#91 opened Nov 13, 2025 by MadhaviSG Loading…
benchmark: commit0
#82 opened Nov 7, 2025 by juanmichelini Draft
3 tasks
benchmark: SWT bench eval
#80 opened Nov 7, 2025 by juanmichelini Draft
1 of 3 tasks
benchmark: SWT-bench infer
#55 opened Oct 28, 2025 by juanmichelini Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.