feat: add SWE-bench and TAU-bench benchmark suite #48

Triggered via pull request on February 26, 2026 at 15:05
Status: Success
Total duration: 2m 30s

ci.yml

on: pull_request
Lint & TypeCheck (29s)
Unit Tests (53s)
Matrix: Provider E2E
Agent Integration (24s)
Comprehensive Agent Tests (21s)
CI Summary (3s)