Pull requests: OpenHands/benchmarks
fix: Reuse pre-built SDK sdist to speed up image builds
#499
opened Mar 10, 2026 by
juanmichelini
•
Draft
build(deps): bump the version-all group across 1 directory with 5 updates
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#492
opened Mar 9, 2026 by
dependabot
bot
Add background BuildKit pruning for completed batches
#489
opened Mar 8, 2026 by
neubig
6 tasks done
Disable rich logging by default to fix multiprocessing deadlock
#487
opened Mar 5, 2026 by
juanmichelini
•
Draft
Fix fork+threading deadlock in SWE-Bench image builds
#486
opened Mar 5, 2026 by
juanmichelini
•
Draft
build(deps): bump the version-all group across 1 directory with 18 updates
dependencies
Pull requests that update a dependency file
python:uv
Pull requests that update python:uv code
#485
opened Mar 4, 2026 by
dependabot
bot
Add SWE-bench Multilingual benchmark support
#480
opened Mar 4, 2026 by
juanmichelini
•
Draft
3 tasks
refactor: Replace ProcessPoolExecutor with asyncio for evaluation
#446
opened Feb 25, 2026 by
simonrosenberg
Recycle worker processes to prevent OOM from heap fragmentation
#442
opened Feb 24, 2026 by
simonrosenberg
3 tasks
Add ACP agent support (Claude Code, Codex) for all benchmarks
#440
opened Feb 23, 2026 by
simonrosenberg
6 tasks
Enable configurable context condensation in all benchmarks
#429
opened Feb 18, 2026 by
juanmichelini
Add git reset validation script and fix missing resets
#425
opened Feb 17, 2026 by
juanmichelini
•
Draft
Fix: laminar trace timeline to account for idle wait time
#415
opened Feb 13, 2026 by
Rainhunter13
fix(swtbench): prevent build workflow from hanging indefinitely
#403
opened Feb 6, 2026 by
juanmichelini
BREAKING: Rename --max-attempts to --n-critic-runs
#325
opened Jan 16, 2026 by
juanmichelini
•
Draft
Fix dataset loading schema validation issue in CI
#304
opened Jan 13, 2026 by
juanmichelini
Add add_resolve_rate_to_predictions function to output_utils
#199
opened Dec 23, 2025 by
juanmichelini
•
Draft