Skip to content

Actions: CodingThrust/problem-reductions-benchmark

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
121 workflow runs
121 workflow runs

Filter by Workflow

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

leaderboard: aggregate-only public results; never publish found bugs
CI — unit tests and verifier calibration #50: Commit 3d2a526 pushed by isPANN
1m 11s main
skill: add run-benchmark for driving a run end-to-end (macOS/Linux)
CI — unit tests and verifier calibration #49: Commit 5581951 pushed by isPANN
1m 21s main
docs: add MIT LICENSE; rename the guide to CONTRIBUTING.md (#27)
CI — unit tests and verifier calibration #48: Commit 609c772 pushed by isPANN
1m 40s main
docs: collapse to README + SUBMISSION; drop SHOWCASE and CONTRIBUTING…
CI — unit tests and verifier calibration #46: Commit 9b5b7af pushed by isPANN
1m 15s main
docs(readme): submission is a GitHub PR (not the Space); pin Python 3.12
CI — unit tests and verifier calibration #44: Commit 02dac7f pushed by isPANN
1m 15s main
build(deps): bump mini-swe-agent from 2.2.8 to 2.4.4 in /benchmark (#19)
CI — unit tests and verifier calibration #43: Commit 5252b45 pushed by isPANN
1m 32s main
docker in /docker - Update #1447394869
Dependabot Updates #16: by dependabot Bot
37s main
37s
pip in /benchmark - Update #1447394868
Dependabot Updates #15: by dependabot Bot
43s main
43s
48s
ci(dependabot): pin python base to 3.12, ignore its updates
CI — unit tests and verifier calibration #41: Commit dfe1a4e pushed by isPANN
1m 13s main
build(deps): update pytest requirement in /benchmark (#24)
CI — unit tests and verifier calibration #40: Commit 7684fcb pushed by isPANN
1m 38s main