Actions: arcprize/arc-agi-benchmarking

Python Tests

293 workflow runs

adding codex
Python Tests #155: Commit 1e03de1 pushed by gkamradt
53s main
adding claude agent sdk
Python Tests #152: Commit 2f5ca02 pushed by gkamradt
55s main
util updates
Python Tests #151: Commit eca2992 pushed by gkamradt
47s main
readme polish
Python Tests #149: Commit fda9bdf pushed by gkamradt
48s main
fixing tests
Python Tests #148: Commit 0d1c6c9 pushed by gkamradt
52s main
readme update w/ random
Python Tests #147: Commit b425394 pushed by gkamradt
47s main
fixing custom gemini
Python Tests #146: Commit cd17137 pushed by gkamradt
49s main
new models
Python Tests #145: Commit 648711f pushed by gkamradt
26s main
haiku
Python Tests #139: Commit 8fdb7a5 pushed by gkamradt
44s main
gpt 5 pro
Python Tests #138: Commit 0473a27 pushed by gkamradt
50s main
adding anthropic models
Python Tests #137: Commit eea524f pushed by gkamradt
48s main
gpt-5 configs
Python Tests #135: Commit 6e0b00c pushed by gkamradt
52s main