Actions: allenai/reward-bench

Showing runs from all workflows
1,182 workflow runs

Update per token reward
Tests #54: Pull request #25 synchronize by ljvmiranda921
February 19, 2024 17:47 3m 48s update/per-token-reward
Best of N pipeline + tests
Tests #53: Pull request #30 synchronize by natolambert
February 16, 2024 00:47 3m 51s b_o_n
Best of N pipeline + tests
Quality #53: Pull request #30 synchronize by natolambert
February 16, 2024 00:47 3m 33s b_o_n
Add model type to results (#26)
Quality #52: Commit 060d9c2 pushed by natolambert
February 15, 2024 22:44 3m 33s main
Add model type to results (#26)
Tests #52: Commit 060d9c2 pushed by natolambert
February 15, 2024 22:44 4m 8s main
Best of N pipeline + tests
Tests #51: Pull request #30 synchronize by natolambert
February 15, 2024 22:41 3m 39s b_o_n
Best of N pipeline + tests
Quality #51: Pull request #30 synchronize by natolambert
February 15, 2024 22:41 4m 30s b_o_n
Best of N pipeline + tests
Tests #50: Pull request #30 synchronize by natolambert
February 15, 2024 22:36 4m 59s b_o_n
Best of N pipeline + tests
Quality #50: Pull request #30 synchronize by natolambert
February 15, 2024 22:36 3m 35s b_o_n
Best of N pipeline + tests
Tests #49: Pull request #30 synchronize by natolambert
February 15, 2024 22:00 3m 49s b_o_n
Best of N pipeline + tests
Quality #49: Pull request #30 synchronize by natolambert
February 15, 2024 22:00 3m 30s b_o_n
Best of N pipeline + tests
Tests #48: Pull request #30 opened by natolambert
February 15, 2024 21:41 4m 0s b_o_n
Best of N pipeline + tests
Quality #48: Pull request #30 opened by natolambert
February 15, 2024 21:41 3m 35s b_o_n
Per token multiple rms
Tests #47: Pull request #29 opened by khyathiraghavi
February 15, 2024 21:31 3m 43s per-token-multiple-rms
Per token multiple rms
Quality #47: Pull request #29 opened by khyathiraghavi
February 15, 2024 21:31 3m 25s per-token-multiple-rms
Per token multiple rms
Quality #46: Pull request #28 opened by khyathiraghavi
February 15, 2024 21:25 3m 36s per-token-multiple-rms
Per token multiple rms
Tests #46: Pull request #28 opened by khyathiraghavi
February 15, 2024 21:25 3m 45s per-token-multiple-rms
visualizing multiple rewards
Quality #45: Pull request #27 opened by khyathiraghavi
February 15, 2024 21:19 3m 40s per-token-multiple-rms
visualizing multiple rewards
Tests #45: Pull request #27 opened by khyathiraghavi
February 15, 2024 21:19 4m 23s per-token-multiple-rms
Add model type to results
Tests #44: Pull request #26 synchronize by natolambert
February 15, 2024 17:45 4m 6s model_type
Add model type to results
Quality #44: Pull request #26 synchronize by natolambert
February 15, 2024 17:45 3m 28s model_type
Add model type to results
Tests #43: Pull request #26 opened by natolambert
February 15, 2024 17:44 4m 36s model_type
Add model type to results
Quality #43: Pull request #26 opened by natolambert
February 15, 2024 17:44 3m 33s model_type
Update per token reward
Tests #42: Pull request #25 opened by ljvmiranda921
February 14, 2024 20:32 4m 5s update/per-token-reward
Update per token reward
Quality #42: Pull request #25 opened by ljvmiranda921
February 14, 2024 20:32 3m 41s update/per-token-reward