feat(study-crates): test-file target + reward-hacking benchmark prompt #280
| Job | Run time |
|---|---|
| | 45s |
| | 2m 35s |
| | 11m 48s |
| | 4m 51s |
| | 10m 40s |
| | 10s |
| | 4m 6s |
| | 4m 32s |
| | 4m 7s |
| | 4m 20s |
| | 3m 27s |
| | 3m 44s |
| | 3m 19s |
| | 4m 3s |
| | 4m 19s |
| | 4m 13s |
| | 3m 37s |
| | 3m 49s |
| | 4m 35s |
| | 3m 35s |
| | 3m 38s |
| | 2m 59s |
| | 3m 21s |
| | 3m 17s |
| | 4m 27s |
| | 3m 41s |
| | 2m 22s |
| | 5m 27s |
| | 2m 20s |
| | 2m 15s |
| | 2m 24s |
| | 4m 22s |
| | 3m 49s |
| | 2m 11s |
| | 0s |
| | 0s |
| Total | 2h 13m 8s |