-
Notifications
You must be signed in to change notification settings - Fork 496
Pull requests: allenai/open-instruct
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Require checkpoint on Beaker restarts for DPO and GRPO training
codex
#1469
opened Feb 10, 2026 by
finbarrtimbers
Loading…
Enable packing + compile for DPO with wasted tokens metric
#1466
opened Feb 6, 2026 by
finbarrtimbers
Loading…
add user_prompt_transform to allow easy formatting of prefix, suffix of prompts
#1465
opened Feb 6, 2026 by
mnoukhov
Loading…
Now,
dpo.py matches dpo_tune_cache.py almost perfectly on the single GPU experiments
#1451
opened Feb 1, 2026 by
finbarrtimbers
Loading…
Add DPO OLMo-core support with MFU improvements
#1440
opened Jan 30, 2026 by
finbarrtimbers
Loading…
3 tasks
Significantly improves
dpo.py performance: ~40% MFU
#1430
opened Jan 27, 2026 by
finbarrtimbers
•
Draft
Now, the GPU tests CI action automatically appends the result to prevent it from re-running.
#1409
opened Jan 21, 2026 by
finbarrtimbers
Loading…
Add optional wandb system metrics logging for generator process
#1403
opened Jan 20, 2026 by
jacob-morrison
•
Draft
Add GRPO main entry point and scripts (GRPO olmo-core: PR 5 of 5)
#1399
opened Jan 20, 2026 by
finbarrtimbers
Loading…
1 of 3 tasks
Add OLMo-core Ray actor (GRPO olmo-core: PR 4 of 5)
#1398
opened Jan 20, 2026 by
finbarrtimbers
Loading…
1 of 2 tasks
Add GRPO callbacks for OLMo-core Trainer (GRPO olmo-core: PR 3 of 5)
#1397
opened Jan 20, 2026 by
finbarrtimbers
Loading…
Use simple-parsing for DPO argument parsing
#1393
opened Jan 20, 2026 by
finbarrtimbers
Loading…
3 tasks
Refactor DPO config: move fields and remove duplicates
#1392
opened Jan 20, 2026 by
finbarrtimbers
Loading…
3 tasks
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.