-
Notifications
You must be signed in to change notification settings - Fork 243
Pull requests: allenai/OLMo-core
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add conversion overrides for Llama, Qwen3, and Gemma 4 models so they roundtrip properly
#677
opened May 13, 2026 by
finbarrtimbers
Collaborator
Loading…
Stream HF→OLMo state conversion to lower load_hf_model peak memory
#661
opened Apr 27, 2026 by
finbarrtimbers
Collaborator
Loading…
2 of 3 tasks
Fix NaN loss when all labels on a rank are label_ignore_index
#657
opened Apr 10, 2026 by
Suanmd
Loading…
4 of 5 tasks
Use metadata-backed document boundaries for SFT datasets
#653
opened Apr 2, 2026 by
taivu1998
Contributor
Loading…
Fix race condition in SamplingInstanceSource index cache
#651
opened Mar 27, 2026 by
IanMagnusson
•
Draft
1 task done
Adds option to disable post train permanent checkpoint
#631
opened Mar 4, 2026 by
epwalsh
Contributor
Loading…
Fix: respect
-t/--tokenizer CLI flag over sibling tokenizer/ directory
#629
opened Mar 3, 2026 by
mario-sanz
Loading…
Add ReduceType.weighted_mean for weighted metric reduction
#604
opened Feb 11, 2026 by
finbarrtimbers
Collaborator
Loading…
3 tasks done
Fix RoPE positions to reset at document boundaries when using doc_lens
#591
opened Feb 3, 2026 by
finbarrtimbers
Collaborator
Loading…
Mount oe-adapt-default as well when mounting oe-training-default
#578
opened Jan 29, 2026 by
jacob-morrison
Contributor
Loading…
Modify launch script so that it has a configurable timeout
#557
opened Jan 20, 2026 by
finbarrtimbers
Collaborator
•
Draft
Add a Llama-like 7B training script for data ablations
#505
opened Dec 16, 2025 by
epwalsh
Contributor
Loading…
Add DataMixtureMonitorCallback for per-source data mixture metrics
#501
opened Dec 15, 2025 by
DivijChawla
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.