Verify HF export at startup; rewrite save_state_dict_as_hf by finbarrtimbers · Pull Request #1671 · allenai/open-instruct

finbarrtimbers · 2026-05-08T17:01:20Z

Replace olmo-core's save_hf_model path with a direct convert_state_to_hf → AutoModelForCausalLM.save_pretrained flow, so HF export reuses the same converter the verifier checks against.

Add verify_can_save_as_hf which builds the olmo-core model and the target HF model on meta, runs the converter, and asserts the produced state-dict keys exactly match the HF model.

…noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request replaces the olmo-core HF saving path with a direct conversion flow, adds startup verification for HF exports, and introduces a pruning mechanism for permanent checkpoints. Feedback identifies a critical AttributeError due to incorrect StateType usage, notes that the checkpoint pruning parameter is not yet propagated through the call stack, and highlights redundant configuration saving logic and an unused parameter.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 905a12faf8

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

…t_state_to_hf; prune permanent checkpoints Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…noreply@anthropic.com>

…verrides shim now that upstream handles llama/qwen3/gemma3 norm mappings Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

… config save (PR #1671 review) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

… Opus 4.7 <noreply@anthropic.com>

…ude Opus 4.7 <noreply@anthropic.com>

…o f-string Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

farhatkevin

This looks good to me once the merge conflicts are resolved and the stale PruningCheckpointerCallback mention is removed from the changelog.

…ne guards Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…thored-By: Claude Opus 4.7 <noreply@anthropic.com>

…ield ordering Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

@gemini-code-assist

…i#1672) * Move maybe_evaluate to grpo_utils; dedupe calculate_token_counts Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Verify HF export at startup; rewrite save_state_dict_as_hf via convert_state_to_hf; prune permanent checkpoints Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Update CHANGELOG with PR allenai#1671 link Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * GRPO OLMo-core feature parity: EvalCallback, setup_eval, checkpointer, scheduler types Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Update CHANGELOG with PR allenai#1672 link Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Drop pre-norm Qwen3/Llama OLMo-core->HF override shim; upstream olmo-core now provides these mappings Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Minimize diff: drop PruningCheckpointerCallback, ty:ignore, and unrelated script tweaks Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Unify checkpoint_state_freq disable sentinel: <=0 disables on both grpo_fast and olmo_core paths Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Apply suggestion from @gemini-code-assist[bot] Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * Update grpo_callbacks.py * Use keyword args in EvalCallback.post_step's maybe_evaluate call Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * Move save_freq/checkpoint_state_freq divergence warning into GRPOExperimentConfig.__post_init__ Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

finbarrtimbers added a commit that referenced this pull request May 8, 2026

Update CHANGELOG with PR #1671 link Co-Authored-By: Claude Opus 4.7 <…

6bfecf1

…noreply@anthropic.com>

gemini-code-assist Bot reviewed May 8, 2026

View reviewed changes

Comment thread open_instruct/olmo_core_utils.py Outdated

Comment thread open_instruct/olmo_core_utils.py Outdated

Comment thread open_instruct/olmo_core_utils.py Outdated

Comment thread open_instruct/olmo_core_utils.py Outdated

chatgpt-codex-connector Bot reviewed May 8, 2026

View reviewed changes

Comment thread open_instruct/olmo_core_utils.py Outdated

Comment thread open_instruct/olmo_core_utils.py Outdated

finbarrtimbers mentioned this pull request May 8, 2026

GRPO OLMo-core feature parity: eval, checkpointer, schedulers #1672

Merged

finbarrtimbers added 2 commits May 11, 2026 12:31

Verify HF export at startup; rewrite save_state_dict_as_hf via conver…

307ab05

…t_state_to_hf; prune permanent checkpoints Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Update CHANGELOG with PR #1671 link Co-Authored-By: Claude Opus 4.7 <…

370c2f1

…noreply@anthropic.com>

finbarrtimbers force-pushed the finbarr/hf-export-verify branch from 6bfecf1 to 370c2f1 Compare May 11, 2026 18:31

finbarrtimbers added 7 commits May 13, 2026 09:19

cleaned up pr

d40581c

Bump olmo-core to f1b69d79; drop _register_pre_norm_olmo_core_to_hf_o…

8a55ecc

…verrides shim now that upstream handles llama/qwen3/gemma3 norm mappings Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Drop unused model_config arg from save_state_dict_as_hf and redundant…

1164a03

… config save (PR #1671 review) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Merge branch 'main' into finbarr/hf-export-verify

b63f785

Add type annotations to export_to_hf in dpo.py Co-Authored-By: Claude…

0bf1b68

… Opus 4.7 <noreply@anthropic.com>

Inline get_hf_config helper in olmo_core_utils.py Co-Authored-By: Cla…

9ae776d

…ude Opus 4.7 <noreply@anthropic.com>

Annotate save_state_dict_as_hf and switch verify_can_save_as_hf log t…

335e8cf

…o f-string Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

farhatkevin self-requested a review May 14, 2026 18:25

farhatkevin approved these changes May 14, 2026

View reviewed changes

finbarrtimbers added 4 commits May 14, 2026 11:28

Merge branch 'main' into finbarr/hf-export-verify

6de8d8b

Add trailing newline to olmo_core_utils.py to fix style-check

7188008

Make model_name_or_path required on ModelConfig and drop redundant No…

a74f0d9

…ne guards Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Trim CHANGELOG entry for #1671 to reflect what actually shipped Co-Au…

5252948

…thored-By: Claude Opus 4.7 <noreply@anthropic.com>

finbarrtimbers enabled auto-merge May 14, 2026 23:14

finbarrtimbers disabled auto-merge May 14, 2026 23:15

finbarrtimbers added 2 commits May 14, 2026 17:20

Make olmo_core_utils.ModelConfig kw_only to fix DPOExperimentConfig f…

3e93968

…ield ordering Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Merge branch 'main' into finbarr/hf-export-verify

bae9119

finbarrtimbers enabled auto-merge May 14, 2026 23:21

finbarrtimbers disabled auto-merge May 14, 2026 23:21

finbarrtimbers enabled auto-merge May 14, 2026 23:21

finbarrtimbers added this pull request to the merge queue May 14, 2026

finbarrtimbers removed this pull request from the merge queue due to a manual request May 14, 2026

finbarrtimbers added this pull request to the merge queue May 15, 2026

Merged via the queue into main with commit 1bc3178 May 15, 2026
7 checks passed

finbarrtimbers deleted the finbarr/hf-export-verify branch May 15, 2026 13:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verify HF export at startup; rewrite save_state_dict_as_hf#1671

Verify HF export at startup; rewrite save_state_dict_as_hf#1671
finbarrtimbers merged 15 commits into
mainfrom
finbarr/hf-export-verify

finbarrtimbers commented May 8, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

farhatkevin left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

finbarrtimbers commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

farhatkevin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

finbarrtimbers commented May 8, 2026 •

edited

Loading