Skip to content

Convert: Fix NemotronH Config Parsing#21664

Open
anavp-nvidia wants to merge 2 commits intoggml-org:masterfrom
anavp-nvidia:fix-nemotron-h-config-parsing
Open

Convert: Fix NemotronH Config Parsing#21664
anavp-nvidia wants to merge 2 commits intoggml-org:masterfrom
anavp-nvidia:fix-nemotron-h-config-parsing

Conversation

@anavp-nvidia
Copy link
Copy Markdown
Contributor

Overview

Fixing NVIDIA-Nemotron-3-Nano-4B-BF16 conversion failure due to config parsing issue (KeyError: '-') with latest versions of transformers (5.5.0 and 5.5.1 at least) .

The fix overrides set_vocab in NemotronHModel with a self-contained implementation that uses trust_remote_code=True. This allows the model's own config class to be loaded, which correctly handles the pattern.

Requirements

  • I have read and agree with the contributing guidelines
  • AI usage disclosure: Yes, AI tools were used to validate the changes.

@anavp-nvidia anavp-nvidia requested a review from CISC as a code owner April 9, 2026 08:51
@ggerganov
Copy link
Copy Markdown
Member

cc @richarddd @michaelw9999

@github-actions github-actions bot added the python python script changes label Apr 9, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

python python script changes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants