Commit 16d890f
committed
fix(aero_realtime): fix NaN loss and duplicate system prompts
- Make time_embedding.inv_freq persistent=True so it survives FSDP2
state dict save/load cycle (was corrupted → NaN loss)
- Remove auto-injected system prompt from chat template that was
added before every turn due to per-message apply_chat_template1 parent 7c6dd02 commit 16d890f
2 files changed
Lines changed: 1 addition & 4 deletions
File tree
- src/lmms_engine
- datasets/processor
- models/aero_realtime
Lines changed: 0 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
615 | 615 | | |
616 | 616 | | |
617 | 617 | | |
618 | | - | |
619 | | - | |
620 | | - | |
621 | 618 | | |
622 | 619 | | |
623 | 620 | | |
| |||
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
138 | 138 | | |
139 | 139 | | |
140 | 140 | | |
141 | | - | |
| 141 | + | |
142 | 142 | | |
143 | 143 | | |
144 | 144 | | |
| |||
0 commit comments