Skip to content

Fix for layer distillation with sequence parallel training#431

Merged
oleksost merged 2 commits intomainfrom
sp_layer_distillation
Dec 23, 2025
Merged

Fix for layer distillation with sequence parallel training#431
oleksost merged 2 commits intomainfrom
sp_layer_distillation

Commits

Commits on Dec 19, 2025

Commits on Dec 23, 2025