Skip to content

[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions#3542

Closed
vmoens wants to merge 2 commits intogh/vmoens/234/basefrom
gh/vmoens/234/head
Closed

[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions#3542
vmoens wants to merge 2 commits intogh/vmoens/234/basefrom
gh/vmoens/234/head

Commits

Commits on Mar 5, 2026

Commits on Mar 20, 2026