Update RL training skill by Kovbo · Pull Request #633 · OpenPipe/ART

Kovbo · 2026-03-27T19:33:41Z

Summary

rewrite the RL training skill into a shorter interactive wizard that still collects required choices one question at a time
update RL guidance to use batch-scaled max_exceptions, explicit validation/checkpoint guidance, openai/gpt-5.4 as the default RULER judge, and neutral base-model prompting

Kovbo added 2 commits March 27, 2026 12:33

Update RL and SFT training skills

196f4ae

Drop train-sft changes from skill update PR

69d7359

Kovbo changed the title ~~Update RL and SFT training skills~~ Update RL training skill Mar 27, 2026

Kovbo merged commit 1905677 into main Mar 27, 2026
5 checks passed