New options for preference tuning: rpo alpha, logprobs normalization, reference-free, simpo gamma #488

Triggered via pull request June 12, 2025 16:51
Status Success
Total duration 37s

_integration_tests.yml

on: pull_request
Matrix: build