[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Updated Jun 6, 2025 - Python