Hey,
I'm interested by what's happening here ! The current stack for RL training for diffusion is awfull (flow-grpo forks ...)
I've been following https://github.com/chenyingshu/mm_grpo recently but the project seems to be on hold.
Your codebase seems really good already. Do you have any ETA ? Are you working alone on this ?
Hey,
I'm interested by what's happening here ! The current stack for RL training for diffusion is awfull (flow-grpo forks ...)
I've been following https://github.com/chenyingshu/mm_grpo recently but the project seems to be on hold.
Your codebase seems really good already. Do you have any ETA ? Are you working alone on this ?