Skip to content

psyonp/BinaryPPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 

Repository files navigation

BinaryPPO

An offline LLM reinforcement learning framework that reformulates binary classification as a reward maximization problem.

About

An offline LLM reinforcement learning framework that reformulates binary classification as a reward maximization problem.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors