NLP Group @ HKUST

simpleRL-reason Public

Simple RL training for reasoning

Python 3.8k 289

CodeIO Public

[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 568 33

deita Public

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 593 35

ceval Public

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1.8k 82

Toolathlon Public

[ICLR 2026] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 315 32

KernelGYM Public

[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Python 156 17

Provide feedback