Skip to content
View TeenLucifer's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Northwestern Polytechnical University

Block or report TeenLucifer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. llm_base llm_base Public

    Pretrain、Posttrain、RAG、Agent等大模型相关的基础项目合集

    Python 35 5

  2. grpo_reproduce grpo_reproduce Public

    A comparison of deepseek grpo and qwen gspo on Qwen2.5-1.5B-Instruct fine tunning.

    Python 159 14

  3. dapo_reproduce dapo_reproduce Public

    Python 12 1

  4. vlm_reproduce vlm_reproduce Public

    Python 36 2

  5. ppo_reproduce ppo_reproduce Public

    Jupyter Notebook 8 1

  6. dpo_reproduce dpo_reproduce Public

    Python 5 1