Pinned

  1. understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 1.2k 57

  2. zero-bubble-pipeline-parallelism Public

    Forked from NVIDIA/Megatron-LM

    Zero Bubble Pipeline Parallelism

    Python 451 33

  3. lorahub Public

    [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

    Python 669 42

  4. oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 639 61

  5. stde Public

    Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024

    Python 128 10

  6. feedback-conditional-policy Public

    Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"

    Python 60 2

Repositories

Showing 10 of 101 repositories
  • jrystal Public

    A JAX-based Differentiable Density Functional Theory Framework for Materials

    Python 45 Apache-2.0 1 5 2 Updated Mar 18, 2026
  • TeamHOI Public

    [CVPR 2026] TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size

    Python 27 MIT 0 0 0 Updated Mar 12, 2026
  • odc Public

    On demand communication

    Python 32 2 1 3 Updated Mar 3, 2026
  • Stable-RL Public

    Rethinking the Trust Region in LLM Reinforcement Learning

    Python 49 Apache-2.0 5 0 5 Updated Mar 2, 2026
  • oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 639 Apache-2.0 61 6 1 Updated Jan 29, 2026
  • LifelongSafetyAlignment
    Python 11 0 1 0 Updated Jan 13, 2026
  • feedback-conditional-policy Public

    Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"

    Python 60 2 0 0 Updated Jan 5, 2026
  • InfNeRF Public

    InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity

    Python 12 Apache-2.0 1 1 0 Updated Jan 3, 2026
  • SkyLadder Public Forked from jzhang38/TinyLlama

    The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

    Python 42 Apache-2.0 610 1 0 Updated Dec 29, 2025
  • d4ft Public

    A JAX library for Density Functional Theory.

    Python 55 Apache-2.0 5 16 0 Updated Nov 25, 2025