    Repositories list

    • ART

      Public
      Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
      Python
      6288k4631 Updated Dec 5, 2025
    • Notebooks to demonstrate ART (Agent Reinforcement Trainer) in practice!
      Shell
      4620 Updated Nov 22, 2025
    • Training setup for Langchain's Open Deep Research
      Python
      177210 Updated Aug 28, 2025
    • verl

      Public
      verl: Volcano Engine Reinforcement Learning for LLMs
      Python
      2.8k000 Updated Jul 29, 2025
    • Python
      0500 Updated Jul 18, 2025
    • Display ART repository star count on a tablet
      HTML
      0100 Updated Jul 14, 2025
    • Train an agent to generate high quality summaries
      Jupyter Notebook
      103901 Updated Jul 1, 2025
    • A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      12k000 Updated Jun 27, 2025
    • Python
      40000 Updated Jun 18, 2025
    • skypilot-catalog

      Public
      53000 Updated May 26, 2025
    • NodeJS library that generates Typescript or Javascript clients based on the OpenAPI specification
      TypeScript
      550000 Updated May 14, 2025
    • Python
      22000 Updated Apr 24, 2025
    • Python
      0000 Updated Apr 24, 2025
    • Detect and redact PII locally with SOTA performance
      Python
      158710 Updated Mar 25, 2025
    • best-hn

      Public
      Jupyter Notebook
      11000 Updated Mar 25, 2025
    • OpenPipe Reinforcement Learning Experiments
      Jupyter Notebook
      53200 Updated Mar 14, 2025
    • Train your own SOTA deductive reasoning model
      Python
      810710 Updated Mar 6, 2025
    • vllm-lora

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      12k100 Updated Nov 20, 2024
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      3.6k000 Updated Nov 20, 2024
    • trl

      Public
      Train transformer language models with reinforcement learning.
      Python
      2.3k000 Updated Oct 14, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      12k000 Updated Jun 24, 2024
    • alpaca_eval

      Public
      An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
      Jupyter Notebook
      292000 Updated Jun 20, 2024
    • JS Client library for Mistral AI platform
      JavaScript
      47000 Updated Jun 5, 2024
    • OpenPipe

      Public
      Turn expensive prompts into cheap fine-tuned models
      TypeScript
      1632.8k51 Updated May 25, 2024
    • step-one

      Public
      This repo is only used for searching reddit
      Python
      3300 Updated Apr 26, 2024
    • OpenAPI support for tRPC 🧩 - with streaming :)
      TypeScript
      196200 Updated Feb 23, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      1.2k000 Updated Feb 8, 2024
    • tsoa

      Public
      Build OpenAPI-compliant REST APIs using TypeScript and Node
      TypeScript
      529100 Updated Dec 18, 2023