All

28 repositories

ART
Public
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, …
agent reinforcement-learning rl
agent reinforcement-learning rl lora llms qwen agentic-ai grpo qwen3
Python
•
Apache License 2.0
•799•9.2k•64•53•Updated Apr 18, 2026Apr 18, 2026
art-notebooks
Public
Notebooks to demonstrate ART (Agent Reinforcement Trainer) in practice!
Shell
•
Apache License 2.0
•6•6•2•0•Updated Feb 19, 2026Feb 19, 2026
Summary-RL
Public
Train an agent to generate high quality summaries
Jupyter Notebook
•10•41•0•1•Updated Jan 29, 2026Jan 29, 2026
open_deep_research_training
Public
Training setup for Langchain's Open Deep Research
Python
•
MIT License
•17•76•1•0•Updated Aug 28, 2025Aug 28, 2025
verl
Public
verl: Volcano Engine Reinforcement Learning for LLMs
Python
•
Apache License 2.0
•3.7k•1•0•0•Updated Jul 29, 2025Jul 29, 2025
art-langgraph
Public
Python
•
MIT License
•0•5•0•0•Updated Jul 18, 2025Jul 18, 2025
art-star-count
Public
Display ART repository star count on a tablet
HTML
•0•1•0•0•Updated Jul 14, 2025Jul 14, 2025
vllm-completions
Public
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
•
Apache License 2.0
•16k•0•0•0•Updated Jun 27, 2025Jun 27, 2025
ArcticInference
Public
Python
•
Apache License 2.0
•58•0•0•0•Updated Jun 18, 2025Jun 18, 2025
skypilot-catalog
Public
58•0•0•0•Updated May 26, 2025May 26, 2025
openapi-typescript-codegen
Public
NodeJS library that generates Typescript or Javascript clients based on the OpenAPI specification
TypeScript
•
MIT License
•542•0•0•0•Updated May 14, 2025May 14, 2025
email-deep-research
Public
Python
•
Apache License 2.0
•3•20•0•0•Updated Apr 24, 2025Apr 24, 2025
S3LoRAResolver
Public
Python
•
Apache License 2.0
•0•0•0•0•Updated Apr 24, 2025Apr 24, 2025
pii-redaction
Public
Detect and redact PII locally with SOTA performance
local-models pii-redaction llms
local-models pii-redaction llms
Python
•
MIT License
•18•99•1•0•Updated Mar 25, 2025Mar 25, 2025
best-hn
Public
Jupyter Notebook
•1•10•0•0•Updated Mar 25, 2025Mar 25, 2025
rl-experiments
Public
OpenPipe Reinforcement Learning Experiments
Jupyter Notebook
•
MIT License
•5•32•0•0•Updated Mar 14, 2025Mar 14, 2025
deductive-reasoning
Public
Train your own SOTA deductive reasoning model
Python
•
MIT License
•8•110•1•0•Updated Mar 6, 2025Mar 6, 2025
vllm-lora
Public
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
•
Apache License 2.0
•16k•1•0•0•Updated Nov 20, 2024Nov 20, 2024
sglang
Public
SGLang is a fast serving framework for large language models and vision language models.
Python
•
Apache License 2.0
•5.5k•0•0•0•Updated Nov 20, 2024Nov 20, 2024
trl
Public
Train transformer language models with reinforcement learning.
Python
•
Apache License 2.0
•2.7k•0•0•0•Updated Oct 14, 2024Oct 14, 2024
vllm
Public
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
•
Apache License 2.0
•16k•0•0•0•Updated Jun 24, 2024Jun 24, 2024
alpaca_eval
Public
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook
•
Apache License 2.0
•307•0•0•0•Updated Jun 20, 2024Jun 20, 2024
mistral-client-js
Public
JS Client library for Mistral AI platform
JavaScript
•
Apache License 2.0
•46•0•0•0•Updated Jun 5, 2024Jun 5, 2024
OpenPipe
Public
Turn expensive prompts into cheap fine-tuned models
ai llm prompt-engineering
ai llm prompt-engineering llmops
TypeScript
•
Apache License 2.0
•170•2.8k•5•2•Updated May 25, 2024May 25, 2024
step-one
Public
This repo is only used for searching reddit
Python
•3•3•0•0•Updated Apr 26, 2024Apr 26, 2024
trpc-openapi
Public
OpenAPI support for tRPC 🧩 - with streaming :)
TypeScript
•
MIT License
•198•2•0•0•Updated Feb 23, 2024Feb 23, 2024
axolotl
Public
Go ahead and axolotl questions
Python
•
Apache License 2.0
•1.3k•0•0•0•Updated Feb 8, 2024Feb 8, 2024
tsoa
Public
Build OpenAPI-compliant REST APIs using TypeScript and Node
TypeScript
•
MIT License
•530•1•0•0•Updated Dec 18, 2023Dec 18, 2023

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenPipe

All

All

28 repositories

ART

art-notebooks

Summary-RL

open_deep_research_training

verl

art-langgraph

art-star-count

vllm-completions

ArcticInference

skypilot-catalog

openapi-typescript-codegen

email-deep-research

S3LoRAResolver

pii-redaction

best-hn

rl-experiments

deductive-reasoning

vllm-lora

sglang

trl

vllm

alpaca_eval

mistral-client-js

OpenPipe

step-one

trpc-openapi

axolotl

tsoa

All

All

Repositories list

28 repositories