rlvr
Here are 50 public repositories matching this topic...
Awesome List for Agentic RL
-
Updated
Apr 18, 2026 - HTML
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
-
Updated
Nov 5, 2025 - Python
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
-
Updated
Oct 28, 2025 - Python
[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"
-
Updated
Apr 17, 2026 - Python
🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Models
-
Updated
Apr 17, 2026 - Python
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more intelligent and aligned AI agents.
-
Updated
Sep 1, 2025
Procedural data generators suite for synthetic pretraining and formal reasoning
-
Updated
Apr 17, 2026 - Python
🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence
-
Updated
May 21, 2025 - Python
This is the official code of DeepSearch [ICLR 2026]
-
Updated
Oct 22, 2025 - Python
The official repository of the paper "Do Reasoning Models Enhance Embedding Models?"
-
Updated
Apr 17, 2026 - Python
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
-
Updated
Apr 7, 2026 - Python
grpo to train long form QA and instructions with long-form reward model
-
Updated
Jul 17, 2025 - Python
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
-
Updated
Jul 6, 2025 - Python
PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory
-
Updated
Apr 1, 2026 - Python
Improve this page
Add a description, image, and links to the rlvr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rlvr topic, visit your repo's landing page and select "manage topics."