rl-training

An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model prompting and evaluation, exact match, F1 Score, PEDANT semantic match, transformer match. Our package also supports prompting OPENAI and Anthropic API.

qa-automation-test rl-training llm exact-matching llm-evaluation llm-evaluation-toolkit llm-evaluation-framework reward-modeling

Updated Jul 18, 2025
Python

sb-ai-lab / Sim4Rec

Star

Simulator for training and evaluation of Recommender Systems

recommender-system recommendation user-modeling evaluation-framework synthetic-data rl-training

Updated Mar 24, 2025
Jupyter Notebook

zli12321 / free-form-grpo

Star

grpo to train long form QA and instructions with long-form reward model

reinforcement-learning-algorithms evaluation-framework reward-design rl-training long-form-text-generation qwen2-5 grpo rlvr

Updated Jul 17, 2025
Python

collinear-ai / tau-trait

Star

TraitBasis applied to TauBench

rl-envs rl-training agent-benchmark

Updated Nov 11, 2025
Python

SalesforceAIResearch / MAS-Orchestra

Star

MAS-Orchestra: train, inspect, and vibe-code multi-agent systems with RL-learned orchestration and MASBench

orchestration multi-agent-systems rl-training

Updated Apr 18, 2026
Python

Amirhosein-gh98 / Guided-by-Gut

Star

The official PyTorch implementation for the Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

efficient tree-search gg prm self-consistency confidence dvts rl-training llm inference-time-compute grpo test-time-scaling guided-by-gut

Updated Jun 9, 2025
Python

sotheara-leang / txt-summarization

Star

Deep Reinforced Model for Abstractive Summarization

pytorch text-summarization abstractive-summarization rl-training mle-training temporal-attention share-decoder-weight

Updated Nov 22, 2022
Python

jeffasante / RepoGym

Star

A reinforcement learning environment for AI coding agents built from real Git history. RepoGym automatically extracts bug-fix tasks from open-source repositories, runs agent patches in Docker sandboxes, and returns reward signals based on test suite delta.

rust benchmark reinforcement-learning dynamic-analysis ai-agents rl-training llm-evaluation

Updated Mar 14, 2026
Rust

Improve this page

Add a description, image, and links to the rl-training topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rl-training topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rl-training

Here are 12 public repositories matching this topic...

inclusionAI / AWorld

NVIDIA-NeMo / Gym

ZJU-REAL / ClawGUI

rohithreddy024 / Text-Summarizer-Pytorch

zli12321 / qa_metrics

sb-ai-lab / Sim4Rec

zli12321 / free-form-grpo

collinear-ai / tau-trait

SalesforceAIResearch / MAS-Orchestra

Amirhosein-gh98 / Guided-by-Gut

sotheara-leang / txt-summarization

jeffasante / RepoGym

Improve this page

Add this topic to your repo