sutton-barto

Here are 5 public repositories matching this topic...

0bserver07 / Study-Reinforcement-Learning

RL study guide — foundations through RLHF, DPO, GRPO, RLVR, agentic RL, and offline RL. Hand-written CS294 notes, 19 lecture drafts, 5 tested exercises, citations that resolve.

machine-learning reinforcement-learning deep-learning q-learning policy-gradient study-notes lecture-notes ppo dpo rlhf constitutional-ai deepseek-r1 grpo llm-alignment rlvr sutton-barto agentic-rl

Updated May 15, 2026
Python

MouseTrap-codes / n-armed-bandits

Star

n-armed bandit algorithms comparison + simulation app

python flask machine-learning reinforcement-learning plotly epsilon-greedy ucb multi-armed-bandit sutton-barto

Updated Jan 4, 2026
Python

GeoffreyWang1117 / SuttonRL-Implementation

Star

Interactive RL learning platform: 13 chapters from Sutton & Barto, 18K+ lines, fill-in-the-blank exercises with bilingual explanations. Bandits → DP → MC → TD → Policy Gradient → DQN → PPO → SAC → MARL → RLHF

python education reinforcement-learning interactive-learning sutton-barto

Updated Nov 21, 2025
Python

danielcregg / reinforcement-learning-an-introduction

Star

Fork of ShangtongZhang/reinforcement-learning-an-introduction - Python implementations of algorithms from Sutton and Barto's RL textbook (2nd Edition)

python machine-learning reinforcement-learning artificial-intelligence sutton-barto

Updated Feb 20, 2026
Python

SaiSampathKedari / Reinforcement-Learning

Star

Reinforcement learning algorithms with mathematical derivations and Sutton & Barto figure reproductions.

reinforcement-learning reinforcement-learning-algorithms sutton-barto-book sutton-barto

Updated May 23, 2026
Jupyter Notebook

Improve this page

Add a description, image, and links to the sutton-barto topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the sutton-barto topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sutton-barto

Here are 5 public repositories matching this topic...

0bserver07 / Study-Reinforcement-Learning

MouseTrap-codes / n-armed-bandits

GeoffreyWang1117 / SuttonRL-Implementation

danielcregg / reinforcement-learning-an-introduction

SaiSampathKedari / Reinforcement-Learning

Improve this page

Add this topic to your repo