A 17-notebook journey through Reinforcement Learning — from Bellman to AlphaZero to RLHF.
Learn by doing: each notebook is a workbook with guided # TODO scaffolds you fill in yourself. Solutions are included so you can check your work.
notebooks/— Complete notebooks with solutions filled inexercises/— Same notebooks with# TODOblocks for you to completeutils/— Shared plotting and environment helpers
# Clone the repo
git clone https://github.com/YOUR_USERNAME/rl-notebooks.git
cd rl-notebooks
# Create a virtual environment
python -m venv .venv
source .venv/bin/activate
# Install dependencies
uv pip install -r requirements.txt
# Launch Jupyter
jupyter notebookStart with exercises/01_agents_envs_mdps.ipynb if you want to learn by doing, or notebooks/01_agents_envs_mdps.ipynb if you want to read the completed version.
- Python 3.10+
- Basic comfort with NumPy
- High-school math (we introduce all the RL math gently)
- Curiosity 🙂
Every notebook works in Colab — click the badges in the table above to open any notebook or exercise directly.
MIT