🤖 AI Masters 2025: Reinforcement Learning Course

Welcome to the official repository for the AI Masters 2025 Reinforcement Learning Course! This repo contains all lecture slides, seminar materials, and homework assignments for the course. Whether you're a student, a fellow instructor, or an RL enthusiast, you'll find everything you need to dive deep into the world of Reinforcement Learning.

📚 About the Course

This course covers the fundamentals and advanced topics of Reinforcement Learning (RL), including Markov Decision Processes, Dynamic Programming, Temporal Difference Learning, Deep RL, Policy Gradients, Multi-Agent Systems, Offline RL, and practical applications. The course is designed for graduate students and professionals eager to master RL both in theory and practice.

🎓 Lectures

#	Date	Topic	PDF Link
1	09/02/2026	Introduction, Markov Decision Process	PDF
2	16/02/2026	Dynamic Programming	PDF
3	02/03/2026	Temporal Differences	PDF
4	16/03/2026	Approximation, Deep Q-Network, Rainbow Extensions	PDF
5	23/03/2026	Policy Gradient Optimization, Actor-Critic (PG, REINFORCE, A2C)	PDF
6	30/03/2026	Advanced Actor-Critic Methods (TRPO, PPO, GAE)	PDF
7	06/04/2026	Continuous Action Space (DDPG, TD3, SAC)	PDF
8	13/04/2026	Multi-Agent Reinforcement Learning (IDQN, IPPO, QMIX, MAPPO)	PDF
9	27/04/2026	Integrating Reinforcement Learning and Planning	PDF
10	20/04/2026	Learning from Demonstrations. Transformers in RL	PDF-2025
11	04/05/2026	Linear Dynamical Systems	PDF-2025
12	18/05/2026	Practical Applications of RL, GRPO	PDF-2025

🧑‍💻 Seminars

#	Topic	Folder Link
1	Cross Entropy	seminar_01_cross_entropy
2	Value Iteration	seminar_02_value_iteration
3	Q-Learning, SARSA, TD(λ)	seminar_03_q_learning_sarsa_td_lambda
4	Deep Q-Networks	seminar_04_dqn
5	REINFORCE, Actor-Critic	seminar_05_reiforce_actor_critic
6	PPO	seminar_06_ppo
7	DDPG, TD3, SAC	seminar_07_ddpg_td3_sac
8	QMIX	TBA
9	Decision Transformer	TBA
10	MCTS	TBA
11	LQR	TBA
12	GRPO, SFT	TBA

📝 Homeworks

#	Topic	Folder Link
1	homework_1	homework_1
2	homework_2	homework_2
3	homework_3	homework_3
4	homework_4	homework_4
5	homework_5_ppo	homework_5_ppo
6	homework_5_sac	homework_5_sac
7	homework_6_dt_mem	homework_6_dt_mem

🚀 How to Use

Browse the Lectures section for slides and theory.
Explore the Seminars for hands-on coding and problem-solving sessions.
Complete the Homeworks to reinforce your understanding and gain practical RL experience.

Feel free to fork, clone, or star this repository. Contributions and feedback are always welcome!

👨‍🏫 Authors

Happy Reinforcement Learning!

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
homeworks		homeworks
lectures		lectures
seminar_01_cross_entropy		seminar_01_cross_entropy
seminar_02_value_iteration		seminar_02_value_iteration
seminar_03_q_learning_sarsa_td_lambda		seminar_03_q_learning_sarsa_td_lambda
seminar_04_dqn		seminar_04_dqn
seminar_05_reiforce_actor_critic		seminar_05_reiforce_actor_critic
seminar_06_ppo		seminar_06_ppo
seminar_07_ddpg_td3_sac		seminar_07_ddpg_td3_sac
seminar_08_qmix		seminar_08_qmix
seminar_10_mcts		seminar_10_mcts
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 AI Masters 2025: Reinforcement Learning Course

📚 About the Course

🗂️ Table of Contents

🎓 Lectures

🧑‍💻 Seminars

📝 Homeworks

🚀 How to Use

👨‍🏫 Authors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🤖 AI Masters 2025: Reinforcement Learning Course

📚 About the Course

🗂️ Table of Contents

🎓 Lectures

🧑‍💻 Seminars

📝 Homeworks

🚀 How to Use

👨‍🏫 Authors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages