🐦 Flappy Bird — Tabular Q-Learning View Project README flappy.mp4 🚕 Taxi Driver — Deep Q-Network (DQN) View Project README taxi-driver.mp4 🎯 CartPole — Proximal Policy Optimization (PPO) View Project README cartpole.mp4 🤖 Franka Panda Manipulation — PPO with Continuous Control View Project README panda-initial.mp4