Multi-Agent Reinforcement Learning (MARL) - Independent PPO (IPPO) #6

MaxGalindo150 · 2024-12-05T06:35:11Z

Objective:
This PR introduces the foundational components for the MARL module and implements the first multi-agent algorithm, Independent PPO (IPPO).

Key Changes:

Added the MultiAgentRolloutBuffer for storing experiences across multiple agents.
Implemented MultiAgentPolicy for decentralized policy management.
Adapted the environment wrapper to support multi-agent scenarios.
Developed the IPPO algorithm with unit tests and basic environment validation.

To-Do:

Implement MultiAgentRolloutBuffer.
Develop MultiAgentPolicy.
Create a MultiAgentEnvWrapper.
Implement IPPO algorithm.
Validate IPPO in cooperative and competitive scenarios.
Add basic documentation for MARL module.

Next Steps:
Once IPPO is fully integrated, subsequent PRs will focus on:

Implementing MAPPO with centralized training and decentralized execution.
Adding support for other MARL algorithms such as MADDPG and QMIX.

MaxGalindo150 added 2 commits December 5, 2024 00:02

cleaning library

d6e4d66

PR: Multi-Agent Reinforcement Learning (MARL) - Independent PPO (IPPO)

1ffce40

MaxGalindo150 marked this pull request as draft December 5, 2024 06:40

MaxGalindo150 added the enhancement New feature or request label Dec 5, 2024

MaxGalindo150 added 4 commits December 5, 2024 23:30

We need to get the attributes of the unwrapped environment.

705eaca

make_marl_vec_env

ad932ea

marl_monitor

7af7e3f

take it easy

f0efba7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-Agent Reinforcement Learning (MARL) - Independent PPO (IPPO) #6

Multi-Agent Reinforcement Learning (MARL) - Independent PPO (IPPO) #6

Uh oh!

MaxGalindo150 commented Dec 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Multi-Agent Reinforcement Learning (MARL) - Independent PPO (IPPO) #6

Are you sure you want to change the base?

Multi-Agent Reinforcement Learning (MARL) - Independent PPO (IPPO) #6

Uh oh!

Conversation

MaxGalindo150 commented Dec 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant