Skip to content

Conversation

@MaxGalindo150
Copy link
Owner

Objective:
This PR introduces the foundational components for the MARL module and implements the first multi-agent algorithm, Independent PPO (IPPO).

Key Changes:

  • Added the MultiAgentRolloutBuffer for storing experiences across multiple agents.
  • Implemented MultiAgentPolicy for decentralized policy management.
  • Adapted the environment wrapper to support multi-agent scenarios.
  • Developed the IPPO algorithm with unit tests and basic environment validation.

To-Do:

  • Implement MultiAgentRolloutBuffer.
  • Develop MultiAgentPolicy.
  • Create a MultiAgentEnvWrapper.
  • Implement IPPO algorithm.
  • Validate IPPO in cooperative and competitive scenarios.
  • Add basic documentation for MARL module.

Next Steps:
Once IPPO is fully integrated, subsequent PRs will focus on:

  1. Implementing MAPPO with centralized training and decentralized execution.
  2. Adding support for other MARL algorithms such as MADDPG and QMIX.

@MaxGalindo150 MaxGalindo150 marked this pull request as draft December 5, 2024 06:40
@MaxGalindo150 MaxGalindo150 added the enhancement New feature or request label Dec 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant