Mandala RL

Self-play reinforcement learning system for training a strong Mandala bot using MCTS + neural networks (AlphaZero-style).

Overview

This project trains a Mandala bot entirely through self-play, combining:

Monte Carlo Tree Search (MCTS) for game tree exploration
Policy/Value neural network for position evaluation
Self-play data generation
Iterative training and evaluation
Deterministic Elo-based evaluation ladder
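
The search step listed above can be illustrated with a minimal sketch of AlphaZero-style child selection (the PUCT rule), which balances the network's policy prior against visit statistics. The names Node, select_child, and c_puct are hypothetical and are not taken from the mandala_rl code. The +1 inside the square root is one common variant, chosen here so that priors still break ties before any child has been visited.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical search-tree node; not the project's actual class."""
    prior: float                 # P(s, a) from the policy head
    visit_count: int = 0         # N(s, a)
    value_sum: float = 0.0       # sum of values backed up through this node
    children: dict = field(default_factory=dict)

    def mean_value(self) -> float:
        # Q(s, a); assumed to be stored from the selecting player's perspective.
        return self.value_sum / self.visit_count if self.visit_count else 0.0


def select_child(node: Node, c_puct: float = 1.5):
    """Return the (action, child) pair that maximizes Q + U."""
    total_visits = sum(child.visit_count for child in node.children.values())

    def puct_score(child: Node) -> float:
        # Exploration bonus shrinks as a child accumulates visits.
        exploration = (
            c_puct * child.prior * math.sqrt(total_visits + 1)
            / (1 + child.visit_count)
        )
        return child.mean_value() + exploration

    return max(node.children.items(), key=lambda item: puct_score(item[1]))
```

With no visits recorded, the child with the highest prior is picked; once a child has been visited often with poor results, the bonus on unvisited siblings takes over.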

Optimized to keep GPU compute low on Apple Silicon (PyTorch MPS backend).

Quick Start

# Install dependencies
pip install -r requirements.txt

# Train the bot
python scripts/train.py --config configs/default.yaml

# Evaluate against previous versions
python scripts/evaluate.py --checkpoint data/checkpoints/model_latest.pt
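
For readers unfamiliar with how an Elo ladder ranks checkpoints, here is a minimal sketch of the standard Elo update after one game. The function names and the K-factor of 32 are assumptions made for illustration; the values and logic used by scripts/evaluate.py may differ.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of player A against player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return both new ratings after one game.

    score_a is 1.0 for a win by A, 0.5 for a draw, 0.0 for a loss.
    k is an assumed K-factor, not a value taken from the project.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    # The update is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta
```

Two equally rated checkpoints each have an expected score of 0.5, so a win moves the winner up by k/2 points and the loser down by the same amount.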

Project Structure

mandala_rl/
├── game/          # Mandala game engine and rules
├── mcts/          # Monte Carlo Tree Search implementation
├── network/       # Policy/Value neural network
├── selfplay/      # Self-play game generation
├── training/      # Training loop and replay buffer
└── evaluation/    # Elo rating and arena evaluation

Requirements

Python 3.10+
PyTorch 2.0+ with MPS support
Apple Silicon Mac (M1/M2/M3)
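
To check whether a machine meets these requirements, a small helper along the following lines can report which device PyTorch would use. The function name pick_device is hypothetical and the CPU fallback is an assumption; the project itself may handle device selection differently.

```python
def pick_device() -> str:
    """Return "mps" when PyTorch can use the Apple GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed; nothing can run on the GPU.
        return "cpu"
    # torch.backends.mps is missing in PyTorch builds older than 1.12.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


if __name__ == "__main__":
    print(f"Selected device: {pick_device()}")
```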

