jerryy33/MoE-spec

Expert Specialization in Mixture of Expert Models (Master Thesis)

This repo contains the code for my master thesis, which analyzed expert specialization in the OLMoE-1B-7B-0924 Mixture-of-Experts large language model.

Important Notes

  • To run the scripts you need to have uv installed. If you don't want to install it, you can instead run pip install -r requirements.txt and run the scripts with plain Python.
  • Dependencies are pinned to the versions I used, but newer and older versions might also work.
  • All scripts produce the plots from the thesis as PDFs and may also save result data as CSV or NumPy files.
  • The scripts are not optimized for memory usage and can consume a lot of RAM. Consider running a script for a single layer at a time by passing --layers (or -l), which almost all scripts support.
  • Check all options a script accepts by running uv run <script_name>.py --help

Routing Decisiveness

In this section I investigated how confident the router is when deciding which experts to route tokens to.

Router Entropy

To collect router values and compute entropies across layers, run for example:

uv run router_decisive.py --num_tokens 100000 --layers 1 7 15 
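Conceptually, the entropy is computed per token over the router's softmax distribution across experts; a high entropy means the router spreads probability mass widely (indecisive), while a low entropy means it concentrates on a few experts. A minimal NumPy sketch of this computation (not the repository's exact implementation):

```python
import numpy as np

def router_entropy(logits):
    """Per-token entropy of the router's softmax over experts.

    logits: array of shape (num_tokens, num_experts).
    Higher entropy = less decisive routing.
    """
    logits = logits - logits.max(axis=-1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    probs /= probs.sum(axis=-1, keepdims=True)
    return -(probs * np.log(probs + 1e-12)).sum(axis=-1)

# A perfectly indecisive router (uniform logits over OLMoE's 64 experts)
# yields the maximum entropy, log(64) ≈ 4.16 nats.
uniform = np.zeros((1, 64))
print(router_entropy(uniform))
```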

Routing Distributions

To collect router weights (router logits), router softmax probabilities, and renormalized softmax probabilities, run for example:

uv run router_stats.py --num_tokens 100000 --layers 1 
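"Renormalized softmax probabilities" here presumably refers to keeping only the top-k experts' probabilities and rescaling them to sum to 1 (OLMoE routes each token to the top 8 of 64 experts). A sketch under that assumption:

```python
import numpy as np

def topk_renormalized(probs, k=8):
    """Keep the top-k expert probabilities per token and renormalize them.

    probs: (num_tokens, num_experts) softmax probabilities.
    Returns (indices, weights), both of shape (num_tokens, k);
    each row of weights sums to 1.
    """
    idx = np.argsort(probs, axis=-1)[:, -k:]       # top-k expert indices
    top = np.take_along_axis(probs, idx, axis=-1)  # their probabilities
    return idx, top / top.sum(axis=-1, keepdims=True)
```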

Expert Transformations and Activation Spaces

In this section I examined the MoE expert outputs in activation space as well as the experts' weight matrices, and compared their similarities.

Expert Transformations

To compare expert output similarities in activation space via cosine or CKA run for example:

uv run sim.py --num_tokens 10000 --layers 1 --method CKA
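For reference, linear CKA (centered kernel alignment) compares two activation matrices collected on the same tokens and is invariant to orthogonal transformations and isotropic scaling. A minimal sketch, not necessarily the implementation used in sim.py:

```python
import numpy as np

def linear_cka(X, Y):
    """Linear CKA similarity between two activation matrices.

    X, Y: (num_tokens, hidden_dim) outputs of two experts on the same tokens.
    Returns a value in [0, 1]; 1 means identical up to rotation/scale.
    """
    X = X - X.mean(axis=0)  # center each feature
    Y = Y - Y.mean(axis=0)
    hsic = np.linalg.norm(Y.T @ X, "fro") ** 2
    return hsic / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro"))
```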

To create 2D UMAP visualizations of expert output activations, run for example:

uv run dim_reduc.py --num_tokens 10000 --layers 1 7 15 

Additional Router Measures

To compute additional MoE router metrics (expert selection frequency, domain specialization, expert token overlap), run for example:

uv run router_metrics.py --num_tokens 10000 --layers 1 7 15 --num_experts 64 --topk 8 --num_domains 22
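Of these metrics, expert selection frequency is the simplest: the fraction of all top-k routing slots assigned to each expert, where a perfectly load-balanced router assigns 1/num_experts to every expert. A sketch of that idea (not the script's exact code):

```python
import numpy as np

def selection_frequency(topk_indices, num_experts=64):
    """Fraction of routing slots assigned to each expert.

    topk_indices: (num_tokens, topk) expert ids chosen by the router.
    Returns an array of shape (num_experts,) summing to 1; a perfectly
    balanced router yields 1/num_experts everywhere.
    """
    counts = np.bincount(topk_indices.ravel(), minlength=num_experts)
    return counts / counts.sum()
```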

MoE Experts Case Study

In this section I selected individual experts, created a profile for each, and then causally investigated their effect on model outputs by modifying the routing value assigned to an expert.

To create an expert profile for a given expert as a JSON file, run:

uv run profiles.py --expert_id 9 --layer 1

To steer the model using the MoE router by boosting or ablating the routing weight (softmax value) for a specific expert run for example:

uv run interv.py --prompt "Data collected by Barack Obama" --target_idx 4 --expert_id 9 --layer 1
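The core of such an intervention is scaling one expert's post-softmax routing weight (scale > 1 to boost, 0 to ablate) and renormalizing so the weights still sum to 1. A minimal sketch of that weight manipulation, assuming the rest of the forward pass is left unchanged:

```python
import numpy as np

def steer_routing(weights, expert_id, scale):
    """Scale one expert's routing weight and renormalize per token.

    weights: (num_tokens, num_experts) post-softmax routing weights.
    scale > 1 boosts the expert, scale = 0 ablates it entirely.
    """
    w = weights.copy()
    w[:, expert_id] *= scale
    return w / w.sum(axis=-1, keepdims=True)
```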

About

My master thesis, completed in 2025: a comprehensive analysis of expert specialization in Mixture-of-Experts models.
