This repo contains a single training script `train.py` that:
- Loads a base model (e.g. Qwen3)
- Streams `AI-MO/NuminaMath-CoT`
- Injects `<|jeton|>` tokens before each solution paragraph (optionally merging short paragraphs into larger blocks)
- Trains with NLL + LeJEPA-style latent losses (SIGReg if `lejepa` is installed)
- Uses Adafactor with micro-batch size 1 (and optional gradient accumulation)
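The `<|jeton|>` injection step can be sketched as follows. Note that the helper name `inject_jetons` and the character-count merge rule are illustrative assumptions, not `train.py`'s actual API:

```python
# Sketch of the <|jeton|> injection described above; the merge threshold
# (min_chars) is a hypothetical stand-in for train.py's merging logic.
JETON = "<|jeton|>"

def inject_jetons(solution: str, min_chars: int = 0) -> str:
    """Prefix each solution paragraph with a <|jeton|> token.

    Paragraphs shorter than `min_chars` are merged into the following
    block before the token is inserted (assumed merge rule).
    """
    paragraphs = [p for p in solution.split("\n\n") if p.strip()]
    merged, buffer = [], ""
    for p in paragraphs:
        buffer = f"{buffer}\n\n{p}" if buffer else p
        if len(buffer) >= min_chars:
            merged.append(buffer)
            buffer = ""
    if buffer:  # a trailing short paragraph joins the last block
        if merged:
            merged[-1] = f"{merged[-1]}\n\n{buffer}"
        else:
            merged.append(buffer)
    return "".join(JETON + p for p in merged)

text = "Step one.\n\nStep two.\n\nDone."
print(inject_jetons(text))  # three paragraphs -> three <|jeton|> markers
```

With `min_chars=0` every paragraph keeps its own token; a large threshold collapses the whole solution into one `<|jeton|>` block.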
Install deps (once):

```shell
/venv/main/bin/pip install -U pip
/venv/main/bin/pip install -r requirements.txt
```

Run:
```shell
/venv/main/bin/python train.py \
  --model_name_or_path "unsloth/Qwen3-30B-A3B" \
  --output_dir "./out" \
  --max_steps 1000 \
  --grad_accum 1
```

Run the tests with:

```shell
/venv/main/bin/python -m pytest
```

Notes:
- `--load_in_4bit` is supported for loading, but full fine-tuning quantized weights generally won't work without adapters. This script is written for full fine-tuning.
- If `lejepa` can't be imported, `train.py` uses a lightweight isotropy regularizer as a fallback.