BabyLM

Neural language models semestral work (BabyLM challenge 2026)

Commands

# 1. Tokenize: train BPE then pack the corpus into a uint16 bin
uv run python main.py train-tokenizer
uv run python main.py tokenize-corpus

# 2. Train: pretrain GPT-BERT; each save is a self-contained HF dir
uv run python main.py pretrain --config configs/small.json --wandb

# 3. Eval: run the strict-small zero-shot suite on a checkpoint dir
# One-time: unzip EWoK fast + download/filter full EWoK.
# Requires HF_TOKEN in .env and accepting terms at
# https://huggingface.co/datasets/ewok-core/ewok-core-1.0
./scripts/setup_eval_data.sh
./scripts/eval.sh checkpoints/final fast causal

# 4. Chat: interactive text completion from a checkpoint
uv run python chat.py <run_name>
# e.g. uv run python chat.py bs64_s100000_wu500_lr0.0003_mlr0.1_wd0.1_gc1.0_ga4_mp0.15_hn15_hd16

MetaCentrum cluster

First-time setup (local)

# Make scripts executable
chmod +x scripts/*.sh

# Set credentials
cp .env.example .env
# edit .env: set METACENTRUM_USER and WANDB_API_KEY
chmod 600 .env

# Upload project (includes .env)
./scripts/upload_to_cluster.sh

First-time setup (on the cluster, do once)

ssh your_username@tarkil.grid.cesnet.cz

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh

echo ". \"/storage/praha1/home/$USER/.local/bin/env\"" >> ~/.profile
echo "export UV_CACHE_DIR=/storage/brno2/home/$USER/.uv_cache/" >> ~/.profile
source ~/.profile

# Verify
uv --version

Running jobs

# Submit (on the cluster)
qsub BabyLM/scripts/submit_babylm.sh

# Monitor (on the cluster)
qstat -u $USER

# Download checkpoints (locally, after job finishes)
./scripts/download_results.sh

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
BabyLM		BabyLM
eval @ 467793f		eval @ 467793f
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
.gitmodules		.gitmodules
.python-version		.python-version
README.md		README.md
chat.py		chat.py
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BabyLM

Commands

MetaCentrum cluster

First-time setup (local)

First-time setup (on the cluster, do once)

Running jobs

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BabyLM

Commands

MetaCentrum cluster

First-time setup (local)

First-time setup (on the cluster, do once)

Running jobs

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages