Project Euler — Cross-Language Benchmarks

Per-invocation cost of solving Project Euler problems in 10 programming languages.

Solutions written by Claude (Opus + Sonnet) across C, C++, Rust, Go, Zig, Java, C#, JavaScript, Python, and ARM64 Assembly. Benchmarked on Apple Silicon. See JOURNEY.md for the full story — including the reset from 200+ problems back to a verified 10×10 core and the disciplined expansion since.

Current scope: 100 problems × 10 languages = 1000 measurements

Every (language, problem) in scope has passed an audit for state-leak safety, answer correctness, and methodology fit. We extend carefully — each new problem gets the same audit before it appears in these numbers.

See RESULTS.md for the rankings, the per-problem detail grid, and the full methodology.

What we measure

One thing: how long a solution takes to run, starting from a fresh OS process.

For each (language, problem):

1. Build the binary.
2. Run it 10 times, each as a fresh fork + exec invocation.
3. Compare the program's output against the canonical answer (each source file's // Answer: header comment). Abort on mismatch.
4. Report the median wall time across the 10 runs.
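The steps above can be sketched in a few lines of Python. This is an illustration only, not the code of the Go tool: the names `canonical_answer` and `median_wall_time` are hypothetical, the exact header format is assumed to be a single comment line such as `// Answer: 233168`, and each program is assumed to print only its answer on stdout.

```python
import re
import statistics
import subprocess
import time

# Assumed header format: a comment line such as "// Answer: 233168".
# Other comment leaders (#, ;) are accepted as a guess for non-C-style languages.
ANSWER_RE = re.compile(r"^\s*(?://|#|;)\s*Answer:\s*(\S+)", re.MULTILINE)


def canonical_answer(source_text):
    """Return the value from the first 'Answer:' header comment, or None."""
    match = ANSWER_RE.search(source_text)
    return match.group(1) if match else None


def median_wall_time(cmd, expected, iters=10):
    """Run cmd in `iters` fresh processes; return the median wall time in seconds."""
    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        # subprocess.run spawns a new OS process for every iteration,
        # so no in-process state carries over between samples.
        result = subprocess.run(cmd, capture_output=True, text=True, check=True)
        samples.append(time.perf_counter() - start)
        actual = result.stdout.strip()
        if actual != expected:
            # Abort on mismatch: a fast wrong answer must not be reported.
            raise RuntimeError(f"answer mismatch: got {actual!r}, expected {expected!r}")
    return statistics.median(samples)
```

The median is used rather than the mean so that a single slow run (for example, one disturbed by other system activity) does not shift the reported number.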

This matches what a real CLI user, cron job, or shell-loop invocation pays. It doesn't reward language-internal caches (Rust OnceLock, primesieve internal state, @lru_cache) that disappear at process exit anyway: every invocation starts in a fresh process with none of that state, so each language is measured at its actual per-invocation cost.
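A small experiment shows why in-process caches cannot help here. The sketch below starts a child Python interpreter twice; each child reports how many entries its `@lru_cache` holds at startup and then fills the cache. The function `square` and the helper `cache_size_at_startup` are hypothetical names used only for this illustration.

```python
import subprocess
import sys

# Child program: report how many entries the cache holds at startup, then fill it.
CHILD = """
from functools import lru_cache

@lru_cache(maxsize=None)
def square(n):
    return n * n

print(square.cache_info().currsize)  # entries present before any call
square(12)                           # populate the cache; lost at process exit
"""


def cache_size_at_startup():
    """Run CHILD in a fresh interpreter and return the cache size it reports."""
    result = subprocess.run(
        [sys.executable, "-c", CHILD],
        capture_output=True, text=True, check=True,
    )
    return int(result.stdout.strip())


first = cache_size_at_startup()
second = cache_size_at_startup()
print(first, second)  # both runs start with an empty cache
```

The second run sees nothing the first run cached, which is the situation every timed invocation in this benchmark is in.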

What we don't measure (and why)

- In-process warm iterations. A "1000 iterations in a tight loop" metric is meaningful for server / daemon scenarios, but those are a different question with different right answers. See JOURNEY.md (particularly the "From In-Process Warm to Process-Per-Iteration" chapter) for the full reasoning behind retiring that metric.
- Compile time as a headline number. Build cost is real, but in our model the binary is built once and invoked many times. It is recorded as diagnostic data, not as part of the ranking.

Reproducibility

cd ProjectEuler.Benchmarks
cmd/euler-bench/euler-bench per-iter --lang all --problems 1-100 --iters 10 --write
python3 report.py

The Go tool (cmd/euler-bench/) is the single source of truth for measurement — one binary builds, runs, validates answers, and writes sanitized data atomically. No flock, no hook chain, no per-language scripts. See JOURNEY.md for the data-architecture refactor story.
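For readers unfamiliar with atomic writes, the usual pattern is to write to a temporary file in the same directory and then rename it over the target. The sketch below shows that pattern in Python; it is not the Go tool's implementation, and `write_json_atomically` is a hypothetical name.

```python
import json
import os
import tempfile


def write_json_atomically(path, payload):
    """Write payload as JSON so readers see either the old file or the new one."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file must live in the target directory: a rename is only
    # atomic when source and destination are on the same filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            json.dump(payload, handle, indent=2, sort_keys=True)
            handle.flush()
            os.fsync(handle.fileno())  # push the bytes to disk before renaming
        os.replace(tmp_path, path)     # atomic swap of the complete file
    except BaseException:
        # Never leave a half-written temp file behind on failure.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

Because the target is only ever replaced by a complete file, an interrupted run cannot leave a truncated data file behind, which is one reason a separate lock is unnecessary for a single writer.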

Trust + safety

This repo is public; the per-language repos are private. Per the project's PE compliance rules, data/<lang>.json files NEVER contain an answer field, for any problem. Full data with answers lives in data/private/ (gitignored, local-only) for verification + debugging. Triple-layer defense:

1. The euler-bench per-iter --write writer has no code path that emits answers to public files.
2. A post-write readback assertion fails loudly if any answer key reaches a public file.
3. A pre-commit hook (scripts/sanitization_gate.py) runs the same check independently.
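The readback and pre-commit checks both come down to "no answer key anywhere in the JSON". A minimal sketch of such a check follows. It is not the contents of scripts/sanitization_gate.py: the key name `answer` is taken from the text above, while the function names and the sample field names (`problems`, `id`, `median_ms`) are assumptions made for illustration.

```python
import json

FORBIDDEN_KEY = "answer"  # key name taken from the README text


def find_forbidden_keys(node, path="$"):
    """Return the JSON paths of every FORBIDDEN_KEY found at any nesting depth."""
    hits = []
    if isinstance(node, dict):
        for key, value in node.items():
            child = f"{path}.{key}"
            if key == FORBIDDEN_KEY:
                hits.append(child)
            hits.extend(find_forbidden_keys(value, child))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            hits.extend(find_forbidden_keys(value, f"{path}[{index}]"))
    return hits


def assert_sanitized(json_text):
    """Raise ValueError if the document contains an answer key anywhere."""
    hits = find_forbidden_keys(json.loads(json_text))
    if hits:
        raise ValueError(f"answer key found in public data at: {', '.join(hits)}")


# Hypothetical record shapes, for illustration only.
clean = '{"problems": [{"id": 1, "median_ms": 0.4}]}'
leaky = '{"problems": [{"id": 1, "median_ms": 0.4, "answer": "redacted"}]}'
assert_sanitized(clean)  # passes silently
```

Walking the whole tree, rather than checking only top-level keys, means a nested answer field is caught as well.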

Repo layout

| Path | What |
| --- | --- |
| `RESULTS.md` | The numbers: rankings, per-problem grid, methodology |
| `JOURNEY.md` | The story: how we got here, what we learned, what we tried that didn't work |
| `cmd/euler-bench/` | The Go benchmark + write tool (`run`, `failures`, `status`, `collect`, `per-iter`) |
| `report.py` | Markdown + chart generator (reads `data/`, writes `RESULTS.md` + `charts/`) |
| `data/<lang>.json` | Public bench data (sanitized) |
| `data/private/` | Full bench data (gitignored) |
| `scripts/sanitization_gate.py` | Pre-commit hook: enforce no-answer-in-public-data |
| `archive/legacy/` | Pre-2026-05-23 site (three-mode-report era + per-tier coverage); historical reference only |

License + contact

Project Euler problems and answers belong to Project Euler. Per their publishing policy, solution discussion above problem 100 is restricted. This repo strictly observes that boundary: machine-readable answer values appear in no public file, regardless of problem number. Discussion in the Markdown files (story, methodology, scope explanations) follows the ≤100 rule.

Solutions were generated and audited by Claude. Methodology discussion + the open question of what we should add next live in the GitHub issues of the public repo.
