Ci/metrics history by orionpapadakis · Pull Request #114 · beehive-lab/GPULlama3.java

orionpapadakis · 2026-05-07T14:31:02Z

Performance metrics history pipeline

Adds end-to-end collection of benchmark metrics from every CI run into a persistent docs/perf-history.jsonl file committed back to main.

What changed

scripts/write_metrics_sidecar.py (new)
Helper script that writes a *.meta.json sidecar from KEY=VALUE shell arguments. Coerces JSON-native types (booleans, numbers) automatically.

scripts/process_metrics.py (rewritten)

Discovers metric+sidecar pairs recursively via rglob("*.json")
Schema is fully open-ended — no required fields in either file
Missing fields become null (not 0) in the JSONL output
Each row includes flat compat fields plus nested "benchmark" and "metrics" objects for forward compatibility

.github/workflows/build-and-run.yml

Every ./llama-tornado step now sets JAVA_TOOL_OPTIONS to emit a metrics JSON and calls write_metrics_sidecar.py to write the matching sidecar (same stem, same directory)
All artifact pairs are uploaded and consumed by publish-performance-history
publish-performance-history only runs on pushes to main (not PRs or forks), avoiding duplicate entries and push-permission errors

Result

One JSONL row per benchmark step per merge to main, covering all backends, models, quantizations, and inference configurations currently in CI.

…formance history updates

…uild-and-run job

…adability

…s) for all running steps

…scovery and metadata handling

…etadata JSON files

…ify testing scenarios

…r PR merges and main branch pushes

orionpapadakis added 7 commits May 7, 2026 17:32

[wip][ci] Add metrics tracking and publishing in CI workflows

2b05a0d

[ci] Introduce GPULlama3 performance history visualization page

7238d64

[ci] Add history logging in JSONL format

b99a5bf

[ci] Enhance CI workflows with performance history recording

a8928ba

[ci] Add process_metrics.py script for benchmark processing and per…

e472d02

…formance history updates

[ci] Remove redundant leading slash in model paths

12f391e

[ci][wip] Update CI workflows to use JSON format for metrics storage

63fb264

orionpapadakis force-pushed the ci/metrics-history branch from 0407389 to 63fb264 Compare May 7, 2026 14:32

orionpapadakis added 7 commits May 8, 2026 16:21

[ci] Refactor workflow to decouple performance history updates from b…

725b0d4

…uild-and-run job

[ci] Remove redundant whitespace in build-and-run.yml to improve re…

987b7c8

…adability

[ci] Enable performance metrics collection (metadata sidecar + metric…

16c43b2

…s) for all running steps

[ci] Simplify process_metrics.py workflow with recursive metrics di…

56d2593

…scovery and metadata handling

[ci] Add write_metrics_sidecar.py script for generating benchmark m…

0e16384

…etadata JSON files

[ci] Update guard condition in publish-performance-history to simpl…

116fa65

…ify testing scenarios

[ci] Restore full guard condition in publish-performance-history fo…

c75664a

…r PR merges and main branch pushes

orionpapadakis marked this pull request as ready for review May 12, 2026 11:26

stratika self-requested a review May 12, 2026 11:51

stratika assigned orionpapadakis May 12, 2026

stratika added the enhancement New feature or request label May 12, 2026

stratika approved these changes May 12, 2026

View reviewed changes

stratika merged commit 2d442a0 into main May 12, 2026
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ci/metrics history#114

Ci/metrics history#114
stratika merged 14 commits into
mainfrom
ci/metrics-history

orionpapadakis commented May 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

orionpapadakis commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Performance metrics history pipeline

What changed

Result

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

orionpapadakis commented May 7, 2026 •

edited

Loading