Batch benchmark results for missed merge validations #627

Merged: pei-tinybird merged 2 commits into main from fix/retry-benchmark-validation on Jun 18, 2026

pei-tinybird commented Jun 18, 2026

Models benchmarked:
anthropic/claude-opus-4 openai/gpt-5.1-chat mistralai/mistral-large-2512 poolside/laguna-xs.2:free meta-llama/llama-3.1-8b-instruct google/gemma-3-12b-it google/gemma-3n-e4b-it openai/gpt-oss-20b:free google/gemma-3-27b-it

This PR replays the benchmark result validation for every failed "Validate benchmark results on merge" run found in the available GitHub Actions history. In each case, the workflow reached the Tinybird validation step but then failed while pushing config/cleanup changes back to main.

It also serializes the merge-validation workflow with a concurrency group so future validations do not race each other when updating main.
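
For readers unfamiliar with the mechanism, here is a minimal sketch of what serializing a workflow with a concurrency group can look like in GitHub Actions. The workflow name comes from the description above; the trigger and the group name are assumptions for illustration and are not taken from this PR's diff.

```yaml
# Hypothetical excerpt of a workflow file; only the workflow name comes from
# the PR description. The trigger and group name below are assumed.
name: Validate benchmark results on merge

on:
  push:
    branches: [main]   # assumed trigger

concurrency:
  # Every run that resolves to the same group string shares one slot,
  # so only one validation run updates main at a time.
  group: validate-benchmark-results-on-merge   # assumed group name
  # Let the in-flight run finish instead of cancelling it when a new one starts.
  cancel-in-progress: false
```

With `cancel-in-progress: false`, a new run waits for the running one to finish. One caveat: GitHub keeps at most one pending run per group, so if several merges land in quick succession, a newer queued run replaces an older queued one. That may be part of why batching or replaying missed validations remains useful.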

pei-tinybird self-assigned this Jun 18, 2026

vercel Bot commented Jun 18, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| llm-benchmark | Ready | Preview, Comment | Jun 18, 2026 10:43am |

pei-tinybird merged commit 7c6e4ac into main Jun 18, 2026
3 checks passed