🤖 refactor: remove dead code from terminal bench Python scripts #1747

ammar-agent · 2026-01-18T00:06:21Z

Summary

streamline upload-tbench-results.py with shared run metadata, deterministic ordering, and safer parsing
simplify leaderboard submission flow with centralized command/JSON handling and clearer artifact grouping

Testing

make static-check (fails: zizmor known-vulnerable-actions audit hit a 403 from GitHub API)

Generated with mux • Model: openai:gpt-5.2-codex • Thinking: high • Cost: $20.48

- Remove unused extract_model_from_config() and extract_thinking_from_config() functions from upload-tbench-results.py (model/thinking are read inline) - Remove unused n_total_trials variable from build_rows() - Remove unused _last_environment field from MuxAgent class Net -15 LoC

github-actions bot added the refactor label Jan 18, 2026

🤖 refactor: streamline terminal bench Python tooling

9487cbc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

🤖 refactor: remove dead code from terminal bench Python scripts #1747

🤖 refactor: remove dead code from terminal bench Python scripts #1747

ammar-agent commented Jan 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

🤖 refactor: remove dead code from terminal bench Python scripts #1747

Are you sure you want to change the base?

🤖 refactor: remove dead code from terminal bench Python scripts #1747

Conversation

ammar-agent commented Jan 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ammar-agent commented Jan 18, 2026 •

edited

Loading