Workflow runs · sbintuitions/flexeval

Actions

All workflows
Workflows
- Copilot code review Copilot code review
- pages-build-deployment pages-build-deployment
- Publish PyPI Publish PyPI
- Run batch-api tests Run batch-api tests
- Run tests Run tests
- Update Documentation Update Documentation
Management
- Caches
- Deployments

All workflows

Actions

Loading...
Loading

Showing runs from all workflows

686 workflow runs

[WIP] Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run batch-api tests #146: Pull request #285 synchronize by junya-takayama

2m 10s load_lmoutput

load_lmoutput

2m 10s

[WIP] Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run tests #993: Pull request #285 synchronize by junya-takayama

19m 57s load_lmoutput

load_lmoutput

19m 57s

[WIP] Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run tests #992: Pull request #285 synchronize by junya-takayama

17m 52s load_lmoutput

load_lmoutput

17m 52s

[WIP] Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run batch-api tests #145: Pull request #285 synchronize by junya-takayama

4m 38s load_lmoutput

load_lmoutput

4m 38s

[WIP] Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run tests #991: Pull request #285 opened by junya-takayama

20m 25s load_lmoutput

load_lmoutput

20m 25s

[WIP] Enable access to reasoning_text and tool_calls in post-hoc LLM judges via flexeval_file. Run batch-api tests #144: Pull request #285 opened by junya-takayama

4m 51s load_lmoutput

load_lmoutput

4m 51s

pages build and deployment pages-build-deployment #57: by github-pages Bot

25s gh-pages

gh-pages

25s

v0.17.1 Update Documentation #54: Release v0.17.1 published by junya-takayama

3m 53s

v0.17.1 Run batch-api tests #143: Release v0.17.1 published by junya-takayama

4m 29s

v0.17.1 Publish PyPI #72: Release v0.17.1 published by junya-takayama

30s

Merge pull request #284 from sbintuitions/fix_truncate_base64 Run tests #990: Commit 83123c9 pushed by junya-takayama

21m 20s main

main

21m 20s

Fix non-string types like bool / int being saved as string in outputs.jsonl Run tests #989: Pull request #284 synchronize by junya-takayama

20m 42s fix_truncate_base64

fix_truncate_base64

20m 42s

Fix non-string types like bool / int being saved as string in outputs.jsonl Run tests #988: Pull request #284 synchronize by junya-takayama

21m 24s fix_truncate_base64

fix_truncate_base64

21m 24s

Fix non-string types like bool / int being saved as string in outputs.jsonl Run tests #987: Pull request #284 opened by junya-takayama

22m 2s fix_truncate_base64

fix_truncate_base64

22m 2s

Merge pull request #283 from sbintuitions/support_references_with_ope… Run tests #986: Commit 55feb20 pushed by junya-takayama

20m 58s main

main

20m 58s

Support reference answers in the OpenAIMessagesDataset Run tests #985: Pull request #283 synchronize by junya-takayama

20m 57s support_references_with_openaidataset

support_references_with_openaidataset

20m 57s

Support reference answers in the OpenAIMessagesDataset Run tests #984: Pull request #283 synchronize by junya-takayama

21m 28s support_references_with_openaidataset

support_references_with_openaidataset

21m 28s

Support reference answers in the OpenAIMessagesDataset Run tests #983: Pull request #283 opened by junya-takayama

22m 18s support_references_with_openaidataset

support_references_with_openaidataset

22m 18s

pages build and deployment pages-build-deployment #56: by github-pages Bot

23s gh-pages

gh-pages

23s

v0.17.0 Update Documentation #53: Release v0.17.0 published by junya-takayama

2m 50s

v0.17.0 Publish PyPI #71: Release v0.17.0 published by junya-takayama

39s

v0.17.0 Run batch-api tests #142: Release v0.17.0 published by junya-takayama

4m 22s

Merge pull request #282 from sbintuitions/update_vllm_0.17.0 Run tests #982: Commit b5d1d2a pushed by junya-takayama

21m 40s main

main

21m 40s

Upgrade vLLM to v0.17.1 Run tests #981: Pull request #282 synchronize by junya-takayama

21m 13s update_vllm_0.17.0

update_vllm_0.17.0

21m 13s

Upgrade vLLM to v0.17.1 Run tests #980: Pull request #282 synchronize by junya-takayama

21m 1s update_vllm_0.17.0

update_vllm_0.17.0

21m 1s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

All workflows

Actions

Loading...
Loading

All workflows

Uh oh!

Filter by Workflow

Sorry, something went wrong.

Sorry, something went wrong.

No matching workflows.

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: sbintuitions/flexeval

Actions

All workflows All workflows Actions Loading... Loading Sorry, something went wrong. Uh oh! There was an error while loading. Please reload this page.

All workflows

All workflows

Actions

Loading...
Loading