Releases: dreadnode/AIRTBench-Code
v1.0.1
What's Changed
Features
- feat: add failed flag submissions csv as part of dataset by @GangGreenTemperTatum in #22
- feat: add cache retry logic for unsupported providers by @GangGreenTemperTatum in #40
Documentation
- docs: add docs section in readme by @GangGreenTemperTatum in #3
- docs: add airtbench dataset readme by @GangGreenTemperTatum in #23
- docs: update placeholder docs links and citation ref by @GangGreenTemperTatum in #15
- docs: add live arxiv links by @GangGreenTemperTatum in #28
Bug Fixes
- fix: missing renovate config file by @GangGreenTemperTatum in #5
- fix: change rigging decorator agent to o3-mini by @GangGreenTemperTatum in #18
- fix: pythonkernel class trying to call non-existent self.post() metho… by @GangGreenTemperTatum in #31
Chores
- chore: replace large-8 runner by @GangGreenTemperTatum in #4
- chore: refactor from dn.score by @GangGreenTemperTatum in #17
- chore: pin deps in dockerfile for future version updates by @GangGreenTemperTatum in #32
- chore: add final chat step of llm challenge interaction to pipeline by @GangGreenTemperTatum in #38
Dependencies
- chore(deps): update renovatebot/github-action action to v42 by @dreadnode-renovate-bot in #13
- chore(deps): update pre-commit hook pre-commit/mirrors-mypy to v1.16.0 by @dreadnode-renovate-bot in #12
- chore(deps): update pre-commit hook astral-sh/ruff-pre-commit to v0.11.13 by @dreadnode-renovate-bot in #11
- chore(deps): update pre-commit hook adrienverge/yamllint to v1.37.1 by @dreadnode-renovate-bot in #10
- chore(deps): update actions/setup-python action to v5.6.0 by @dreadnode-renovate-bot in #8
- chore(deps): update actions/create-github-app-token action to v2.0.6 by @dreadnode-renovate-bot in #6
- chore(deps): update renovatebot/github-action action to v42.0.5 by @dreadnode-renovate-bot in #16
- chore(deps): update pre-commit hook astral-sh/ruff-pre-commit to v0.12.0 by @dreadnode-renovate-bot in #27
- chore(deps): update renovatebot/github-action action to v42.0.6 by @dreadnode-renovate-bot in #25
- chore(deps): update pre-commit hook pre-commit/mirrors-mypy to v1.16.1 by @dreadnode-renovate-bot in #24
- chore(deps): update renovatebot/github-action action to v43 by @dreadnode-renovate-bot in #33
- chore(deps): update renovatebot/github-action action to v43.0.1 by @dreadnode-renovate-bot in #34
- chore(deps): update pre-commit hook astral-sh/ruff-pre-commit to v0.12.1 by @dreadnode-renovate-bot in #36
- fix(deps): update dependency dreadnode to v1.10.0 by @dreadnode-renovate-bot in #37
- chore(deps): update renovatebot/github-action action to v43.0.2 by @dreadnode-renovate-bot in #39
New Contributors
- @dreadnode-renovate-bot made their first contribution in #13
Full Changelog: v1.0.0...v1.0.1
v1.0.0
What's Changed
- feat: airtbench ai agent code by @GangGreenTemperTatum in #1
- feat: Notebooks/add non llm notebooks by @GangGreenTemperTatum in #2
Full Changelog: https://github.com/dreadnode/AIRTBench-Code/commits/v1.0.0