Releases: ServiceNow/AgentLab
v0.4.1
What's Changed
- moved this `post_hoc_get_action` to private codebase by @optimass in #218
- Update repro o1 o3 mini by @jardinetsouffleton in #216
- Add VLLMChatModel support to chat API by @jardinetsouffleton in #220
- Add missing agent imports by @jardinetsouffleton in #223
- adding log_prob option for chat models by @TLSDC in #219
- fixing openai api typo for logprobs by @TLSDC in #224
- Switching _agents_on_benchmark to Study method for flexibility by @TLSDC in #228
- typo in hf chat models by @TLSDC in #229
- Add-claude-3.7 to repro journal by @jardinetsouffleton in #222
- fixing stale test by @TLSDC in #232
- Create a simple pure visual agent. by @recursix in #235
- Gaia bench with tape agent and multitool env by @ollmer in #214
- Generalized AgentLab Env by @recursix in #210
- adding a tag on screenshots for mouse_click coordinate actions by @TLSDC in #240
- Aj/llama4 support by @amanjaiswal73892 in #238
- fix coords tagging in agent_xray.py by @TLSDC in #242
- Allac/next agent by @recursix in #253
- Enhance_xray by @recursix in #256
- Fix count tokens by @gabrielhuang in #254
- Improve xray by @recursix in #259
- Update cost-tracking for OAI chatcompletions and response API by @amanjaiswal73892 in #260
- New experiments by @recursix in #258
- Adding a simple debug agent to manually test actions by @TLSDC in #237
- Update test_study.py by @TLSDC in #244
- Deep debug by @recursix in #262
- Multiaction Support and refactoring by @amanjaiswal73892 in #257
- OSWorld benchmark by @ollmer in #255
- Update relaunch flag to False and adjust model_args for OSWORLD_OAI by @amanjaiswal73892 in #263
- rename langchain BaseMessage by @amanjaiswal73892 in #243
- Adding new llms by @recursix in #266
- Deep debug by @recursix in #265
- Episode to html by @recursix in #267
- Fix: cache breakpoint handling in StructuredDiscussion class. by @amanjaiswal73892 in #269
- Osworld Docker Docs by @ollmer in #271
- Add a citation section by @xhluca in #274
- Xray fixes for backward (pre bgym v0.14.2) compatibility by @amanjaiswal73892 in #275
- Add Litellm API integration by @amanjaiswal73892 in #273
- Add Tutorial, GPT-5 and improve installation. by @amanjaiswal73892 in #278
- xray bugfix by @Emilianopp in #276
- Hints retrieval in tool use agent by @ollmer in #277
- Remove Langchain Dependency by @amanjaiswal73892 in #282
- Fix loading exp_dir using row_index in xray and add error handling. by @amanjaiswal73892 in #283
- Add up-to-date Azure OpenAI Agents by @patricebechard in #284
- Fix tests for Azure OpenAI models, add GPT-4.1-nano by @patricebechard in #286
- Human in the Loop Agent UI and Agent interface by @amanjaiswal73892 in #290
- Make transformer lazy import by @amanjaiswal73892 in #292
- increase timeout indefinitely in agentlab-mentor by @amanjaiswal73892 in #293
- make langchain optional in xray by @amanjaiswal73892 in #299
- Add AgentLab landing page and project pages by @imenelydiaker in #303
- Allow custom VLLM endpoint URL by @imenelydiaker in #306
- add step navigation buttons in xray. by @amanjaiswal73892 in #300
- Support for AnthropicBedrock models by @patricebechard in #307
- Hint use agent by @amanjaiswal73892 in #316
- Automated release versioning and structured PyPI/TestPyPI workflow by @amanjaiswal73892 in #308
- CUA like agent with tool use and hint support. by @amanjaiswal73892 in #318
New Contributors
- @gabrielhuang made their first contribution in #254
- @Emilianopp made their first contribution in #276
- @imenelydiaker made their first contribution in #303
Full Changelog: v0.4.0...v0.4.1
v0.4.0
What's Changed
- chore: update init.py by @eltociear in #181
- parallel study evaluation by @recursix in #180
- Refactor HuggingFace model initialization to include base model name … by @jardinetsouffleton in #190
- limiting to python 3.11 and above by @ThibaultLSDC in #194
- Implement parallel processing for studies using ProcessPoolExecutor a… by @recursix in #195
- Update README.md, fix typo by @ollmer in #196
- small API change - passing exp_root to study.run() by @optimass in #200
- added `study.shuffle_exps()` feature by @optimass in #202
- trying to fix tests by @ThibaultLSDC in #206
- Add new agent configurations for Claude Sonnet 3.5 and vision models by @jardinetsouffleton in #213
New Contributors
- @eltociear made their first contribution in #181
- @optimass made their first contribution in #200
Full Changelog: v0.3.2...v0.4.0
v0.3.2
What's Changed
- displaying exp names in ray dashboard by @ThibaultLSDC in #123
- Fixing goal not being used in ui_assistant mode by @ThibaultLSDC in #124
- Fixing discussion object when adding images w/o detail by @ThibaultLSDC in #128
- Adding descriptive prompts for screenshot/som by @ThibaultLSDC in #129
- Study to multi eval by @recursix in #126
- Update README.md by @recursix in #158
- Update README.md by @recursix in #159
- Enhance README with examples for loading experiment results by @recursix in #160
- Warning notice + link to BrowserGym by @gasse in #164
- 405b results on workarena L2 by @ThibaultLSDC in #163
- Ab res by @ThibaultLSDC in #161
- Add fix for self-hosted HF models by @jardinetsouffleton in #167
- WebArena/VisualWebArena results by @ThibaultLSDC in #168
- Adding suffix to tracker decorator by @ThibaultLSDC in #169
- Multiple output chat and retry function by @ThibaultLSDC in #171
- For webarena agent by @recursix in #172
- fixing pypi workflow dependency by @ThibaultLSDC in #174
- fix: update demo_mode assignment in GenericAgentArgs class by @recursix in #175
- Adapt multiple samples for HF models by @jardinetsouffleton in #173
- automated readthedocs by @ThibaultLSDC in #177
Full Changelog: v0.3.1...v0.3.2
v0.3.2.dev9
version bump
v0.3.2.dev11
last one
v0.3.2.dev10
tmp updates just for a check
v0.3.2.dev7
one last test
v0.3.2.dev6
testing some more stuff
v0.3.2.dev5
version
v0.3.2.dev4
cleaning up