Releases · ServiceNow/AgentLab

05 Dec 20:50

amanjaiswal73892

v0.4.1

5dfec6d

v0.4.1 Latest

Latest

What's Changed

moved this post_hoc_get_action to private codebase by @optimass in #218
Update repro o1 o3 mini by @jardinetsouffleton in #216
Add VLLMChatModel support to chat API by @jardinetsouffleton in #220
Add missing agent imports by @jardinetsouffleton in #223
adding log_prob option for chat models by @TLSDC in #219
fixing openai api typo for logprobs by @TLSDC in #224
Switching _agents_on_benchmark to Study method for flexibility by @TLSDC in #228
typo in hf chat models by @TLSDC in #229
Add-claude-3.7 to repro journal by @jardinetsouffleton in #222
fixing stale test by @TLSDC in #232
Create a simple pure visual agent. by @recursix in #235
Gaia bench with tape agent and multitool env by @ollmer in #214
Generalized AgentLab Env by @recursix in #210
adding a tag on screenshots for mouse_click coordinate actions by @TLSDC in #240
Aj/llama4 support by @amanjaiswal73892 in #238
fix coords tagging in agent_xray.py by @TLSDC in #242
Allac/next agent by @recursix in #253
Enhance_xray by @recursix in #256
Fix count tokens by @gabrielhuang in #254
Improve xray by @recursix in #259
Update cost-tracking for OAI chatcompletions and response API by @amanjaiswal73892 in #260
New experiments by @recursix in #258
Adding a simple debug agent to manually test actions by @TLSDC in #237
Update test_study.py by @TLSDC in #244
Deep debug by @recursix in #262
Multiaction Support and refactoring by @amanjaiswal73892 in #257
OSWorld benchmark by @ollmer in #255
Update relaunch flag to False and adjust model_args for OSWORLD_OAI by @amanjaiswal73892 in #263
rename langchain BaseMessage by @amanjaiswal73892 in #243
Adding new llms by @recursix in #266
Deep debug by @recursix in #265
Episode to html by @recursix in #267
Fix: cache breakpoint handling in StructuredDiscussion class. by @amanjaiswal73892 in #269
Osworld Docker Docs by @ollmer in #271
Add a citation section by @xhluca in #274
Xray fixes for backward (pre bgym v0.14.2) compatibility by @amanjaiswal73892 in #275
Add Litellm API integration by @amanjaiswal73892 in #273
Add Tutorial, GPT-5 and improve installation. by @amanjaiswal73892 in #278
xray bugfix by @Emilianopp in #276
Hints retrieval in tool use agent by @ollmer in #277
Remove Langchain Dependency by @amanjaiswal73892 in #282
Fix loading exp_dir using row_index in xray and add error handling. by @amanjaiswal73892 in #283
Add up-to-date Azure OpenAI Agents by @patricebechard in #284
Fix tests for Azure OpenAI models, add GPT-4.1-nano by @patricebechard in #286
Human in the Loop Agent UI and Agent interface by @amanjaiswal73892 in #290
Make transformer lazy import by @amanjaiswal73892 in #292
increase timeout indefinetely in agentlab-mentor by @amanjaiswal73892 in #293
make langchain optional in xray by @amanjaiswal73892 in #299
Add AgentLab landing page and project pages by @imenelydiaker in #303
Allow custom VLLM endpoint URL by @imenelydiaker in #306
add step navigation buttons in xray. by @amanjaiswal73892 in #300
Support for AnthropicBedrock models by @patricebechard in #307
Hint use agent by @amanjaiswal73892 in #316
Automated release versioning and structured PyPI/TestPyPI workflow by @amanjaiswal73892 in #308
CUA like agent with tool use and hint support. by @amanjaiswal73892 in #318

New Contributors

@gabrielhuang made their first contribution in #254
@Emilianopp made their first contribution in #276
@imenelydiaker made their first contribution in #303

Full Changelog: v0.4.0...v0.4.1

Contributors

ollmer, recursix, and 9 other contributors

Assets 2

11 Feb 15:10

TLSDC

v0.4.0

fecf700

v0.4.0

What's Changed

chore: update init.py by @eltociear in #181
parallel study evaluation by @recursix in #180
Refactor HuggingFace model initialization to include base model name … by @jardinetsouffleton in #190
limiting to python 3.11 and above by @ThibaultLSDC in #194
Implement parallel processing for studies using ProcessPoolExecutor a… by @recursix in #195
Update README.md, fix typo by @ollmer in #196
small API change - passing exp_root to study.run() by @optimass in #200
added study.shuffle_exps() feature by @optimass in #202
trying to fix tests by @ThibaultLSDC in #206
Add new agent configurations for Claude Sonnet 3.5 and vision models by @jardinetsouffleton in #213

New Contributors

@eltociear made their first contribution in #181
@optimass made their first contribution in #200

Full Changelog: v0.3.2...v0.4.0

Contributors

ollmer, recursix, and 4 other contributors

Assets 2

09 Dec 20:07

github-actions

v0.3.2

b5c023a

v0.3.2

What's Changed

displaying exp names in ray dashboard by @ThibaultLSDC in #123
Fixing goal not being used in ui_assistant mode by @ThibaultLSDC in #124
Fixing discussion object when adding images w/o detail by @ThibaultLSDC in #128
Adding descriptive prompts for screenshot/som by @ThibaultLSDC in #129
Study to multi eval by @recursix in #126
Update README.md by @recursix in #158
Update README.md by @recursix in #159
Enhance README with examples for loading experiment results by @recursix in #160
Warning notice + link to BrowserGym by @gasse in #164
405b results on workarena L2 by @ThibaultLSDC in #163
Ab res by @ThibaultLSDC in #161
Add fix for self-hosted HF models by @jardinetsouffleton in #167
WebArena/VisualWebArena results by @ThibaultLSDC in #168
Adding suffix to tracker decorator by @ThibaultLSDC in #169
Multiple output chat and retry function by @ThibaultLSDC in #171
For webarena agent by @recursix in #172
fixing pypi workflow dependency by @ThibaultLSDC in #174
fix: update demo_mode assignment in GenericAgentArgs class by @recursix in #175
Adapt multiple samples for HF models by @jardinetsouffleton in #173
automated readthedocs by @ThibaultLSDC in #177