This is the codebase to reproduce the results of the paper Test-Time Adaptation for LLM Agents via Environment Interaction. We propose the adaptation framework -- Grounded Test-Time Adaptation (GTTA).
| Syntactic Alignment (SA) | Dynamics Grounding (DG) |
|---|---|
![]() |
![]() |
We adopt NNetnav's codebase for web navigation exploration and task evaluation. To reproduce our results on WebArena, please refer to this.
For BFCLv3 experiment, we modify our method based on the official gorilla codebase. To reproduce our results on BFCLv3, please refer to this.
For Tau-Bench experiment, please refer to official codebase with syntactic alignment (SA) (parametric adaptation) enabled.
If you find this work useful, please cite:
@inproceedings{chentest,
title={Test-Time Adaptation for LLM Agents via Environment Interaction},
author={Chen, Arthur and Liu, Zuxin and Zhang, Jianguo and Prabhakar, Akshara and Liu, Zhiwei and Heinecke, Shelby and Savarese, Silvio and Zhong, Victor and Xiong, Caiming},
booktitle={The Fourteenth International Conference on Learning Representations}
}This work is licensed under the MIT License.

