
Add vLLM backend for open weight model evaluation #34

Open
lewtun wants to merge 1 commit into SalesforceAIResearch:main from huggingface:add-vllm
Conversation


@lewtun lewtun commented Sep 4, 2024

This PR adds support for a vLLM backend so that Hugging Face models like Salesforce/xLAM-v0.1-r can be evaluated. I've shown how this works for WebShop and am happy to extend it to the other benchmarks if the API looks good to you.
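The evaluation workflow implied by this PR would first stand up a local vLLM server exposing an OpenAI-compatible API. A minimal sketch, assuming vLLM's `api_server` entrypoint and its default port (the exact command and flags are an assumption, not verbatim from the PR):

```shell
# Serve the open-weight model via vLLM's OpenAI-compatible server
# (listens on http://localhost:8000/v1 by default).
python -m vllm.entrypoints.openai.api_server --model Salesforce/xLAM-v0.1-r
```

The benchmark harness then talks to this endpoint through the standard OpenAI client, so no benchmark code needs model-specific inference logic.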


salesforce-cla bot commented Sep 4, 2024

Thanks for the contribution! Before we can merge this, we need @lewtun to sign the Salesforce Inc. Contributor License Agreement.

from openai import OpenAI

class VllmChatModel(BaseLLM):
    def __init__(self, llm_config: LLMConfig):
        super().__init__(llm_config)
        # Point the OpenAI client at the local vLLM server's
        # OpenAI-compatible endpoint; vLLM ignores the API key.
        self.client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
lewtun (Author) commented:

This is the default endpoint in vLLM, but I could add it as an env variable if preferred
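Making the endpoint configurable could look like the sketch below. The variable name `VLLM_BASE_URL` is an assumption for illustration, not part of this PR; it falls back to vLLM's default endpoint when unset:

```python
import os

# Hypothetical env-variable override (VLLM_BASE_URL is an assumed name);
# defaults to vLLM's standard OpenAI-compatible endpoint.
base_url = os.environ.get("VLLM_BASE_URL", "http://localhost:8000/v1")

# The client in VllmChatModel would then be constructed as:
#   self.client = OpenAI(base_url=base_url, api_key="EMPTY")
print(base_url)
```

This keeps the default behavior unchanged while letting users point the harness at a remote or non-default vLLM server.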

  rewards = []
  all_task_ids = list(range(0, 251))
- REWARD_LOG_FILE = f"{args.llm}_{args.agent_arch}_results_webshop.csv"
+ REWARD_LOG_FILE = f"{args.llm.replace('/', '_')}_{args.agent_arch}_results_webshop.csv"
lewtun (Author) commented:

This is needed because Hugging Face model repos are named in the form {org}/{repo_name}, and the slash is treated as a directory separator when writing the results file to disk.
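The transformation is straightforward to check in isolation. Here, `args_llm` and `agent_arch` are example values (the actual agent architecture names in the repo may differ):

```python
args_llm = "Salesforce/xLAM-v0.1-r"  # Hugging Face repo IDs contain a slash
agent_arch = "react"                  # illustrative value, not from the PR

# Without .replace, "Salesforce/xLAM-v0.1-r_..." would be interpreted as a
# path under a (likely nonexistent) "Salesforce/" directory.
filename = f"{args_llm.replace('/', '_')}_{agent_arch}_results_webshop.csv"
print(filename)  # Salesforce_xLAM-v0.1-r_react_results_webshop.csv
```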

@JimSalesforce (Contributor) left a comment:

evaluate local model
