Skip to content

Blog on ResponsesAPI & MCP#179

Open
qandrew wants to merge 7 commits intovllm-project:mainfrom
qandrew:mcp-responses-blog
Open

Blog on ResponsesAPI & MCP#179
qandrew wants to merge 7 commits intovllm-project:mainfrom
qandrew:mcp-responses-blog

Conversation

@qandrew
Copy link
Copy Markdown

@qandrew qandrew commented Mar 11, 2026

No description provided.

Signed-off-by: Andrew Xia <axia@fb.com>
Signed-off-by: Andrew Xia <axia@fb.com>
Signed-off-by: Andrew Xia <axia@fb.com>
Comment thread _posts/2026-03-11-responses-api.md Outdated
Comment thread _posts/2026-03-11-responses-api.md Outdated
Signed-off-by: Andrew Xia <axia@fb.com>
Signed-off-by: Andrew Xia <axia@fb.com>
Andrew Xia added 2 commits March 17, 2026 10:09
Signed-off-by: Andrew Xia <axia@fb.com>
Signed-off-by: Andrew Xia <axia@fb.com>
@qandrew qandrew marked this pull request as ready for review March 17, 2026 17:31
@qandrew
Copy link
Copy Markdown
Author

qandrew commented Mar 17, 2026

@chaunceyjiang @esmeetu @yeqcharlotte @sfeng33 please take a look, this is ready now :)

Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 6fdb94ca11

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1")
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Pass an API key when creating OpenAI client

This example initializes OpenAI without api_key (and without showing OPENAI_API_KEY setup), which causes the snippet to fail in a fresh environment before any request is sent. Readers following this “Getting Started” flow are likely to hit a client initialization error unless they already have the env var set, so the sample should include a placeholder key (for example api_key="null") or an explicit env export step.

Useful? React with 👍 / 👎.

Copy link
Copy Markdown
Member

@mgoin mgoin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@qandrew is the responses api and mcp support really stable enough to blog about like this? I was under the impression that support for non-gpt-oss models was still under development

Comment on lines +184 to +186
## Evals

With vLLM's ResponsesAPI implementation, we were able to replicate Kimi K2's HLE score of 23.9. We used the open-source HLE test harness with OpenAI's o3-mini as a judge. We also ran GPT-OSS against the vLLM ResponsesAPI with MCP tools (including browser, python, and container). With high reasoning on GPT-OSS 120B, we achieved a score of 0.97 on AIME 2025, which matches OpenAI's GPT-OSS model card.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you show an example of how the eval was run? Does it work with models other than gpt-oss?

@chaunceyjiang
Copy link
Copy Markdown
Contributor

@qandrew is the responses api and mcp support really stable enough to blog about like this?

@mgoin I think this is necessary, since the Responses API is gradually replacing Chat Completions.

layout: post
title: "Enabling ResponsesAPI and MCP on vLLM"
author: "Meta"
image: /assets/logos/vllm-logo-text-light.png
Copy link
Copy Markdown
Member

@esmeetu esmeetu Mar 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we make a architechure or overview image of ResponseAPI here?
Besides, add frontend tag for this blog.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants