Gambiarra to accept tool calls. by rivaldodev · Pull Request #559 · microsoft/BitNet

rivaldodev · 2026-05-19T14:04:31Z

Adds a patch mechanism to support OpenAI-style tool calls in the bundled llama.cpp server without changing the submodule pointer.

Changes

Adds patches/llama-server-tools.patch.
Adds utils/apply_local_patches.py.
Updates setup_env.py to apply local patches before building.
Enables the server to accept tools / tool_choice requests instead of rejecting them.
Converts strict JSON tool-call responses into OpenAI-compatible message.tool_calls.
Handles both tool_calls and tool_call response shapes.
Converts tool result messages into user-visible observations so the model can produce a final answer.

rivaldodev · 2026-05-19T14:07:56Z

@microsoft-github-policy-service agree

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a local patching mechanism to modify the bundled llama.cpp server at build time (without moving the submodule pointer) so it can accept OpenAI-style tools / tool_choice requests and translate strict JSON tool-call outputs into OpenAI-compatible message.tool_calls.

Changes:

Introduces a Python patch applier that updates llama.cpp sources (including a CORS preflight tweak) prior to builds.
Adds a unified diff patch that adapts the llama.cpp server utilities for tool calls / tool results conversion.
Updates the environment setup script to apply patches automatically before compiling.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.

File	Description
utils/apply_local_patches.py	Implements local unified-diff patch application and a direct CORS block replacement.
setup_env.py	Runs the local patch step automatically before build steps.
patches/llama-server-tools.patch	Patch content that modifies llama.cpp server utils to support tool calls and tool result formatting.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

Comments suppressed due to low confidence (2)

patches/llama-server-tools.patch:1

nlohmann::json::contains() throws type_error when the JSON value is not an object. If the model outputs valid JSON that’s an array/number/string (e.g., [], "ok"), this will crash the server instead of returning “no tool calls”. Add an early guard like if (!parsed.is_object()) return json::array(); (or equivalent) before calling contains().

diff --git a/examples/server/utils.hpp b/examples/server/utils.hpp

patches/llama-server-tools.patch:1

This fallback treats any JSON object containing "name" (or "function") as a tool call, which can misclassify normal “JSON-only” answers (common for structured output) as tool invocations and incorrectly set finish_reason = "tool_calls". Tighten the detection criteria (e.g., require tool_calls/tool_call, or require arguments alongside name, and/or only attempt this upgrade when the request actually provided tools).

diff --git a/examples/server/utils.hpp b/examples/server/utils.hpp

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated no new comments.

Gambiarra to accept tool calls.

9b76190

Copilot AI review requested due to automatic review settings May 19, 2026 14:04

Copilot AI reviewed May 19, 2026

View reviewed changes

Copilot started reviewing on behalf of rivaldodev May 19, 2026 14:17 View session

feat: enhance CORS patching and refactor patch application logic

7a0f8af

rivaldodev requested a review from Copilot May 19, 2026 14:34

Copilot AI reviewed May 19, 2026

View reviewed changes

Comment thread utils/apply_local_patches.py Outdated

Comment thread utils/apply_local_patches.py

Comment thread utils/apply_local_patches.py

Comment thread setup_env.py

Copilot started reviewing on behalf of rivaldodev May 19, 2026 14:55 View session

Address Copilot patch review feedback

dca304a

rivaldodev requested a review from Copilot May 19, 2026 15:00

Copilot started reviewing on behalf of rivaldodev May 19, 2026 15:09 View session

Handle escaped tool call JSON

acd69a4

Copilot AI reviewed May 19, 2026

View reviewed changes

rivaldodev requested a review from Copilot May 19, 2026 15:26

Copilot started reviewing on behalf of rivaldodev May 19, 2026 15:28 View session

Copilot AI reviewed May 19, 2026

View reviewed changes

Support streaming tool call conversion

709ec29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gambiarra to accept tool calls.#559

Gambiarra to accept tool calls.#559
rivaldodev wants to merge 5 commits into
microsoft:mainfrom
rivaldodev:gambiarra

rivaldodev commented May 19, 2026

Uh oh!

rivaldodev commented May 19, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rivaldodev commented May 19, 2026

Uh oh!

rivaldodev commented May 19, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants