srv operator(): got exception: {"error":{"code":500,"message":"...
Error: Jinja Exception: Cannot pass both content and thinking in an
assistant message with tool calls! Put the analysis message in one
or the other, but not both.","type":"server_error"}}
Summary
I had a working build of llama-cli and llama-server that was previously stably running gpt-oss-120b from ggml huggingface. After pulling and rebuilding many commits later, I found that during multi-turn chat requests (specifically while coding against this model using OpenCode) that 500 errors were starting to appear.
I narrowed down the issue to a specific PR that caused the regression. There is a potential one-line fix for this.
Details
'content' is not definedwhen a multi-turn conversation has an assistant message with bothreasoning_contentandtool_callsreasoning_content->thinkingcopy leavescontenton the adjusted message, which the template'sthinkingbranch doesn't expectError output:
Server error output