Eval bug: reasoning off gives reasoning medium for gpt-oss #20458
Description
Name and Version
./llama-cli --version
load_backend: loaded RPC backend from /home/tb/code/llama-b8287/libggml-rpc.so
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon 780M Graphics (RADV PHOENIX) (radv) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 64 | shared memory: 65536 | int dot: 1 | matrix cores: KHR_coopmat
load_backend: loaded Vulkan backend from /home/tb/code/llama-b8287/libggml-vulkan.so
load_backend: loaded CPU backend from /home/tb/code/llama-b8287/libggml-cpu-zen4.so
version: 8287 (acb7c7906)
built with GNU 11.4.0 for Linux x86_64
Using --reasoning off for gpt-oss gives "Reasoning: medium"; should it give "Reasoning: low" instead?
Using --reasoning-budget 0 for gpt-oss also gives "Reasoning: medium"; should it give "Reasoning: low" instead?
Using --chat-template-kwargs '{"reasoning_effort": "low"}' for gpt-oss gives "Reasoning: low" (ok).
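The behavior suggested by the three observations above can be sketched as follows. `resolve_reasoning_effort` is a hypothetical helper for illustration only, not llama.cpp code: explicit chat-template kwargs win, and `--reasoning off` or `--reasoning-budget 0` would fall back to "low" instead of the template default "medium".

```python
from typing import Optional

def resolve_reasoning_effort(reasoning: str = "auto",
                             reasoning_budget: int = -1,
                             kwargs_effort: Optional[str] = None) -> str:
    """Hypothetical resolution order for gpt-oss reasoning effort.

    Illustrates the behavior this report suggests, not the actual
    llama.cpp implementation.
    """
    if kwargs_effort is not None:
        # --chat-template-kwargs already works as expected today
        return kwargs_effort
    if reasoning == "off" or reasoning_budget == 0:
        # suggested fallback instead of "medium"
        return "low"
    # gpt-oss chat-template default
    return "medium"

assert resolve_reasoning_effort(kwargs_effort="low") == "low"  # ok today
assert resolve_reasoning_effort(reasoning="off") == "low"      # suggested fix
assert resolve_reasoning_effort(reasoning_budget=0) == "low"   # suggested fix
```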
./llama-server -hf unsloth/gpt-oss-20b-GGUF:F16 --threads -1 --parallel 1 --ctx-size 16384 --temp 1.0 --min-p 0.0 --top-p 1.0 --top-k 0 --n_predict 4096 --reasoning off --direct-io 2>&1 | grep -i reasoning
Reasoning: medium
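As a per-request workaround, the effort can also be forced from the client side, assuming the server build accepts a `chat_template_kwargs` field in the chat-completions request body (support may vary by llama.cpp version). This sketch only builds the JSON payload; it does not contact a server:

```python
import json

# Sketch of a /v1/chat/completions request body forcing low reasoning
# effort via chat_template_kwargs. Whether the server honors this field
# depends on the llama.cpp build; the model name is illustrative.
payload = {
    "model": "gpt-oss-20b",
    "messages": [{"role": "user", "content": "Hello"}],
    "chat_template_kwargs": {"reasoning_effort": "low"},
}
body = json.dumps(payload)
assert json.loads(body)["chat_template_kwargs"]["reasoning_effort"] == "low"
```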
Possibly related to #20297 (cc @pwilkin).
Operating systems
Linux
GGML backends
Vulkan
Hardware
AMD Ryzen 7 PRO 7840U
Models
No response
Problem description & steps to reproduce
use "--reasoning off" with "unsloth/gpt-oss-20b-GGUF:F16"
First Bad Commit
No response
Relevant log output