Skip to content

ggml-webgpu: Update register tiling matmul to use f32 accumulation#21644

Open
reeselevine wants to merge 3 commits intoggml-org:masterfrom
reeselevine:reg-tile-accum-fix
Open

ggml-webgpu: Update register tiling matmul to use f32 accumulation#21644
reeselevine wants to merge 3 commits intoggml-org:masterfrom
reeselevine:reg-tile-accum-fix

Conversation

@reeselevine
Copy link
Copy Markdown
Contributor

@reeselevine reeselevine commented Apr 8, 2026

Overview

Partially fixes issue here: #21602

Also fixes issue compiling with WebGPU GPU profiling turned on.

Requirements

@abhijitramesh
Copy link
Copy Markdown
Contributor

abhijitramesh commented Apr 9, 2026

The f32 accumulator fix seems to be helping us. I hooked this branch with wllama and ran the Qwen models; it seems to be working as expected on Firefox and Safari now. Chrome still seems to be broken, but it's no longer printing @@@.

Chrome output:
image

@github-actions github-actions bot added ggml changes relating to the ggml tensor library for machine learning WebGPU labels Apr 9, 2026
@abhijitramesh
Copy link
Copy Markdown
Contributor

image Now it works on Chrome as well!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning WebGPU

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants