CPU backends precisions support #18286
truecoder34 asked this question in Q&A (unanswered)
Replies: 0 comments
Hi all,
Are the accelerated CPU backends supported in ggml (zDNN, ZenDNN, BLAS, KleidiAI) only able to run inference for models in F16/BF16 and F32 precision, with any quantized models falling back to the native CPU backend?
Does this limitation apply to all CPU backends except the native CPU one?
Thank you in advance.