Replies: 1 comment
Hi @huanglsh, have you solved this issue? This behavior also looks strange to me. It would be great to check this implementation on an ARM machine that supports 512-bit SVE. Hi @xctan, it seems you developed this implementation; could you please clarify why the +2 offset is needed? Thank you in advance.
While reading through the function `ggml_vec_dot_q8_0_q8_0()` in `ggml.c/src/ggml-cpu/arch/arm/quants.c`, I noticed something in the SVE implementation that I'd like to better understand.

When `vector_length` equals 512, the following code appears:
From my understanding, the second load might have been intended to use `x1->qs` instead of `x0->qs + 2`, like this:

Could someone help clarify whether I'm misunderstanding the logic, or if both loads are indeed meant to come from the same pointer?
Thanks in advance! I’m trying to understand the data layout and SVE usage here more deeply.
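For anyone puzzling over the same `+ 2`, here is one possible reading, sketched in plain C rather than SVE intrinsics. It rests on two assumptions: that a Q8_0 block is a 2-byte fp16 scale followed by 32 `int8_t` quants (34 bytes, no padding), and that the second load is predicated on byte lanes 32..63 only. The snippet is not reproduced above, so the predicate in particular is a guess. Under those assumptions, lane `i` of a load from `x0->qs + 2` reads address `x0->qs + 2 + i`, which for `i = 32` lands on `x1->qs`. The names `block_q8_0_sketch`, `masked_load_64`, and `upper_lanes_read_next_block` are hypothetical helpers for illustration, not part of ggml.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define QK8_0 32

/* Assumed layout of a Q8_0 block: 2-byte scale + 32 int8 quants.
   uint16_t stands in for the fp16 scale type here. */
typedef struct {
    uint16_t d;
    int8_t   qs[QK8_0];
} block_q8_0_sketch;

static_assert(sizeof(block_q8_0_sketch) == 34, "expected 34-byte block");

/* Scalar model of a predicated 64-byte load: lane i reads base[i] only
   when lo <= i < hi; inactive lanes are set to zero and never touch memory. */
static void masked_load_64(int8_t dst[64], const int8_t *base, int lo, int hi) {
    for (int i = 0; i < 64; ++i) {
        dst[i] = (i >= lo && i < hi) ? base[i] : 0;
    }
}

/* Returns 1 if loading lanes 32..63 from (x0->qs + 2) yields x1->qs[0..31]. */
static int upper_lanes_read_next_block(void) {
    block_q8_0_sketch x[2];
    for (int k = 0; k < QK8_0; ++k) {
        x[0].qs[k] = (int8_t)k;        /* block 0 quants: 0..31  */
        x[1].qs[k] = (int8_t)(64 + k); /* block 1 quants: 64..95 */
    }
    x[0].d = 0;
    x[1].d = 0;

    /* Byte view of the two adjacent blocks; base models x0->qs + 2. */
    const int8_t *bytes = (const int8_t *)x;
    const int8_t *base  = bytes + offsetof(block_q8_0_sketch, qs) + 2;

    int8_t lanes[64];
    masked_load_64(lanes, base, 32, 64); /* only the upper 32 lanes are active */

    for (int k = 0; k < QK8_0; ++k) {
        if (lanes[k] != 0)                return 0; /* lower lanes stay zero */
        if (lanes[32 + k] != x[1].qs[k])  return 0; /* upper lanes = x1->qs  */
    }
    return 1;
}
```

If that reading holds, `x0->qs + 2` combined with an upper-lane predicate fetches the same 32 bytes as `x1->qs` would, just placed in lanes 32..63: the 32-lane skip plus the 2-byte offset adds up to the 34-byte block stride. Whether this matches the actual predicates in `quants.c` would still need to be confirmed against the source.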