Commit c7c6abc

ggml-cpu: add 128-bit impls for iq2_xs, iq3_s, iq3_xxs, tq2_0
Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>
1 parent 5143632 commit c7c6abc

2 files changed

Lines changed: 415 additions & 1717 deletions

ggml/src/ggml-cpu/arch-fallback.h

Lines changed: 0 additions & 1 deletion
@@ -203,7 +203,6 @@
 // repack.cpp
 #define ggml_quantize_mat_q8_0_4x1_generic ggml_quantize_mat_q8_0_4x1
 #define ggml_quantize_mat_q8_0_4x4_generic ggml_quantize_mat_q8_0_4x4
-#define ggml_quantize_mat_q8_0_4x8_generic ggml_quantize_mat_q8_0_4x8
 #define ggml_quantize_mat_q8_K_4x1_generic ggml_quantize_mat_q8_K_4x1
 #define ggml_quantize_mat_q8_K_4x4_generic ggml_quantize_mat_q8_K_4x4
 #define ggml_quantize_mat_q8_K_4x8_generic ggml_quantize_mat_q8_K_4x8
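
Note on the hunk above: each `#define X_generic X` line renames the generic reference implementation so that it is compiled under the public symbol name. Dropping the `ggml_quantize_mat_q8_0_4x8` alias presumably means this architecture block now gets that symbol from a dedicated implementation. The pattern can be sketched as follows. This is a minimal illustration, not code from ggml: `HAVE_ARCH_IMPL`, `quantize_row`, and `quantize_row_generic` are hypothetical names.

```c
/* Hypothetical stand-in for the aliasing done in arch-fallback.h.
 * HAVE_ARCH_IMPL is an assumed flag, not a real ggml macro. */
#ifndef HAVE_ARCH_IMPL
/* No arch-specific version: the generic definition below is compiled
 * under the public name quantize_row. */
#define quantize_row_generic quantize_row
#endif

/* Generic reference implementation (placeholder arithmetic). */
int quantize_row_generic(int x) {
    return x * 2;
}

#ifdef HAVE_ARCH_IMPL
/* With the alias removed, an arch-specific file has to define the public
 * symbol itself. Here it simply forwards to the generic version. */
int quantize_row(int x) {
    return quantize_row_generic(x);
}
#endif
```

Callers use `quantize_row` in both configurations. Only the source of its definition changes, which is why removing an alias without supplying a replacement definition would surface as a link error rather than a silent behavior change.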
