Commit d70a53a

authored and

committed

ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (ggml-org#12332)

* Add block interleaving support for Q4_K quantization * Remove whitespaces and fix CI/CD issues * Update pointer of bsums from int16_t to const int16_t * Add vector version of quantize_q8_K_4x8 function * Update code formatting based on review comments

1 parent 06d5f99 commit d70a53aCopy full SHA for d70a53a

1 file changed

+1493

-12

lines changed

ggml/src/ggml-cpu
- ggml-cpu-aarch64.cpp

1 file changed

+1493

-12

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit d70a53a

1 file changed

1 file changed

File tree

1 file changed

1 file changed

0 commit comments