Skip to content

v1.1.3

Choose a tag to compare

@github-actions github-actions released this 01 Apr 04:34

1.1.3 (2026-04-01)

Bug Fixes

  • compute: add Q5_0Storage B-weight handling to CPU MatMul (e7927e5)
  • compute: Q5_0 GEMV byte-wise loads for ARM64 alignment (5c7ec7a)
  • compute: skip Q4Storage in UploadWeights F32 loop (revert overaggressive skip) (2e91650)
  • compute: skip transpose reshape fast-path for square matrices (eab19d0)