Releases: l3utterfly/llama.cpp

b4219

29 Nov 10:53
266b851

sycl : Reroute permuted mul_mats through oneMKL (#10408)

This PR fixes the failing MUL_MAT tests for the SYCL backend.

b4200

27 Nov 12:46
46c69e0

ci : faster CUDA toolkit installation method and use ccache (#10537)

* ci : faster CUDA toolkit installation method and use ccache

* remove fetch-depth

* only pack CUDA runtime on master

b4098

16 Nov 07:52
772703c

vulkan: Optimize some mat-vec mul quant shaders (#10296)

Compute two result elements per workgroup (for Q{4,5}_{0,1}). This reuses
the B loads across the rows and also reuses some addressing calculations.
This required manually partially unrolling the loop, since the compiler
is less willing to unroll outer loops.
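
The row-pairing idea described above can be sketched on the CPU. This is only an illustration, not the Vulkan shader from the PR: it uses plain floats instead of quantized blocks, and the function name `matvec_two_rows` is hypothetical. It shows the outer loop unrolled by two so that each element of B is loaded once and used for both rows.

```python
def matvec_two_rows(a, b):
    """Compute the matrix-vector product a @ b, two output rows at a time.

    `a` is a list of rows (each a list of numbers), `b` is a list of numbers.
    Illustrative only: the real shader works on quantized blocks on the GPU.
    """
    n_rows = len(a)
    n_cols = len(b)
    out = [0.0] * n_rows

    # Outer loop manually unrolled by two: rows r and r + 1 are processed together.
    for r in range(0, n_rows - 1, 2):
        row0 = a[r]
        row1 = a[r + 1]
        acc0 = 0.0
        acc1 = 0.0
        for c in range(n_cols):
            bv = b[c]  # loaded once, reused for both rows
            acc0 += row0[c] * bv
            acc1 += row1[c] * bv
        out[r] = acc0
        out[r + 1] = acc1

    # Leftover row when the number of rows is odd.
    if n_rows % 2 == 1:
        last = n_rows - 1
        out[last] = sum(a[last][c] * b[c] for c in range(n_cols))

    return out
```

For example, `matvec_two_rows([[1, 2], [3, 4], [5, 6]], [1, 1])` handles rows 0 and 1 as a pair and row 2 on its own.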

Add bounds-checking on the last iteration of the loop. I think this was at
least partly broken before.

Optimize the Q4_K shader to vectorize most loads and reduce the number of
bit twiddling instructions.

b4033

05 Nov 08:45
a9e8a9a

ggml : fix arch check in bf16_to_fp32 (#10164)

b3982

27 Oct 09:02
cc2983d

sync : ggml

b3902

10 Oct 03:40
c81f3bb

cmake : do not build common library by default when standalone (#9804)

Layla v3.3.0

18 Jan 04:15

llama.cpp used in the Layla v3.3.0 release

Layla v3.2.0

16 Jan 04:04

Merge branch 'master' into layla-build

Layla v3.0.0

09 Jan 14:23
128de35

server : update readme about token probs (#4777)

* updated server readme to reflect the gg/server-token-probs-4088 commit

added explanation for the API's completion result which now includes `completion_probabilities`. Also added a JSON schema that shows the type/structure of `completion_probabilities`.

* simplified the `completion_probabilities` JSON schema

It's now easier to understand what the structure of `completion_probabilities` looks like.

* minor : fix trailing whitespace

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Layla v2.0.0

18 Dec 06:54

Merge branch 'master' into layla-build