Pull requests: ggml-org/llama.cpp

Pull requests list

[WebGPU] Implement async tensor api and event api (labels: devops, ggml, WebGPU)
#22099 opened Apr 18, 2026 by nikhilJain17 (Contributor, Draft)
[SYCL] Add Zero-Copy path with Cache Flushing for Intel UMA (Lunar Lake/Meteor Lake) (labels: ggml, SYCL)
#22098 opened Apr 18, 2026 by i-Charlys
hip: bypass memory pool for flash attention f16 temp buffers (labels: ggml, Nvidia GPU)
#22094 opened Apr 18, 2026 by TheTom (Draft)
convert : support sentence-transformer 5.4 config files (labels: python)
#22087 opened Apr 18, 2026 by Bing-su (Contributor)
[WIP]hexagon: hmx opt phase2 (labels: ggml, Hexagon)
#22086 opened Apr 18, 2026 by chraac (Contributor, Draft)
mtmd: add pos_0 to mtmd_image_tokens_get_decoder_pos (breaking change) (labels: examples, testing)
#22082 opened Apr 18, 2026 by ngxson (Contributor)
[SYCL] Update oneapi 2025.3.3, Seperate SYCL build, release Ubuntu 24 package. (labels: devops, documentation, SYCL)
#22078 opened Apr 18, 2026 by NeoZhangJianyu (Contributor)
common/autoparser : allow space after tool call (labels: testing)
#22073 opened Apr 18, 2026 by aldehir (Contributor)
sycl: Battlemage (BMG) optimizations — AOT, Q5_K reorder, PAD stride fix, new ops, oneMKL routing (labels: ggml, SYCL)
#22066 opened Apr 17, 2026 by aicss-genai
Extend LoRA hotswapping support (labels: examples, python, server)
#22061 opened Apr 17, 2026 by skiz
GGML: Allow static build with dynamic loaded backends (labels: ggml)
#22059 opened Apr 17, 2026 by ervanalb
2 tasks done
spec: save the dynamic/static ngram cache file
#22055 opened Apr 17, 2026 by petersid2022
quant: handle shared-KV layer tensors in imatrix-dependent quantization (labels: testing)
#22054 opened Apr 17, 2026 by ajfonthemove
3 tasks
CUDA: refactor mma data loading for AMD (labels: ggml, Nvidia GPU)
#22051 opened Apr 17, 2026 by JohannesGaessler (Contributor)
ggml-webgpu: reset CPU/GPU profiling time when freeing context (labels: ggml, WebGPU)
#22050 opened Apr 17, 2026 by yomaytk (Contributor)
Reduce CPU overhead in meta backend: cache subgraph splits when cgraph is unchanged (labels: ggml)
#22041 opened Apr 17, 2026 by gaugarg-nv (Contributor)
server: Skip API key verification for static files (labels: examples, server)
#22038 opened Apr 17, 2026 by roj234 (Contributor)
mtmd, llama : Update HunyuanVL vision-language model support (labels: examples, model, python)
#22037 opened Apr 17, 2026 by ManaEstras
3 tasks done
[SYCL] Fix reorder MMVQ assert on unaligned vocab sizes (labels: ggml, SYCL)
#22035 opened Apr 17, 2026 by PMZFX (Contributor)
server: log prompts to directory (labels: examples, server)
#22031 opened Apr 17, 2026 by jacekpoplawski (Contributor)
mtmd, llama, ggml : Update HunyuanVL support (labels: Apple Metal, examples, ggml, model, Nvidia GPU, OpenCL, python, SYCL, testing, Vulkan)
#22029 opened Apr 17, 2026 by ManaEstras
3 tasks done