Pull requests: ggml-org/llama.cpp

Pull requests list

[WIP] Cross Entropy Loss on Metal
#18121 opened Dec 17, 2025 by iliailmer (Draft)

arg: allow -kvu flag for llama-perplexity
#18117 opened Dec 16, 2025 by TrevorS

ci : clean up webui jobs
Labels: devops
#18116 opened Dec 16, 2025 by CISC

ggml-hexagon: swiglu_oai operation
Labels: ggml
#18114 opened Dec 16, 2025 by joeldushouyu

ggml-hexagon: Add lightweight atomic synchronization support to htp_ops_context for inter-task coordination
Labels: ggml
#18113 opened Dec 16, 2025 by ngdxzy

model : add ASR support for LFM2-Audio-1.5B (conformer)
Labels: examples, ggml, Nvidia GPU, python, testing
#18106 opened Dec 16, 2025 by ngxson

ggml-cuda: Delta-Net linear attention for Qwen3-Next
Labels: ggml, model, Nvidia GPU, testing
#18102 opened Dec 16, 2025 by hauhaut

ggml-cpu: ARM64: repack version of q8_0 (dotprod and i8mm)
Labels: ggml
#18096 opened Dec 16, 2025 by Alcpz

ggml: migrate work_data to stack allocation
Labels: ggml, testing
#18083 opened Dec 16, 2025 by GermanAizek

vulkan/cuda: fix topk_moe with exp_probs_b
Labels: ggml, Nvidia GPU, testing, Vulkan
#18071 opened Dec 15, 2025 by jeffbolznv

vulkan: support GGML_UNARY_OP_XIELU
Labels: ggml, Vulkan
#18062 opened Dec 15, 2025 by jeffbolznv

vulkan: in graph_optimize, try to group ADD operations
Labels: ggml, Vulkan
#18060 opened Dec 15, 2025 by jeffbolznv

CLI: llama-cli and llama-completion cosmetics
Labels: devops, documentation, python, script, SYCL
#18053 opened Dec 15, 2025 by andrew-aladev

vulkan: Implement set_tensor_async and the event interfaces
Labels: ggml, Vulkan
#18047 opened Dec 15, 2025 by jeffbolznv

chat-parser: handle whitespace around JSON in tool call parsing
Labels: testing
#18044 opened Dec 15, 2025 by ochafik (Draft)

convert : keep file part order from model index
Labels: python
#18043 opened Dec 14, 2025 by CISC

[Speculative decoding] feat: add EAGLE3 speculative decoding support
Labels: examples, ggml, model, python
#18039 opened Dec 14, 2025 by ichbinhandsome (Draft)

Extend run-org-model.py
Labels: examples, python
#18034 opened Dec 14, 2025 by pwilkin

vulkan: use 4 rows for scalar FA large tile size
Labels: ggml, Vulkan
#18033 opened Dec 14, 2025 by jeffbolznv