Pull requests: ggml-org/llama.cpp
Pull requests list
server: tests: fetch random media marker via /apply-template (#21962)
examples
python
python script changes
server
#21980
opened Apr 16, 2026 by
ServeurpersoCom
Contributor
cmake: fix persistent ARM CPU detection warning on Clang/macOS
ggml
changes relating to the ggml tensor library for machine learning
#21977
opened Apr 16, 2026 by
dindinw
Contributor
hexagon: add SOLVE_TRI op
ggml
changes relating to the ggml tensor library for machine learning
Hexagon
#21974
opened Apr 16, 2026 by
mengshengwu
Contributor
Draft
Use/support modern nixpkgs on Darwin (#21381)
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#21972
opened Apr 15, 2026 by
charles-dyfis-net
Contributor
model : support NVFP4 tensors for Gemma4
model
Model specific
#21971
opened Apr 15, 2026 by
CISC
Member
model: using single llm_build per arch
model
Model specific
#21970
opened Apr 15, 2026 by
ngxson
Contributor
Update CMakeLists.txt
ggml
changes relating to the ggml tensor library for machine learning
#21969
opened Apr 15, 2026 by
bjodom
feat: CUDA 10.2 / C++14 compatibility for Jetson TX2 (compute 6.2)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#21968
opened Apr 15, 2026 by
sourceupdev
Draft
added spirv-headers to nix
devops
improvements to build systems and github actions
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#21965
opened Apr 15, 2026 by
yuannan
Contributor
ggml-webgpu(shader): support conv2d kernels.
ggml
changes relating to the ggml tensor library for machine learning
WebGPU
#21964
opened Apr 15, 2026 by
Constannnnnt
Contributor
Add SVE for simd-gemm.h
ggml
changes relating to the ggml tensor library for machine learning
#21958
opened Apr 15, 2026 by
pt13762104
Contributor
metal: Implement ROLL op
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#21946
opened Apr 15, 2026 by
kushagharahi
Contributor
openvino: driver setup, CI split, thread safety, and NPU optimizations
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
OpenVINO
#21944
opened Apr 15, 2026 by
wine99
Contributor
opencl: refactor q8_0 set_tensor and mul_mat host side dispatch for Adreno
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#21938
opened Apr 15, 2026 by
lhez
Contributor
build: include libmtmd in Apple XCFramework (opt-in LLAMA_BUILD_MTMD)
build
Compilation issues
examples
#21935
opened Apr 15, 2026 by
theabecaster
5 tasks done
cmake: remove CMP0194 policy to restore MSVC builds
ggml
changes relating to the ggml tensor library for machine learning
#21934
opened Apr 15, 2026 by
texasich
Contributor
gguf-py: add type and range validation to GGUFWriter.add_key_value
python
python script changes
#21931
opened Apr 15, 2026 by
anmolg1997
nix: support unified apple-sdk
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#21928
opened Apr 14, 2026 by
kushagharahi
Contributor
fix: llama-finetune backward pass crashes
examples
ggml
changes relating to the ggml tensor library for machine learning
#21924
opened Apr 14, 2026 by
System64fumo
common : add --hf-prune-old-files (-hfp) parameter to automatically delete outdated HF files
#21923
opened Apr 14, 2026 by
Cr4xy
sycl : fused MoE mul_mat_vec_q for TG
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#21920
opened Apr 14, 2026 by
abotsis
ggml: improve SPIR-V headers detection with __has_include while preserving original _WIN32 logic
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#21918
opened Apr 14, 2026 by
EmilAskerov
Added sve tuned code for gemm_q8_0_4x8_q8_0() kernel
ggml
changes relating to the ggml tensor library for machine learning
#21916
opened Apr 14, 2026 by
hrushitfujitsu