Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ggml : check cuda and metal argsort limits and add test Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#16323 opened Sep 28, 2025 by CISC Loading…
perplexity : show more kl-divergence data examples
#16321 opened Sep 28, 2025 by ddh0 Loading…
ggml : fix dependencies for ggml_set_rows ggml changes relating to the ggml tensor library for machine learning
#16318 opened Sep 28, 2025 by ggerganov Loading…
vulkan: in flash attention, bounds check against nem1 (don't rely on GGML_KQ_MASK_PAD) ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16316 opened Sep 28, 2025 by jeffbolznv Loading…
ggml : fix unaligned access in AMX code ggml changes relating to the ggml tensor library for machine learning
#16315 opened Sep 28, 2025 by ggerganov Draft
ggml : remove SVE paths ggml changes relating to the ggml tensor library for machine learning
#16314 opened Sep 28, 2025 by ggerganov Loading…
ggml-backend : unify the dl_load_library() return type ggml changes relating to the ggml tensor library for machine learning
#16313 opened Sep 28, 2025 by haiyuewa Loading…
ggml : remove KQ mask padding ggml changes relating to the ggml tensor library for machine learning
#16309 opened Sep 28, 2025 by ggerganov Draft
1 of 3 tasks
cuda : Disable host buffers on integrated GPUs (#15034) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16308 opened Sep 28, 2025 by ai-fonsi Loading…
ci: Properly install rocwmma for hip builds devops improvements to build systems and github actions
#16305 opened Sep 28, 2025 by IMbackK Loading…
ci: update vulkan ci devops improvements to build systems and github actions
#16294 opened Sep 27, 2025 by netrunnereve Loading…
vulkan: Fix validation failure in quantized flash attention ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16292 opened Sep 27, 2025 by jeffbolznv Loading…
hip : substituted bpermute ops with swizzle ops (gfx906, maybe all AMD) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16291 opened Sep 27, 2025 by iacopPBK Loading…
Update convert_hf_to_gguf_update.py python python script changes
#16280 opened Sep 26, 2025 by cpumaxx Loading…
rpc : add support for multiple devices examples ggml changes relating to the ggml tensor library for machine learning
#16276 opened Sep 26, 2025 by rgerganov Draft
Support FP16 as intermediate results in graph computation ggml changes relating to the ggml tensor library for machine learning
#16270 opened Sep 26, 2025 by hipudding Draft
musa: update compile flags ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16265 opened Sep 26, 2025 by yeahdongcn Loading…
Correct XTC threshold args documentation
#16260 opened Sep 25, 2025 by Volko61 Loading…
Refactor llama-model.cpp
#16252 opened Sep 25, 2025 by pwilkin Loading…
CANN: Update several operators to support FP16 data format Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#16251 opened Sep 25, 2025 by hipudding Loading…
ProTip! Updated in the last three days: updated:>2025-09-25.