llama.cpp/ggml/include

Latest commit: Piotr Wilkin (cb3ef311af) — Add missing op parameters to the profiler; add support for test-backend-ops to run performance tests with exactly the tensor shapes from the run — 2026-04-13 15:29:49 +02:00
ggml-alloc.h — llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653) — 2025-12-15 09:24:59 +01:00
ggml-backend.h — ggml: backend-agnostic tensor parallelism (experimental) (#19378) — 2026-04-09 16:42:19 +02:00
ggml-blas.h
ggml-cann.h — docs : Minor cleanups (#19252) — 2026-02-02 08:38:55 +02:00
ggml-cpp.h
ggml-cpu.h — Add missing op parameters to the profiler; add support for test-backend-ops to run performance tests with exactly the tensor shapes from the run — 2026-04-13 15:29:49 +02:00
ggml-cuda.h — ggml: backend-agnostic tensor parallelism (experimental) (#19378) — 2026-04-09 16:42:19 +02:00
ggml-hexagon.h
ggml-metal.h
ggml-opencl.h
ggml-openvino.h — ggml : add OpenVINO backend (#15307) — 2026-03-14 07:56:55 +02:00
ggml-opt.h — chore : correct typos [no ci] (#20041) — 2026-03-05 08:50:21 +01:00
ggml-profiler.h — Add missing op parameters to the profiler; add support for test-backend-ops to run performance tests with exactly the tensor shapes from the run — 2026-04-13 15:29:49 +02:00
ggml-rpc.h — ggml : bump RPC version (#20330) — 2026-03-10 21:36:57 +02:00
ggml-sycl.h
ggml-virtgpu.h — ggml-virtgpu: make the code thread safe (#19204) — 2026-02-04 10:46:18 +08:00
ggml-vulkan.h
ggml-webgpu.h
ggml-zdnn.h
ggml-zendnn.h — ggml-zendnn : add ZenDNN backend for AMD CPUs (#17690) — 2025-12-07 00:13:33 +08:00
ggml.h — ggml : deprecate GGML_OP_ADD1 (#21363) — 2026-04-07 15:28:27 +03:00
gguf.h — llama: fix llama-model-saver (#20503) — 2026-03-25 12:53:16 +02:00