llama.cpp/ggml
Alberto Cabrera Pérez afc0e89698
sycl: refactor quantization to q8_1 (#14815)
* sycl: quantization to q8_1 refactor

* Refactored src1 copy logic in op_mul_mat
2025-07-28 11:05:53 +01:00
cmake           cmake : Indent ggml-config.cmake (ggml/1310)                         2025-07-28 08:15:01 +03:00
include         ggml: Add initial WebGPU backend (#14521)                            2025-07-16 18:18:51 +03:00
src             sycl: refactor quantization to q8_1 (#14815)                         2025-07-28 11:05:53 +01:00
.gitignore      vulkan : cmake integration (#8119)                                   2024-07-13 18:12:39 +02:00
CMakeLists.txt  ggml-cpu : disable GGML_NNPA by default due to instability (#14880)  2025-07-25 19:09:03 +02:00