llama.cpp/ggml/src
Judd c24f4e2688
ggml : update `ggml_rope_multi` (#12665)
* update `rope_multi`:

1. add `ggml_rope_multi_inplace`;
2. use `GGML_MROPE_SECTIONS` instead of 4.

* Apply suggestions from code review

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2025-08-13 13:45:15 +03:00
ggml-blas ggml : fix field name when new ggml_backend (#14944) 2025-08-08 14:37:22 +02:00
ggml-cann CANN: GGML_OP_CPY optimization (#15070) 2025-08-12 16:12:13 +08:00
ggml-cpu ggml : repack block_iq4_nlx8 (#14904) 2025-08-13 11:09:39 +03:00
ggml-cuda CUDA: Optimize `reduce_rows_f32` kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n (#15132) 2025-08-13 10:04:46 +02:00
ggml-hip HIP: add cmake option to enable compiler output of kernel resource usage metrics (#15103) 2025-08-07 16:44:14 +02:00
ggml-metal llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
ggml-musa musa: upgrade musa sdk to rc4.2.0 (#14498) 2025-07-24 20:05:37 +01:00
ggml-opencl opencl: allow mixed f16/f32 `add` (#15140) 2025-08-12 02:42:41 -07:00
ggml-rpc ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others) (#15188) 2025-08-13 08:54:30 +03:00
ggml-sycl sycl: Fix and disable more configurations of mul_mat (#15151) 2025-08-12 13:58:22 +02:00
ggml-vulkan ggml : fix field name when new ggml_backend (#14944) 2025-08-08 14:37:22 +02:00
ggml-webgpu ggml: Add basic SET_ROWS support in WebGPU (#15137) 2025-08-06 15:14:40 -07:00
CMakeLists.txt cmake: Add GGML_BACKEND_DIR option (#15074) 2025-08-04 21:29:14 +02:00
ggml-alloc.c llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
ggml-backend-impl.h ggml : upgrade init_tensor API to return a ggml_status (#11854) 2025-02-28 14:41:47 +01:00
ggml-backend-reg.cpp cmake: Add GGML_BACKEND_DIR option (#15074) 2025-08-04 21:29:14 +02:00
ggml-backend.cpp ggml : fix fallback to CPU for unsupported ops (#15118) 2025-08-06 14:37:35 +02:00
ggml-common.h llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
ggml-impl.h llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
ggml-opt.cpp mnist: fix segmentation fault (ggml/1227) 2025-05-19 13:29:56 +03:00
ggml-quants.c gguf-py : add Numpy MXFP4 de/quantization support (#15111) 2025-08-08 17:48:26 -04:00
ggml-quants.h llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
ggml-threading.cpp ggml : build backends as libraries (#10256) 2024-11-14 18:04:35 +01:00
ggml-threading.h remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 2024-12-12 19:02:49 +01:00
ggml.c ggml : update `ggml_rope_multi` (#12665) 2025-08-13 13:45:15 +03:00
ggml.cpp ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 2025-06-01 13:43:57 +03:00
gguf.cpp ggml : prevent integer overflow in gguf tensor size calculation (#14595) 2025-07-09 14:33:53 +02:00