llama.cpp

Nikhil Jain 57487a64c8 [WebGPU] Plug memory leaks and free resources on shutdown (#19315)	2026-02-10 08:04:00 -08:00

* Fix memory leaks in shader lib, backend, backend_context, buffer_context, and webgpu_buf_pool
* Free pools
* Cleanup
* More cleanup
* Run clang-format
* Fix arg-parser and tokenizer test errors that free an unallocated buffer
* Fix device lost callback to not print on device teardown
* Fix include and run clang-format
* remove unused unused
* Update binary ops

Co-authored-by: Reese Levine <reeselevine1@gmail.com>
wgsl-shaders	ggml-webgpu: JIT compile binary operators and handle binding overlaps (#19310)	2026-02-06 10:33:30 -08:00
CMakeLists.txt	ggml webgpu: add support for emscripten builds (#17184)	2025-12-03 10:25:34 +01:00
ggml-webgpu-shader-lib.hpp	[WebGPU] Plug memory leaks and free resources on shutdown (#19315)	2026-02-10 08:04:00 -08:00
ggml-webgpu.cpp	[WebGPU] Plug memory leaks and free resources on shutdown (#19315)	2026-02-10 08:04:00 -08:00
pre_wgsl.hpp	ggml webgpu: initial flashattention implementation (#18610)	2026-01-08 08:23:39 -08:00