llama.cpp/ggml/src/ggml-webgpu
Nikhil Jain 57487a64c8
[WebGPU] Plug memory leaks and free resources on shutdown (#19315)

* Fix memory leaks in shader lib, backend, backend_context, buffer_context, and webgpu_buf_pool

* Free pools

* Cleanup

* More cleanup

* Run clang-format

* Fix arg-parser and tokenizer test errors that free an unallocated buffer

* Fix device lost callback to not print on device teardown

* Fix include and run clang-format

* remove unused unused

* Update binary ops

---------

Co-authored-by: Reese Levine <reeselevine1@gmail.com>
2026-02-10 08:04:00 -08:00
Name                        Last commit                                                                     Date
wgsl-shaders                ggml-webgpu: JIT compile binary operators and handle binding overlaps (#19310)  2026-02-06 10:33:30 -08:00
CMakeLists.txt              ggml webgpu: add support for emscripten builds (#17184)                         2025-12-03 10:25:34 +01:00
ggml-webgpu-shader-lib.hpp  [WebGPU] Plug memory leaks and free resources on shutdown (#19315)              2026-02-10 08:04:00 -08:00
ggml-webgpu.cpp             [WebGPU] Plug memory leaks and free resources on shutdown (#19315)              2026-02-10 08:04:00 -08:00
pre_wgsl.hpp                ggml webgpu: initial flashattention implementation (#18610)                     2026-01-08 08:23:39 -08:00