llama.cpp/.github
Latest commit 74b8fc17f9 by Reese Levine:
ggml webgpu: profiling, CI updates, reworking of command submission (#16452)
* Add profiling
* More detailed profiling
* Rework command submission to avoid global locks
* Update wait handling
* Try a new method of waiting on futures (see the futures sketch below)
* Add serializing of command submission in some cases (see the serialized-submit sketch below)
* Add new pool for timestamp queries and clean up logging (see the timestamp-pool sketch below)
* Serialize command submission in CI and leave a TODO note
* Update WebGPU CI
* Add myself as WebGPU codeowner
* Deadlock avoidance
* Leave WebGPU/Vulkan CI serialized
* Fix divide by 0
* Fix logic in division by inflight_threads (see the guard sketch below)
* Update CODEOWNERS and remove the serialize-submit option
Committed 2025-10-07 13:48:56 -07:00
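The commit messages above are terse, so a few sketches of what they plausibly refer to follow. "Waiting on futures" points at the webgpu.h futures API; a minimal sketch of blocking on a batch of futures with `wgpuInstanceWaitAny`, assuming that API, might look like this (`wait_all` is a hypothetical helper, not a ggml-webgpu symbol, and struct layouts vary across Dawn versions):

```cpp
// A minimal sketch, assuming the webgpu.h futures API: collect one
// WGPUFuture per asynchronous operation, then block until all complete.
#include <webgpu/webgpu.h>
#include <cstdint>
#include <vector>

void wait_all(WGPUInstance instance, const std::vector<WGPUFuture> & futures) {
    std::vector<WGPUFutureWaitInfo> infos;
    infos.reserve(futures.size());
    for (WGPUFuture f : futures) {
        infos.push_back(WGPUFutureWaitInfo{ f, /*completed=*/false });
    }
    while (!infos.empty()) {
        // Blocks until at least one future completes (UINT64_MAX = no timeout;
        // timed waits require the instance to enable that feature).
        wgpuInstanceWaitAny(instance, infos.size(), infos.data(), UINT64_MAX);
        // Keep waiting only on the futures that are still pending.
        std::erase_if(infos, [](const WGPUFutureWaitInfo & w) { return w.completed; });
    }
}
```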
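The serialized-submit commits suggest an opt-in guard around queue submission; note the final commit removed the option again, with CI kept serialized separately. A minimal sketch, with illustrative names (`serialize_submit`, `submit_command_buffer`):

```cpp
// A minimal sketch of optionally serialized command submission: when the
// flag is set (e.g. on CI runners), only one thread submits at a time.
#include <webgpu/webgpu.h>
#include <mutex>

static std::mutex submit_mutex;
static bool serialize_submit = false;

void submit_command_buffer(WGPUQueue queue, WGPUCommandBuffer cmd) {
    if (serialize_submit) {
        std::lock_guard<std::mutex> lock(submit_mutex);
        wgpuQueueSubmit(queue, 1, &cmd);
    } else {
        wgpuQueueSubmit(queue, 1, &cmd);  // concurrent path, no global lock
    }
}
```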
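For the "new pool for timestamp queries", here is a minimal sketch of a pool of timestamp query sets for profiling compute passes, assuming the webgpu.h C API; `TimestampPool` and its members are hypothetical names, not the actual ggml-webgpu symbols:

```cpp
// A minimal sketch: recycle small timestamp query sets instead of creating
// one per profiled pass.
#include <webgpu/webgpu.h>
#include <vector>

struct TimestampPool {
    WGPUDevice device = nullptr;
    std::vector<WGPUQuerySet> free_sets;  // idle query sets, two queries each

    // Acquire a begin/end query-set pair, creating one if the pool is empty.
    WGPUQuerySet acquire() {
        if (!free_sets.empty()) {
            WGPUQuerySet qs = free_sets.back();
            free_sets.pop_back();
            return qs;
        }
        WGPUQuerySetDescriptor desc = {};
        desc.type  = WGPUQueryType_Timestamp;
        desc.count = 2;  // one timestamp at pass begin, one at pass end
        return wgpuDeviceCreateQuerySet(device, &desc);
    }

    // Return a query set once its results have been resolved and read back.
    void release(WGPUQuerySet qs) { free_sets.push_back(qs); }
};
```

In webgpu.h, a query set from such a pool would typically be attached to a compute pass via the `timestampWrites` member of `WGPUComputePassDescriptor` and resolved into a GPU buffer with `wgpuCommandEncoderResolveQuerySet` before readback.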
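Finally, the "Fix divide by 0" / "Fix logic in division by inflight_threads" commits imply a clamped-divisor guard along these lines; `inflight_threads` comes from the commit message, while `max_submissions` and the function name are assumptions:

```cpp
// A minimal sketch: clamp the divisor so an idle backend (zero in-flight
// threads) never divides by zero.
#include <algorithm>
#include <atomic>

static std::atomic<int> inflight_threads{0};

int submissions_per_thread(int max_submissions) {
    int threads = std::max(inflight_threads.load(), 1);  // never divide by 0
    return max_submissions / threads;
}
```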
Directory contents:

| Name | Latest commit | Date |
| --- | --- | --- |
| ISSUE_TEMPLATE | ggml: initial IBM zDNN backend (#14975) | 2025-08-15 21:11:22 +08:00 |
| actions | ci : refactor sdk caching to minimize storage (#16414) | 2025-10-06 17:40:21 +02:00 |
| workflows | ggml webgpu: profiling, CI updates, reworking of command submission (#16452) | 2025-10-07 13:48:56 -07:00 |
| copilot-instructions.md | ci : add copilot-instructions.md (#15286) | 2025-08-21 11:47:52 +02:00 |
| labeler.yml | ggml: initial IBM zDNN backend (#14975) | 2025-08-15 21:11:22 +08:00 |
| pull_request_template.md | repo : update links to new url (#11886) | 2025-02-15 16:40:57 +02:00 |