|
openvino
|
kq_mask naming fix
|
2026-01-15 14:38:53 -08:00 |
|
CMakeLists.txt
|
Use shared_buffer for GPU NPU; Refactor
|
2026-01-15 11:39:08 -08:00 |
|
ggml-decoder.cpp
|
kq_mask naming fix
|
2026-01-15 14:38:53 -08:00 |
|
ggml-decoder.h
|
Initial stateful graph support
|
2026-01-15 11:39:08 -08:00 |
|
ggml-openvino.cpp
|
Fix --direct-io 0
|
2026-02-11 10:15:09 +08:00 |
|
ggml-quants.hpp
|
NPU always requant to q4_0_128
|
2026-01-15 11:39:08 -08:00 |
|
utils.cpp
|
Fix llama-bench -p -n where p<=256
|
2026-02-11 10:15:09 +08:00 |
|
utils.h
|
Fix llama-bench -p -n where p<=256
|
2026-02-11 10:15:09 +08:00 |