Update OPENVINO.md
commit d3649c11cb (parent e9ed5c4cb6)
@@ -108,8 +108,6 @@ GGML_OPENVINO_DEVICE=GPU ./llama-bench -fa 1
### NPU Notes

- Smaller context sizes are recommended (e.g. `-c 512`); see the example after this list
- Static compilation mode is enabled automatically
- Model caching is not yet supported
- Does not support `llama-server` with `-np` > 1 (multiple parallel sequences)
- `llama-perplexity` only supports batch sizes of 512 or smaller (`-b 512`)
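
For reference, a minimal sketch of invocations that stay within these limits. The `NPU` value for `GGML_OPENVINO_DEVICE` and the model/data paths are assumptions for illustration, not taken from this document:

```sh
# Generation on the NPU with a small context size
# (GGML_OPENVINO_DEVICE=NPU and model.gguf are assumed placeholder values).
GGML_OPENVINO_DEVICE=NPU ./llama-cli -m model.gguf -c 512 -p "Hello"

# Perplexity on the NPU must keep the batch size at 512 or smaller.
GGML_OPENVINO_DEVICE=NPU ./llama-perplexity -m model.gguf -f wiki.test.raw -b 512 -c 512
```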