The markdown coalescing loop was processing chunks back-to-back without
yielding to the browser's paint cycle. At high token rates (250+ tok/s),
this caused a complete UI freeze because the main thread was perpetually busy.
Add a requestAnimationFrame yield between processing batches. This allows
the browser to paint at screen FPS regardless of token throughput. Chunks
arriving during the yield are coalesced and processed together, so we
skip intermediate states and jump straight to the latest content.
Before: Chunk->process->Chunk->process->... (browser never paints = freeze)
After: Chunk->process->[RAF]->coalesced chunks->process->[RAF]->... (screen FPS)
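A minimal TypeScript sketch of this yielding loop, for illustration only: `enqueueChunk`, `drain`, `rafYield`, and `processChunks` are made-up names, not the identifiers used in the actual code.

```ts
// `processChunks` stands in for the real markdown processing step.
declare function processChunks(batch: string): void;

/** Resolve on the next animation frame so the browser can paint. */
function rafYield(): Promise<number> {
  return new Promise((resolve) => requestAnimationFrame(resolve));
}

const pending: string[] = [];
let running = false;

/** Called for every incoming streaming chunk. */
export function enqueueChunk(chunk: string): void {
  pending.push(chunk);
  if (!running) void drain();
}

async function drain(): Promise<void> {
  running = true;
  while (pending.length > 0) {
    // Coalesce everything that arrived since the last frame and process
    // it as a single batch, skipping intermediate states.
    const batch = pending.splice(0, pending.length).join('');
    processChunks(batch);

    // Yield to the paint cycle; chunks arriving meanwhile queue up in
    // `pending` and are coalesced on the next iteration.
    await rafYield();
  }
  running = false;
}
```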
Tested with 250 tok/s streams on 50K+ token contexts: smooth scrolling
and responsive UI throughout.
Replace full AST re-transformation with a per-block caching strategy.
Previously, each streaming chunk triggered processor.run() on the entire
document (12 rehype/remark plugins including KaTeX and highlight.js).
Now each MDAST node is transformed individually and the result is cached by
a position hash. In append-only streaming mode, stable blocks are reused
directly from the cache; only the unstable trailing block is re-transformed.
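A hedged sketch of the cache lookup path: the helper names `getMdastNodeHash` and `transformMdastNode` come from the change itself, but their signatures, the `blockCache` map, and the `runPlugins` callback are assumptions for illustration (the `isAppendMode()` check that gates this path is omitted here).

```ts
import type { RootContent } from 'mdast';

// Illustrative cache keyed by a position-based fingerprint of each node.
const blockCache = new Map<string, unknown>();

/** Fingerprint an MDAST node by its type and source offsets. */
function getMdastNodeHash(node: RootContent): string {
  const start = node.position?.start.offset ?? -1;
  const end = node.position?.end.offset ?? -1;
  return `${node.type}:${start}:${end}`;
}

/**
 * Transform a single node, reusing the cached result when available.
 * Stable blocks keep their offsets while streaming appends text, so they
 * hit the cache; the growing trailing block gets a new hash and misses.
 */
function transformMdastNode(
  node: RootContent,
  runPlugins: (node: RootContent) => unknown
): unknown {
  const key = getMdastNodeHash(node);
  const cached = blockCache.get(key);
  if (cached !== undefined) return cached;

  const result = runPlugins(node);
  blockCache.set(key, result);
  return result;
}
```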
- Add SvelteMap FIFO cache (5000 blocks, evicts oldest 1000 on overflow; see the eviction sketch after this list)
- Add getMdastNodeHash() for MDAST node fingerprinting by position
- Add isAppendMode() to detect streaming append patterns
- Add transformMdastNode() for single-node transformation with cache lookup
- Remove stringifyProcessedNode() (dead code after refactor)
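A sketch of the FIFO eviction policy, assuming Svelte 5's SvelteMap from svelte/reactivity (which preserves insertion order like a plain Map); the limits are the ones stated above, while `cacheSet` and the cache instance are illustrative names.

```ts
import { SvelteMap } from 'svelte/reactivity';

// Insertion order doubles as age order, so evicting "oldest first" is
// just deleting the first N keys.
const MAX_BLOCKS = 5000;
const EVICT_COUNT = 1000;

const cache = new SvelteMap<string, unknown>();

function cacheSet(key: string, value: unknown): void {
  cache.set(key, value);
  if (cache.size > MAX_BLOCKS) {
    // Evict the 1000 oldest entries in one pass on overflow.
    let removed = 0;
    for (const oldestKey of cache.keys()) {
      cache.delete(oldestKey);
      if (++removed >= EVICT_COUNT) break;
    }
  }
}
```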
Reduces per-chunk streaming cost from O(N) block transforms to O(1) for stable blocks.
Targets 200K token contexts without UI degradation on mobile devices.