* llama: automatically fit args to free memory
llama-fit-params tool
* fix CI
* hints for bug reports, ensure no reallocation
* fix segfault with Vulkan
* add llama-fit-params to CI
* fix CI
* fix CI
* fix CI
* minor adjustments
* fix assignment of 1 dense layer
* fix logger not being reset on model load failure
* remove --n-gpu-layers hint on model load failure
* fix llama-fit-params verbosity
* fix edge case
* fix typo [no ci]
- Add mBART encoder/decoder architecture for text generation
- Implement Swin Transformer for vision encoding
- Add cross-attention support for multimodal fusion
- Create conversion scripts for facebook/nougat-base model
- Add nougat-cli tool for document OCR processing
- Support multiple output formats (markdown, LaTeX, plain text)
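
For reviewers less familiar with the cross-attention item above, here is a minimal sketch of single-head scaled dot-product cross-attention, where decoder states (queries) attend over encoder states (keys/values). It is not the code added in this change: the function names are hypothetical, and learned projections, multiple heads, and masking are left out for brevity.

```python
import math


def softmax(xs):
    # Subtract the max before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Single-head scaled dot-product cross-attention (illustrative only).

    queries: decoder states, shape [n_q][d]
    keys:    encoder states, shape [n_kv][d]
    values:  encoder states, shape [n_kv][d_v]
    Returns one attended vector of length d_v per query.
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    d_v = len(values[0])
    out = []
    for q in queries:
        # Similarity of this decoder position to every encoder position.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights = softmax(scores)
        # Weighted sum of encoder values.
        out.append([sum(w * v[j] for w, v in zip(weights, values)) for j in range(d_v)])
    return out
```

In an encoder/decoder OCR model, the queries would come from the text decoder and the keys/values from the vision encoder output, which is how image features reach the generated text.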
Co-Authored-By: Claude <noreply@anthropic.com>