Commit Graph

1 Commits

Author SHA1 Message Date
h9-tec d3aea508a1 models : add Nougat OCR support with mBART and Swin Transformer
- Add mBART encoder/decoder architecture for text generation
- Implement Swin Transformer for vision encoding
- Add cross-attention support for multimodal fusion
- Create conversion scripts for facebook/nougat-base model
- Add nougat-cli tool for document OCR processing
- Support multiple output formats (markdown, LaTeX, plain text)

🤖 Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-20 10:34:12 +02:00