o7si
|
d0a6a31470
|
model : add support for JinaBertModel with non-gated ffn (#18475)
* WIP: Initial commit for fixing JinaBert original FF type support
* convert: add jina-v2-de tokenizer variant for German_Semantic_V3
* convert: fix token collision in BERT phantom vocab conversion
* convert: add feed_forward_type metadata
* model: add feed_forward_type metadata for jina-bert-v2
* model: jina-bert-v2 support standard GELU FFN variant
* model: remove ffn_type, detect FFN variant from tensor dimensions
* Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* Update src/models/bert.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* Update src/models/bert.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* revert collision fix to be handled in separate PR
---------
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2026-01-01 18:38:51 +01:00 |