model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801)
This commit updates the modelcard.template file used in the model conversion scripts for embedding models to include the llama-server --embeddings flag in the recommended command to run the model. The motivation for this change was that when using the model-conversion "tool" to upload the EmbeddingGemma models to Hugging Face this flag was missing and the embedding endpoint was there for not available when copy&pasting the command.
This commit is contained in:
parent
4fd1242bef
commit
5d6688de08
|
|
@ -7,7 +7,7 @@ base_model:
|
||||||
Recommended way to run this model:
|
Recommended way to run this model:
|
||||||
|
|
||||||
```sh
|
```sh
|
||||||
llama-server -hf {namespace}/{model_name}-GGUF
|
llama-server -hf {namespace}/{model_name}-GGUF --embeddings
|
||||||
```
|
```
|
||||||
|
|
||||||
Then the endpoint can be accessed at http://localhost:8080/embedding, for
|
Then the endpoint can be accessed at http://localhost:8080/embedding, for
|
||||||
|
|
|
||||||
Loading…
Reference in New Issue