llama.cpp

Commit Graph

Author	SHA1	Message	Date
HanishKVC	006a398ebf	ChatON:DeepSeekCoder: Update tmplid and wrt detailed meta json	2024-05-06 11:27:56 +05:30
HanishKVC	1b2e921186	ChatON:DeepSeek: Update support wrt detailed meta json	2024-05-06 11:27:56 +05:30
HanishKVC	403a6c4323	ChatON:Gemma: update for detailed meta json. Also as part of same add user role entry for system role also.	2024-05-06 11:27:56 +05:30
HanishKVC	b9e31304a5	ChatON: Update to new detailed format wrt llama2 and llama3. Wrt llama2: add bos wrt llama2 system and user begins, but not assistant; split system suffix into suffix and end, and add systemuser-system flags so that end can be avoided wrt system+user message combo; add eos wrt assistant end; with these potentially this should work with main and server flows. Wrt llama3: add empty begin, end fields and systemuser-system flags; this should potentially work with main and server flows.	2024-05-06 11:27:56 +05:30
HanishKVC	6b23f15ffe	ChatON:ChatOnMetaJSon: Add suffix wrt assistant messages	2024-05-06 11:27:56 +05:30
HanishKVC	f1f39c5256	ChatON:Add Monarch model template, which uses Begin + Prefix. Inturn Begin/BoS is added only for non 1st user messages in a system+user prompts chain.	2024-05-06 11:27:56 +05:30
HanishKVC	0f713d4c4f	ChatOn: meta json update wrt the new begin related fields	2024-05-06 11:27:56 +05:30
HanishKVC	84367b9fd1	ChatON: Add template for DeepSeek. Was looking at the tokenized vector, and noticed that the EOS mentioned by existing chat_apply_template of llama.cpp, is different from what I noticed in tokenizer_config.json of deepseek llm, so I have added two entries: "deepseek-alt", which matches llama.cpp's chat_apply_template, and "deepseek", which matches that in tokenizer_config.json. This impacts the assistant suffix and reverse prompt entries. CasOfThis: Need to look into other entries which I added previously at a later time. However as the default logic should be picking the EOS from model file, so I assume reverse-prompt being outofsync, may not matter beyond a limit, potentially.	2024-05-06 11:27:56 +05:30
HanishKVC	f4b54069f6	ChatON: Add template for Gemma	2024-05-06 11:27:56 +05:30
HanishKVC	2a8028fba8	ChatON: Add Zephyr template to meta-json file	2024-05-06 11:27:56 +05:30
HanishKVC	221ccd6462	ChatOn: Add SystemUser-1st-User-Has-Prefix flag support. Llama2 seems to need it, so chaton-meta-json sample file updated to use same.	2024-05-06 11:27:56 +05:30
HanishKVC	c4cf0e9075	ChatON:Cleanup: BeginEnd, Debug log. Update the note. Rename global-prefix|suffix to global-begin|end. Rename chat-apply-template to chat-apply-template-single, cas it handles only a single message. Add some debug log messages to the helper functions.	2024-05-06 11:27:56 +05:30
HanishKVC	d87d27512e	ChatOn: update sample meta json a bit. Move [inst] [/inst] wrt llama2 from global to individual role specific parts. Avoid an extra \n wrt prefixes of llama3.	2024-05-06 11:27:55 +05:30
HanishKVC	cdbe4f06ce	Chaton:Sample Meta JSON cleanup	2024-05-06 11:27:55 +05:30
HanishKVC	1374a64200	Chaton:Meta: Add chatml meta data to sample meta json file	2024-05-06 11:27:55 +05:30
HanishKVC	093abc29a2	ChatOn: Update sample meta json to be a valid json	2024-05-06 11:27:55 +05:30
HanishKVC	dc56be951d	ChatOn:Main: Load and dump any specified chaton meta file	2024-05-06 11:27:55 +05:30

17 Commits