Commit Graph

156 Commits

Author SHA1 Message Date
Christian Zhou-Zheng 05b183fe7b compatibility fix 2024-06-10 14:00:13 -04:00
Christian Zhou-Zheng 854bd64a5d
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 13:55:08 -04:00
Christian Zhou-Zheng b843445827
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 13:54:41 -04:00
Christian Zhou-Zheng efa06098a6 swap bar orders 2024-06-10 07:58:17 -04:00
Christian Zhou-Zheng 4550826871
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 07:55:24 -04:00
Christian Zhou-Zheng c1b1a29266
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 07:55:01 -04:00
Christian Zhou-Zheng ad02c9409a
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 07:54:50 -04:00
Christian Zhou-Zheng 99f9a24805
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 07:54:18 -04:00
Christian Zhou-Zheng 7eea552db8
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-10 07:54:06 -04:00
Christian Zhou-Zheng 79bd2bfcb0 catch oversights 2024-06-09 20:22:17 -04:00
Christian Zhou-Zheng f7e7983946
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-09 20:17:25 -04:00
Christian Zhou-Zheng 1e2d9cb589 progress bar, fix split logic 2024-06-09 17:31:25 -04:00
Christian Zhou-Zheng 70a6bc91cc
Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
2024-06-09 17:08:11 -04:00
Christian Zhou-Zheng 0417104397 fix linting 2024-06-09 16:05:08 -04:00
Christian Zhou-Zheng 9d7f694438 fix typing and clean up 2024-06-09 16:02:23 -04:00
Christian Zhou-Zheng f7ecd99691 appease linter 2024-06-09 13:09:05 -04:00
Christian Zhou-Zheng 5a96b8f27f remove SplitStrategy, SplitArguments 2024-06-09 13:08:06 -04:00
Christian Zhou-Zheng 0471f67f4f cleanup round 1 2024-06-09 12:40:02 -04:00
Christian Zhou-Zheng a234bf821b fix linting 2024-06-09 11:23:55 -04:00
Christian Zhou-Zheng 0779f2f74f tidy up 2024-06-09 11:20:14 -04:00
Christian Zhou-Zheng ba1be979eb fix ti data messiness 2024-06-09 11:10:33 -04:00
Christian Zhou-Zheng ff2dd7d30d try to refactor kv data (still fails) 2024-06-09 10:29:47 -04:00
Christian Zhou-Zheng 97dd416903 kv/ti data are still wrong 2024-06-09 00:34:36 -04:00
Christian Zhou-Zheng 03cc9bcbe8 use simplification from #7827 2024-06-08 23:14:26 -04:00
Christian Zhou-Zheng 666bb097a2 Merge branch 'master' into convert-split 2024-06-08 23:06:18 -04:00
Christian Zhou-Zheng 282e71fb39 edit cmd line args 2024-06-08 23:00:42 -04:00
compilade ed9f252118
gguf-py : decouple adding metadata from writing in GGUFWriter (#7827)
The main change in this PR is to consolidate GGUFWriter.add_key and GGUFWriter.add_val into GGUFWriter.add_key_value.

In addition, use_temp_file is now opt-in instead of opt-out, defaulting to False.

Also, GGUFWriter no longer requires the output file name until it actually writes to the file.

And GGUFWriter doesn't really need to eagerly prepare the data layout of the metadata.
2024-06-09 12:34:29 +10:00
Christian Zhou-Zheng 02be0dd654 attempt 3 to appease the linter 2024-06-07 21:26:40 -04:00
Christian Zhou-Zheng 891b19cb81 attempt 2 to appease the linter 2024-06-07 21:20:46 -04:00
Christian Zhou-Zheng 2e70fa1055 attempt to appease the linter 2024-06-07 21:18:30 -04:00
Christian Zhou-Zheng dc5cf5fd82
Update gguf-py/gguf/gguf_writer_split.py
Co-authored-by: compilade <git@compilade.net>
2024-06-07 17:26:30 -04:00
Christian Zhou-Zheng 1312e287ec
Update gguf-py/gguf/constants.py
Co-authored-by: compilade <git@compilade.net>
2024-06-07 17:10:51 -04:00
Christian Zhou-Zheng 6d3a256d1d rename GGUFManager to GGUFWriterSplit 2024-06-07 09:12:44 -04:00
Christian Zhou-Zheng 13ffe22ca7 base-1024 bytes to base-1000 2024-06-06 10:24:11 -04:00
Christian Zhou-Zheng 83e4a3f5cc make pathlib explicit 2024-06-06 09:00:59 -04:00
Christian Zhou-Zheng 2037eabb64 move kv keys to constants.py 2024-06-06 08:49:46 -04:00
Christian Zhou-Zheng 1cbab22225 type consistency in format_n_bytes_to_str 2024-06-06 08:43:26 -04:00
Christian Zhou-Zheng 3328b0a991 Shard dataclass and un-negative dont_add_architecture 2024-06-06 08:37:35 -04:00
Christian Zhou-Zheng 6a05183b97
GGUFWriter compatibility fix
Co-authored-by: compilade <git@compilade.net>
2024-06-06 08:28:10 -04:00
Joan Fontanals f5d7b268ec
llama : add jina v2 base code (#7596)
* feat: add changes to handle jina v2 base code

* fix: do not complicate things

* fix: fix the usage of the code model

* fix: fix comments

* fix: fix linting issues

* fix: remove ollama patches

* style : minor

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2024-06-06 10:22:41 +03:00
Christian Zhou-Zheng ce7e6985d2 form shards while adding tensors, SHA256 sums agree with master 2024-06-05 18:29:39 -04:00
Christian Zhou-Zheng 5ad397d610 reduce diffs with master 2024-06-05 13:49:20 -04:00
Christian Zhou-Zheng bb5ee02096 simplify even further and standardize with GGUFWriter 2024-06-05 12:49:08 -04:00
Christian Zhou-Zheng f6fd3ea4e9 further simplify GGUFManager 2024-06-05 12:28:40 -04:00
Christian Zhou-Zheng 3e9430df33 reduce duplicated code from gguf_writer 2024-06-05 09:29:33 -04:00
Christian Zhou-Zheng efead0408c fix gguf_writer placement and remove comments 2024-06-03 19:34:01 -04:00
Christian Zhou-Zheng 140eb52f3f Merge branch 'master' into convert-split 2024-06-03 09:07:23 -04:00
Christian Zhou-Zheng 240243e63f remove unnecessary imports in gguf_manager 2024-06-03 09:01:42 -04:00
Christian Zhou-Zheng 09baf2f3b5 fix Q8 quantization 2024-06-03 08:58:29 -04:00
zhangkaihuo 6f28a333c1
llama : MiniCPM support tied embeddings (#7664)
* support lm_head

* remove the code block

---------

Co-authored-by: zhangkaihuo <zhangkaihuo@modelbest.cn>
2024-06-03 10:49:30 +03:00