Commit Graph

851 Commits

Author SHA1 Message Date
austinvhuang 0fc80fad05 libgemma refactor - review changes 2024-03-10 12:55:08 -04:00
austinvhuang cc5c24c4f8 remove app.h dependency + fix bazel build 2024-03-08 18:06:43 -05:00
austinvhuang 8c7b2cf61b add README, license to hello_world 2024-03-08 17:59:54 -05:00
austinvhuang 571a5449c4 update commit hash for gemma lib 2024-03-08 17:33:33 -05:00
austinvhuang 03147effbd update loader arg names: cache -> compressed_weights, model -> weights 2024-03-08 17:32:36 -05:00
austinvhuang dfd2fdc1dd Decouple gemma constructor from loader args, update hello_world example, add convenience version of constructor (no uncompressed weights) 2024-03-08 17:26:03 -05:00
austinvhuang 42e53e2da8 [WIP] simplify hello world example, add convenience function. TODO: update git hash in CMakeLists.txt of hello world after push 2024-03-08 14:56:22 -05:00
Jan Wassenberg fce5c8c967 Avoid fadvise on older Android. Fixes #84 (PiperOrigin-RevId: 613815953) 2024-03-07 22:19:22 -08:00
austinvhuang b67e28d1a0 [WIP] remove args from GetWeights, GetCompressedWeights 2024-03-08 00:00:11 -05:00
Phil Culliton 2161908f50 Added 7B support and args parsing. Still todo: more testing of 7B conversion. 2024-03-07 22:34:14 +00:00
Austin Huang 3df06f64c2 Merge pull request #81 from ufownl/feature/separated_kvcache: Separate KV cache from GemmaImpl 2024-03-07 10:10:11 -05:00
RangerUFO 170a9b4690 Make `CreateKVCache` a free function rather than a method 2024-03-07 15:52:55 +08:00
RangerUFO b841612e8c Separate KV cache from GemmaImpl 2024-03-07 15:47:31 +08:00
austinvhuang 6c0388e049 [WIP] refine Runtime struct definition 2024-03-07 01:14:07 -05:00
austinvhuang 49e654258d [WIP] clean up hello_world #includes and CMakeLists.txt 2024-03-07 01:04:25 -05:00
austinvhuang e781007836 [WIP] Remove InferenceArgs from hello_world example, fix ordering of LoaderArgs validation, revert ReplGemma EOT token behavior 2024-03-06 23:21:13 -05:00
austinvhuang 7042316013 [WIP] update GemmaInterface, Gemma, and Generate input parameter specs to remove InferenceArgs. TODO: update hello_world example after git commit hash is available for fetching 2024-03-06 22:22:59 -05:00
austinvhuang 0f6a4b49d5 [WIP] quality tweaks - for constants, defer float cast and use double for intermediate computations, add `model` to EOT token 2024-03-06 15:34:11 -05:00
austinvhuang 5b9d8a9936 [WIP] dev/examples branch merge 2024-03-06 15:10:48 -05:00
austinvhuang 10f7a086aa [WIP] decouple GemmaImpl from CLI args 2024-03-06 15:06:41 -05:00
Jan Wassenberg 3cdd5e524a Fix loop iteration in GeluMulToBF16. Also attempt to speed up builders (parallel) (PiperOrigin-RevId: 613092863) 2024-03-05 23:00:47 -08:00
enum-class 843d9b0e1f fix for-loop bounderies 2024-03-06 08:25:50 +08:00
Copybara-Service c8b9675898 Merge pull request #74 from osanseviero:patch-1 (PiperOrigin-RevId: 612937722) 2024-03-05 12:49:09 -08:00
Phil Culliton c93e1a1e4d Resolved layer ordering, reshaping, MQA->MHA, and quantization. Works only for 2B. 2024-03-05 17:54:55 +00:00
enum-class bc845515b7 fix style, add kCamelCase style for constexpr in clang-tidy 2024-03-05 20:45:30 +08:00
enum-class 5f016fb433 use hwy/simd for RMSNorm(f, bf, f) calculation 2024-03-05 17:53:52 +08:00
enum-class 507d64e3e6 use hwy/simd for SquaredL2 calculation 2024-03-05 17:37:09 +08:00
Jan Wassenberg bb9b023502 Support Bazel builds. Fixes #16. Also fix nuq/sfp-inl: warning, cast, and disable SCALAR (PiperOrigin-RevId: 612704056) 2024-03-04 22:07:25 -08:00
Omar Sanseviero 8c857b957e Update README.md 2024-03-04 12:58:49 +01:00
Copybara-Service cd7468199c Merge pull request #65 from enum-class:narrowing-issues (PiperOrigin-RevId: 612279564) 2024-03-03 18:51:59 -08:00
austinvhuang c378ac2c56 [WIP] hello world example working. TODO: refactor interfaces to decouple arguments 2024-03-03 11:36:48 -05:00
austinvhuang 39cd59caec [WIP] create skeleton for example frontend application 2024-03-03 10:33:29 -05:00
enum-class 90c99c330d use hwy::SignedFromSize traits 2024-03-03 10:56:46 +08:00
austinvhuang 3c69695c1e transformations and validations (wip) 2024-03-02 14:46:51 -05:00
austinvhuang 7d7d43e661 converter transformations (wip) 2024-03-02 08:11:55 -05:00
Omar Sanseviero 86761dc113 Update README.md 2024-03-01 23:44:38 +01:00
austinvhuang 5be9a2243f initial (wip) convert_weights script from pytorch 2024-03-01 15:52:51 -05:00
Copybara-Service 808dbdc42b Merge pull request #71 from google:docs (PiperOrigin-RevId: 611785397) 2024-03-01 06:04:51 -08:00
austinvhuang b841620d6b add using gemma as a library notes to DEVELOPERS 2024-02-29 23:52:59 -05:00
Paul Chang ae7901c3f4 Minor style fix. Remove some obsolete TODOs. (PiperOrigin-RevId: 611571224) 2024-02-29 13:08:26 -08:00
enum-class 626be6deab add StaticCast 2024-02-29 21:00:54 +08:00
Jan Wassenberg b6aaf6bbb8 Fix for Android's 32-bit off_t. Fixes #62 (PiperOrigin-RevId: 611249534) 2024-02-28 15:30:19 -08:00
Copybara-Service c805fbe780 Merge pull request #66 from google:dev-cleanup (PiperOrigin-RevId: 611207602) 2024-02-28 13:22:46 -08:00
austinvhuang 0ea7b993de remove --log fixing https://github.com/google/gemma.cpp/issues/59, improve command line args help, add copybara #include sort guards in more source files, add README sections on running faster and related projects 2024-02-28 15:18:40 -05:00
Jan Wassenberg 272f17ddb3 Warning fixes: unused member, cast, unused function (PiperOrigin-RevId: 611074887) 2024-02-28 05:54:22 -08:00
enum-class 06dd013397 Add clang-tidy, fix narrowing issues, fix constness 2024-02-28 20:04:09 +08:00
Copybara-Service f4a14bfdf2 Merge pull request #58 from google:dev-cleanup (PiperOrigin-RevId: 610942948) 2024-02-27 18:47:38 -08:00
austinvhuang 060c8862dd whitespace cleanup 2024-02-27 21:36:43 -05:00
austinvhuang d37f9c3604 re-enable SortIncludes to conform to vanilla Google style, add comment lines to #includes in gemma.h as barriers to block destructive sorting, update doc + remove shell script 2024-02-27 21:23:33 -05:00
austinvhuang 8f3bd63bf7 Fix copybara include path substitutions errors (which break the google3 build) arising from clang-format linter automation 2024-02-27 17:11:15 -05:00