austinvhuang
|
0fc80fad05
|
libgemma refactor - review changes
|
2024-03-10 12:55:08 -04:00 |
austinvhuang
|
cc5c24c4f8
|
remove app.h dependency + fix bazel build
|
2024-03-08 18:06:43 -05:00 |
austinvhuang
|
8c7b2cf61b
|
add README, license to hello_world
|
2024-03-08 17:59:54 -05:00 |
austinvhuang
|
571a5449c4
|
update commit hash for gemma lib
|
2024-03-08 17:33:33 -05:00 |
austinvhuang
|
03147effbd
|
update loader arg names: cache -> compressed_weights, model -> weights
|
2024-03-08 17:32:36 -05:00 |
austinvhuang
|
dfd2fdc1dd
|
Decouple gemma constructor from loader args, update hello_world example, add convenience version of constructor (no uncompressed weights)
|
2024-03-08 17:26:03 -05:00 |
austinvhuang
|
42e53e2da8
|
[WIP] simplify hello world example, add convenience function. TODO: update git hash in CMakeLists.txt of hello world after push
|
2024-03-08 14:56:22 -05:00 |
Jan Wassenberg
|
fce5c8c967
|
Avoid fadvise on older Android. Fixes #84
PiperOrigin-RevId: 613815953
|
2024-03-07 22:19:22 -08:00 |
austinvhuang
|
b67e28d1a0
|
[WIP] remove args from GetWeights, GetCompressedWeights
|
2024-03-08 00:00:11 -05:00 |
Phil Culliton
|
2161908f50
|
Added 7B support and args parsing. Still todo: more testing of 7B conversion.
|
2024-03-07 22:34:14 +00:00 |
Austin Huang
|
3df06f64c2
|
Merge pull request #81 from ufownl/feature/separated_kvcache
Separate KV cache from GemmaImpl
|
2024-03-07 10:10:11 -05:00 |
RangerUFO
|
170a9b4690
|
Make `CreateKVCache` a free function rather than a method
|
2024-03-07 15:52:55 +08:00 |
RangerUFO
|
b841612e8c
|
Separate KV cache from GemmaImpl
|
2024-03-07 15:47:31 +08:00 |
austinvhuang
|
6c0388e049
|
[WIP] refine Runtime struct definition
|
2024-03-07 01:14:07 -05:00 |
austinvhuang
|
49e654258d
|
[WIP] clean up hello_world #includes and CMakeLists.txt
|
2024-03-07 01:04:25 -05:00 |
austinvhuang
|
e781007836
|
[WIP] Remove InferenceArgs from hello_world example, fix ordering of LoaderArgs validation, revert ReplGemma EOT token behavior
|
2024-03-06 23:21:13 -05:00 |
austinvhuang
|
7042316013
|
[WIP] update GemmaInterface, Gemma, and Generate input parameter specs to remove InferenceArgs. TODO: update hello_world example after git commit hash is available for fetching
|
2024-03-06 22:22:59 -05:00 |
austinvhuang
|
0f6a4b49d5
|
[WIP] quality tweaks - for constants, defer float cast and use double for intermediate computations, add `model` to EOT token
|
2024-03-06 15:34:11 -05:00 |
austinvhuang
|
5b9d8a9936
|
[WIP] dev/examples branch merge
|
2024-03-06 15:10:48 -05:00 |
austinvhuang
|
10f7a086aa
|
[WIP] decouple GemmaImpl from CLI args
|
2024-03-06 15:06:41 -05:00 |
Jan Wassenberg
|
3cdd5e524a
|
Fix loop iteration in GeluMulToBF16
Also attempt to speed up builders (parallel)
PiperOrigin-RevId: 613092863
|
2024-03-05 23:00:47 -08:00 |
enum-class
|
843d9b0e1f
|
fix for-loop bounderies
|
2024-03-06 08:25:50 +08:00 |
Copybara-Service
|
c8b9675898
|
Merge pull request #74 from osanseviero:patch-1
PiperOrigin-RevId: 612937722
|
2024-03-05 12:49:09 -08:00 |
Phil Culliton
|
c93e1a1e4d
|
Resolved layer ordering, reshaping, MQA->MHA, and quantization. Works only for 2B.
|
2024-03-05 17:54:55 +00:00 |
enum-class
|
bc845515b7
|
fix style, add kCamelCase style for constexpr in clang-tidy
|
2024-03-05 20:45:30 +08:00 |
enum-class
|
5f016fb433
|
use hwy/simd for RMSNorm(f, bf, f) calculation
|
2024-03-05 17:53:52 +08:00 |
enum-class
|
507d64e3e6
|
use hwy/simd for SquaredL2 calculation
|
2024-03-05 17:37:09 +08:00 |
Jan Wassenberg
|
bb9b023502
|
Support Bazel builds. Fixes #16
Also fix nuq/sfp-inl: warning, cast, and disable SCALAR
PiperOrigin-RevId: 612704056
|
2024-03-04 22:07:25 -08:00 |
Omar Sanseviero
|
8c857b957e
|
Update README.md
|
2024-03-04 12:58:49 +01:00 |
Copybara-Service
|
cd7468199c
|
Merge pull request #65 from enum-class:narrowing-issues
PiperOrigin-RevId: 612279564
|
2024-03-03 18:51:59 -08:00 |
austinvhuang
|
c378ac2c56
|
[WIP] hello world example working. TODO: refactor interfaces to decouple arguments
|
2024-03-03 11:36:48 -05:00 |
austinvhuang
|
39cd59caec
|
[WIP] create skeleton for example frontend application
|
2024-03-03 10:33:29 -05:00 |
enum-class
|
90c99c330d
|
use hwy::SignedFromSize traits
|
2024-03-03 10:56:46 +08:00 |
austinvhuang
|
3c69695c1e
|
transformations and validations (wip)
|
2024-03-02 14:46:51 -05:00 |
austinvhuang
|
7d7d43e661
|
converter transformations (wip)
|
2024-03-02 08:11:55 -05:00 |
Omar Sanseviero
|
86761dc113
|
Update README.md
|
2024-03-01 23:44:38 +01:00 |
austinvhuang
|
5be9a2243f
|
initial (wip) convert_weights script from pytorch
|
2024-03-01 15:52:51 -05:00 |
Copybara-Service
|
808dbdc42b
|
Merge pull request #71 from google:docs
PiperOrigin-RevId: 611785397
|
2024-03-01 06:04:51 -08:00 |
austinvhuang
|
b841620d6b
|
add using gemma as a library notes to DEVELOPERS
|
2024-02-29 23:52:59 -05:00 |
Paul Chang
|
ae7901c3f4
|
Minor style fix
Remove some obsolete TODOs.
PiperOrigin-RevId: 611571224
|
2024-02-29 13:08:26 -08:00 |
enum-class
|
626be6deab
|
add StaticCast
|
2024-02-29 21:00:54 +08:00 |
Jan Wassenberg
|
b6aaf6bbb8
|
Fix for Android's 32-bit off_t. Fixes #62
PiperOrigin-RevId: 611249534
|
2024-02-28 15:30:19 -08:00 |
Copybara-Service
|
c805fbe780
|
Merge pull request #66 from google:dev-cleanup
PiperOrigin-RevId: 611207602
|
2024-02-28 13:22:46 -08:00 |
austinvhuang
|
0ea7b993de
|
remove --log fixing https://github.com/google/gemma.cpp/issues/59, improve command line args help, add copybara #include sort guards in more source files, add README sections on running faster and related projects
|
2024-02-28 15:18:40 -05:00 |
Jan Wassenberg
|
272f17ddb3
|
Warning fixes: unused member, cast, unused function
PiperOrigin-RevId: 611074887
|
2024-02-28 05:54:22 -08:00 |
enum-class
|
06dd013397
|
Add clang-tidy, fix narrowing issues, fix constness
|
2024-02-28 20:04:09 +08:00 |
Copybara-Service
|
f4a14bfdf2
|
Merge pull request #58 from google:dev-cleanup
PiperOrigin-RevId: 610942948
|
2024-02-27 18:47:38 -08:00 |
austinvhuang
|
060c8862dd
|
whitespace cleanup
|
2024-02-27 21:36:43 -05:00 |
austinvhuang
|
d37f9c3604
|
re-enable SortIncludes to conform to vanilla Google style, add comment lines to #includes in gemma.h as barriers to block destructive sorting, update doc + remove shell script
|
2024-02-27 21:23:33 -05:00 |
austinvhuang
|
8f3bd63bf7
|
Fix copybara include path substitutions errors (which break the google3 build) arising from clang-format linter automation
|
2024-02-27 17:11:15 -05:00 |