| Name | Last commit message | Last commit date |
| --- | --- | --- |
| bindings | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| evals | Add MMLU eval to github | 2024-05-20 10:20:53 -07:00 |
| activations.h | Makes the entire runtime_config passed into the activations constructor. | 2025-12-16 01:56:52 -08:00 |
| api_client.cc | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| api_server.cc | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| attention.cc | internal change | 2025-12-09 06:17:08 -08:00 |
| attention.h | Also update attention.h to type-erased query_norm_scale | 2025-10-28 06:48:33 -07:00 |
| attention_test.cc | Makes the entire runtime_config passed into the activations constructor. | 2025-12-16 01:56:52 -08:00 |
| configs.cc | Make attention configurable on the command line. | 2025-12-10 09:34:06 -08:00 |
| configs.h | Warning fixes (sign mismatch), switch default | 2025-12-15 02:41:19 -08:00 |
| configs_test.cc | Minor: rename compression/shared -> types.h | 2025-05-13 06:53:21 -07:00 |
| flash_attention.cc | Internal changes | 2025-12-15 07:14:37 -08:00 |
| flash_attention.h | Internal changes | 2025-11-28 02:37:06 -08:00 |
| flash_attention_test.cc | Makes the entire runtime_config passed into the activations constructor. | 2025-12-16 01:56:52 -08:00 |
| flash_structs.h | Add some comments. | 2025-11-19 01:09:15 -08:00 |
| gemma-inl.h | Add tensor stats and output | 2025-12-11 22:52:46 -08:00 |
| gemma.cc | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| gemma.h | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| gemma_args.h | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| gemma_args_test.cc | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| kv_cache.cc | Clean up toPtrs to delegate to toPtr | 2025-12-04 06:22:04 -08:00 |
| kv_cache.h | internal change | 2025-12-10 09:01:15 -08:00 |
| kv_cache_test.cc | No public description | 2025-12-12 06:37:17 -08:00 |
| model_store.cc | Improves autodetection of Gemma3-1B. | 2025-11-17 01:12:50 -08:00 |
| model_store.h | Major cleanup of profiler zones, add Caller annotation for all pool.Run | 2025-10-23 01:54:24 -07:00 |
| query.h | Warning fixes (sign mismatch), switch default | 2025-12-15 02:41:19 -08:00 |
| run.cc | Abort if args are unrecognized, refactor argument passing | 2025-12-15 03:18:45 -08:00 |
| tensor_info.cc | Add tensor stats and output | 2025-12-11 22:52:46 -08:00 |
| tensor_info.h | Add tensor stats and output | 2025-12-11 22:52:46 -08:00 |
| tensor_info_test.cc | Minor: ModelWeightsPtrs -> WeightsPtrs | 2025-07-11 06:11:51 -07:00 |
| tensor_stats.cc | Add tensor stats and output | 2025-12-11 22:52:46 -08:00 |
| tensor_stats.h | Add tensor stats and output | 2025-12-11 22:52:46 -08:00 |
| tokenizer.cc | (Resubmit) Prepare profiler annotations for new API | 2025-08-13 01:38:24 -07:00 |
| tokenizer.h | 6x large-batch, short-prompt prefill speedup | 2025-06-10 09:56:20 -07:00 |
| vit.cc | Fix Gemma3 image: ensure A matrix is packed, preallocate | 2025-12-01 11:47:23 -08:00 |
| vit.h | Minor: ModelWeightsPtrs -> WeightsPtrs | 2025-07-11 06:11:51 -07:00 |
| weights.cc | Minor: ParallelismStrategy->Parallelism | 2025-11-06 06:56:10 -08:00 |
| weights.h | Add tensor stats and output | 2025-12-11 22:52:46 -08:00 |