gemma.cpp/gemma/bindings
Jan Wassenberg 9efdcfd45c 1.07x batch decode speedup: more BF16 weights and activations
BF16 att_sums and ffw_out
Support BF16 B views without decompression
Support arbitrary types in MulByConstAndAdd, AddFrom

Also update profiler annotations in ops-inl.h

PiperOrigin-RevId: 766995010
2025-06-03 23:30:18 -07:00
..
GemmaInterop.cs cleanup, new conversation methods, bugfixes 2025-05-07 08:52:44 -07:00
c_api.cc cleanup, new conversation methods, bugfixes 2025-05-07 08:52:44 -07:00
c_api.h cleanup, new conversation methods, bugfixes 2025-05-07 08:52:44 -07:00
context.cc 1.07x batch decode speedup: more BF16 weights and activations 2025-06-03 23:30:18 -07:00
context.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00