mirror of https://github.com/google/gemma.cpp.git
Before 38.28, 9.17 (with profiler enabled, prompt = 330 tok) ``` Gen.FFW : 15414 x 4692352 = 24.166318 Gen.Attention.SumHeads : 15414 x 1394804 = 7.183451 !! Gen.Embedding : 361 x 49961894 = 6.026297 Gen.Attention.QKV : 15414 x 1005125 = 5.176546 Gen.Attention.DotSoftmax : 15414 x 885480 = 4.560357 RopeAndMulBy : 696528 x 11867 = 2.761818 ``` After 49.80, 8.68 ``` Gen.FFW : 14448 x 5312783 = 25.646868 Gen.Embedding : 338 x 63044815 = 7.119845 Gen.Attention.QKV : 14448 x 1115003 = 5.382557 Gen.Attention.DotSoftmax : 14448 x 897577 = 4.332957 RopeAndMulBy : 673344 x 11886 = 2.674156 Gen.Attention.SumHeads : 14448 x 518291 = 2.501993 !! ``` PiperOrigin-RevId: 662024085 |
||
|---|---|---|
| .. | ||
| activations.h | ||
| backward-inl.h | ||
| backward.cc | ||
| backward.h | ||
| backward_scalar.h | ||
| backward_scalar_test.cc | ||
| backward_test.cc | ||
| common_scalar.h | ||
| forward-inl.h | ||
| forward.cc | ||
| forward.h | ||
| forward_scalar.h | ||
| optimize_test.cc | ||
| optimizer.cc | ||
| optimizer.h | ||
| prompt.h | ||
| sampler.h | ||
| test_util.h | ||