- Add adapter layer for TQ2_0 encoding conversion
- Implement branchless bitwise encoding conversion
- Add SIMD-accelerated Q8_K to int32 type conversion
- Integrate with ggml_vec_dot_tq2_0_q8_K_generic via threshold dispatch
- Add TQ2_0 test cases to test-backend-ops
- Include sparse-ternary-fma library (dense SIMD kernel)
- 2.3x throughput improvement on AVX-512

| Name |
|---|
| cmake |
| include |
| src |
| .gitignore |
| CMakeLists.txt |
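
The commit message mentions a branchless bitwise encoding conversion but does not show it. The following is a minimal sketch of what such a conversion could look like, not the code in this repository. It assumes the source byte packs four 2-bit TQ2_0 fields `q` in {0, 1, 2} that stand for the ternary values `q - 1`. The target encoding (`00` = 0, `01` = +1, `10` = -1) is a hypothetical choice made for illustration; the actual encoding expected by sparse-ternary-fma may differ. All function names here are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Branchless conversion of one packed byte (four 2-bit fields).
 * Source field q: 0 -> -1, 1 -> 0, 2 -> +1 (assumed TQ2_0 layout).
 * Target field t: 10 -> -1, 00 -> 0, 01 -> +1 (hypothetical layout). */
static uint8_t tq2_to_sign_mag_byte(uint8_t b) {
    uint8_t lo = b & 0x55;                       /* low bit of each field  */
    uint8_t hi = (b >> 1) & 0x55;                /* high bit of each field */
    uint8_t t_lo = hi;                           /* set when q == 2 (+1)   */
    uint8_t t_hi = (uint8_t)(~(hi | lo)) & 0x55; /* set when q == 0 (-1)   */
    return (uint8_t)((t_hi << 1) | t_lo);
}

/* Straightforward per-field reference, used only to check the version above. */
static uint8_t tq2_to_sign_mag_byte_ref(uint8_t b) {
    uint8_t out = 0;
    for (int i = 0; i < 4; ++i) {
        int q = (b >> (2 * i)) & 3;
        int t = (q == 0) ? 2 : (q == 2) ? 1 : 0;
        out |= (uint8_t)(t << (2 * i));
    }
    return out;
}

/* Compares both versions on every byte whose fields are all valid (q != 3).
 * Returns the number of bytes checked. */
static int tq2_selfcheck(void) {
    int checked = 0;
    for (int b = 0; b < 256; ++b) {
        int valid = 1;
        for (int i = 0; i < 4; ++i) {
            if (((b >> (2 * i)) & 3) == 3) valid = 0;
        }
        if (!valid) continue;
        assert(tq2_to_sign_mag_byte((uint8_t)b) == tq2_to_sign_mag_byte_ref((uint8_t)b));
        ++checked;
    }
    return checked;
}
```

Because the conversion uses only masks and shifts, the same expression applies unchanged to wider words or SIMD lanes by widening the `0x55` mask, which is presumably what makes a branchless form attractive here.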