llama.cpp/ggml/src/ggml-cann
chen fan 14c28dfc50
CANN: weight format to NZ for Ascend310P3 (#14407)
* weight format to nz for 310p

* remove quant weight format to nz

* clean code

* fix

* make the conditions for converting weights to NZ format consistent

* clean code
2025-07-23 11:58:00 +08:00
..
CMakeLists.txt CANN: Add SOC TYPE printing in cmake configuration (#13837) 2025-05-28 11:54:20 +08:00
Doxyfile CANN: Add the basic supports of Flash Attention kernel (#13627) 2025-05-26 10:20:18 +08:00
acl_tensor.cpp CANN: Add the basic supports of Flash Attention kernel (#13627) 2025-05-26 10:20:18 +08:00
acl_tensor.h CANN: Add the basic supports of Flash Attention kernel (#13627) 2025-05-26 10:20:18 +08:00
aclnn_ops.cpp CANN: weight format to NZ for Ascend310P3 (#14407) 2025-07-23 11:58:00 +08:00
aclnn_ops.h CANN: weight format to NZ for Ascend310P3 (#14407) 2025-07-23 11:58:00 +08:00
common.h fix async_mode bug (#14432) 2025-06-28 17:35:41 +08:00
ggml-cann.cpp CANN: weight format to NZ for Ascend310P3 (#14407) 2025-07-23 11:58:00 +08:00