ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)

* ggml-zendnn : add MUL_MAT_ID op support for MoE models
- Add MUL_MAT_ID op acceleration for Mixture-of-Experts models
- MUL_MAT_ID op fallback to CPU backend if total experts > 32
- Point ZenDNN lib to latest bits ZenDNN-2026-WW13

* ggml-zendnn : add braces to sgemm failure condition for consistency

Co-authored-by: Aaron Teo <taronaeo@gmail.com>

---------

Co-authored-by: Aaron Teo <taronaeo@gmail.com>
This commit is contained in:
Vishal Singh 2026-04-03 14:49:08 +05:30 committed by GitHub
parent b069b10ab4
commit f1ac84119c
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
5 changed files with 2959 additions and 7219 deletions

File diff suppressed because it is too large Load diff