[SYCL] support Flash Attention for fp32/fp16/Q4/Q5/Q8 (#20190)

* support flash-attention for fp32/fp16/Q4/Q5/Q8

* remove warning

* update for JIT
Neo Zhang 2026-03-08 12:00:07 +08:00 committed by GitHub
parent c5a778891b
commit 213c4a0b81
GPG key ID: B5690EEEBB952194
65 changed files with 20091 additions and 8593 deletions
