[SYCL] supprt Flash Attention for fp32/fp16/Q4/Q5/Q8 (#20190)
* support flash-attention for fp32/fp16/Q4/Q5/Q8 * rm warining * update for JIT
This commit is contained in:
parent
c5a778891b
commit
213c4a0b81
65 changed files with 20091 additions and 8593 deletions
23688
docs/ops/SYCL.csv
23688
docs/ops/SYCL.csv
File diff suppressed because it is too large
Load diff
Loading…
Add table
Add a link
Reference in a new issue