This website requires JavaScript.
Explore
Help
Sign in
thek0tyara
/
llama-cpp-turboquant
Watch
1
Star
0
Fork
You've already forked llama-cpp-turboquant
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
Actions
4
7f323a589f
llama-cpp-turboquant
/
include
History
Download ZIP
Download TAR.GZ
David Huang
7f323a589f
Add
--no-op-offload
to improve
-ot
pp perf in MoE models like llama4 400B (
#13386
)
2025-05-11 14:18:39 +02:00
..
llama-cpp.h
llama : add
llama_vocab
, functions -> methods, naming (
#11110
)
2025-01-12 11:32:42 +02:00
llama.h
Add
--no-op-offload
to improve
-ot
pp perf in MoE models like llama4 400B (
#13386
)
2025-05-11 14:18:39 +02:00