llama-cpp-turboquant

History

Eve 3407364776 Q6_K AVX improvements (#10118 ) * q6_k instruction reordering attempt * better subtract method * should be theoretically faster small improvement with shuffle lut, likely because all loads are already done at that stage * optimize bit fiddling * handle -32 offset separately. bsums exists for a reason! * use shift * Update ggml-quants.c * have to update ci macos version to 13 as 12 doesnt work now. 13 is still x86		2024-11-04 23:06:31 +01:00
..
bench.yml.disabled	ggml-backend : add device and backend reg interfaces (#9707 )	2024-10-03 01:49:47 +02:00
build.yml	Q6_K AVX improvements (#10118 )	2024-11-04 23:06:31 +01:00
close-issue.yml	ci : fine-grant permission (#9710 )	2024-10-04 11:47:19 +02:00
docker.yml	musa: add docker image support (#9685 )	2024-10-10 20:10:37 +02:00
editorconfig.yml	ci: exempt master branch workflows from getting cancelled (#6486 )	2024-04-04 18:30:53 +02:00
gguf-publish.yml	ci : update checkout, setup-python and upload-artifact to latest (#6456 )	2024-04-03 21:01:13 +03:00
labeler.yml	labeler.yml: Use settings from ggerganov/llama.cpp [no ci] (#7363 )	2024-05-19 20:51:03 +10:00
nix-ci-aarch64.yml	ci : fine-grant permission (#9710 )	2024-10-04 11:47:19 +02:00
nix-ci.yml	ci : fine-grant permission (#9710 )	2024-10-04 11:47:19 +02:00
nix-flake-update.yml	ci: nix-flake-update: new token with pr permissions (#4879 )	2024-01-11 17:22:34 +00:00
nix-publish-flake.yml	workflows: nix-flakestry: drop tag filters	2023-12-31 13:14:58 -08:00
python-check-requirements.yml	py : fix requirements check '==' -> '~=' (#8982 )	2024-08-12 11:02:01 +03:00
python-lint.yml	convert.py : add python logging instead of print() (#6511 )	2024-05-03 22:36:41 +03:00
python-type-check.yml	ci : reduce severity of unused Pyright ignore comments (#9697 )	2024-09-30 14:13:16 -04:00
server.yml	common : reimplement logging (#9418 )	2024-09-15 20:46:12 +03:00