llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

* quantize : imatrix-fail early + code cleanup

* fix manual override printing

it's in the preliminary loop now, so needs to be on its own line

* revert header changes per ggerganov

* remove old #includes

* clarify naming

rename `tensor_quantization` to `tensor_typo_option` to descirbe its
functionality

* fix per barto

This commit is contained in:

ddh0

2026-03-10 01:16:05 -05:00

• committed by

GitHub

parent c96f608d98

commit 1dab5f5a44

No known key found for this signature in database

GPG key ID: B5690EEEBB952194

2 changed files with 485 additions and 316 deletions

751

src/llama-quant.cpp

View file

File diff suppressed because it is too large Load diff

Rows
Columns

llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

751 src/llama-quant.cpp View file

751

src/llama-quant.cpp

View file