llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

* quantize : imatrix-fail early + code cleanup

* fix manual override printing

it's in the preliminary loop now, so needs to be on its own line

* revert header changes per ggerganov

* remove old #includes

* clarify naming

rename `tensor_quantization` to `tensor_type_option` to describe its
functionality

* fix per barto
ddh0 2026-03-10 01:16:05 -05:00 committed by GitHub
parent c96f608d98
commit 1dab5f5a44
2 changed files with 485 additions and 316 deletions
