llama: fix llama-model-saver (#20503)

* llama : add fd-based model loading via llama_model_load_from_fd
* llama : address review feedback for fd-based model loading
* llama : use FILE pointer instead of fd in public API
* llama : use FILE pointer consistently, address review feedback
* fixup
* fix tensor names
* fix llama-model-saver
* roundtrip tests
* fixup
* refactor tests
* fix prints
* fix model saving
* fix CI, disable Chameleon
* print seed

---------

Co-authored-by: Siddhesh2377 <siddheshsonar2377@gmail.com>
2026-03-25 11:53:16 +01:00
commit 36dafba5c4
parent 69e0ecef06
16 changed files with 338 additions and 99 deletions
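
The `include/llama.h` hunk in this commit adds `llama_model_load_from_file_ptr`, which takes an already-open `FILE *` instead of a path. The following is a minimal caller-side sketch, not code from this commit. `has_gguf_magic`, `load_model_from_stream`, and the `USE_LLAMA` macro are hypothetical names introduced for illustration. The sketch assumes the stream is seekable and positioned at the start of a GGUF file, whose first four bytes are the ASCII magic `GGUF`. Only the signature of `llama_model_load_from_file_ptr` is taken from the diff. Whether the loader takes ownership of the `FILE *` is not stated in this hunk, so the sketch leaves closing the stream to the caller.

```c
#include <stdio.h>
#include <string.h>

#ifdef USE_LLAMA   /* hypothetical macro: define it when building against llama.cpp */
#include "llama.h"
#endif

/* Hypothetical helper: returns 1 if the stream holds the 4-byte GGUF magic
 * at its current position, 0 otherwise. The stream position is restored
 * afterwards, so the stream can still be passed on to a loader. */
static int has_gguf_magic(FILE * file) {
    char magic[4];
    long pos;
    int  ok;

    if (file == NULL) {
        return 0;
    }
    pos = ftell(file);
    if (pos < 0) {
        return 0; /* not seekable (e.g. a pipe) */
    }
    ok = fread(magic, 1, sizeof(magic), file) == sizeof(magic) &&
         memcmp(magic, "GGUF", sizeof(magic)) == 0;
    if (fseek(file, pos, SEEK_SET) != 0) {
        return 0; /* could not restore the position */
    }
    return ok;
}

#ifdef USE_LLAMA
/* Hypothetical wrapper: rejects streams that are obviously not GGUF, then
 * calls the entry point added in this commit with default parameters.
 * The caller keeps responsibility for the FILE pointer (an assumption). */
static struct llama_model * load_model_from_stream(FILE * file) {
    if (!has_gguf_magic(file)) {
        return NULL;
    }
    return llama_model_load_from_file_ptr(file, llama_model_default_params());
}
#endif
```

A caller would typically obtain the stream with `fopen(path, "rb")` or from a descriptor handed over by the platform, call `load_model_from_stream`, and check the result for `NULL` before use. Checking the magic first is optional; it only gives an earlier, cheaper failure for clearly wrong inputs.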
--- a/include/llama.h
+++ b/include/llama.h
@@ -465,6 +465,11 @@ extern "C" {
                             const char * path_model,
              struct llama_model_params   params);

+    // Load a model from an open FILE pointer
+    LLAMA_API struct llama_model * llama_model_load_from_file_ptr(
+                                   FILE * file,
+              struct llama_model_params   params);
+
    // Load a model from multiple splits (support custom naming scheme)
    // The paths must be in the correct order
    LLAMA_API struct llama_model * llama_model_load_from_splits(