llama : add fd-based model loading via llama_model_load_from_fd (REWORK) #20402

Siddhesh2377 wants to merge 3 commits into ggml-org:master from
Conversation
ggml/include/gguf.h (outdated)

    GGML_API struct gguf_context * gguf_init_empty(void);
    GGML_API struct gguf_context * gguf_init_from_file(const char * fname, struct gguf_init_params params);
    GGML_API struct gguf_context * gguf_init_from_fd(int fd, struct gguf_init_params params);
For your purposes, would it work to expose the current gguf_init_from_file_impl as gguf_init_from_file_ptr and to use that as the basis for the implementation instead? That way we would be able to also use this code on Windows in conjunction with ggml_fopen.
Done, replaced gguf_init_from_fd with gguf_init_from_file_ptr(FILE *) and moved the dup+fdopen logic up to llama-model-loader.
The llama C API should also use a file pointer if at all possible; the conversion from file descriptor to file pointer should happen in your user code.
Done, switched the llama C API to a FILE pointer as well. The test shows the fd-to-FILE* conversion on the caller side.
Adds llama_model_load_from_fd() to load GGUF models from a POSIX file descriptor instead of a file path.

On Android, apps accessing user files through SAF only get a file descriptor, not a path. The alternatives are copying the model into app storage or requesting MANAGE_EXTERNAL_STORAGE, which gets rejected by Google Play. This happened with my app (ToolNeuron).
Reworked version of a previous PR that was rejected for code quality.
Not supported on Windows. The fd is dup'd internally, so the caller retains ownership of the original descriptor.
Tested locally with CI and a real model (vocab_only + mmap).