Description
Hello, I was trying Droid CLI (version 0.70.0) with a local model, specifically Qwen3.5-122B-A10B-Q8_0 served with llama.cpp's llama-server.
I ran into timeout issues because my local model is very slow at prompt processing once the session builds up a decent amount of context.
Context caching was working initially, so something must have broken it. Possibly this error I had:
For a while Droid kept hitting the timeout and retrying, and llama-server was still making progress on the context, processing about 4K tokens per attempt before the timeout hit again (my rig is very slow). My llama-server output:
It eventually stopped retrying, and I got the `The AI model timed out. Please retry or switch models with /model.` error in Droid.
It would be great if we could set a custom timeout for slow machines/models like mine; I did not see such an option in the documentation.
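To give a sense of scale, here is a rough back-of-the-envelope sketch of the client timeout a slow rig actually needs for an uncached prefill. The token count, prefill speed, and safety margin below are all illustrative assumptions, not measurements from my setup:

```python
# Rough estimate of the client-side timeout needed while llama-server
# processes (prefills) a large, uncached context on slow hardware.
# All numbers are illustrative assumptions.

def required_timeout_s(context_tokens: int,
                       prefill_tokens_per_s: float,
                       margin: float = 1.5) -> float:
    """Seconds a client should be willing to wait for prefill to finish,
    with a safety margin for other overhead."""
    return context_tokens * margin / prefill_tokens_per_s

# Example: a 32K-token context at an assumed 20 tok/s prefill speed
# needs on the order of 2400 seconds (~40 minutes), far beyond any
# typical fixed HTTP timeout.
print(required_timeout_s(32_000, 20.0))
```

Even with generous assumptions, the wait grows linearly with context size, which is why a fixed timeout that works for hosted APIs can be orders of magnitude too short for a local model once caching breaks.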
Thanks.