Add Model Storage #89

fyuan1316 · 2026-01-29T08:15:11Z

Summary by CodeRabbit

Documentation
- Added a comprehensive guide for model storage and loading in cloud-native inference environments.
- Covers S3 object storage, OCI containerized model images, and PVC-based storage with clear examples.
- Includes authentication guidance, deployment procedures, verification steps, prerequisites, and operational notes.

coderabbitai · 2026-01-29T08:15:31Z

Walkthrough

Adds a new MDX documentation file describing three model storage modalities for cloud-native inference—S3 Object Storage, OCI Model-as-Image, and PVC—plus loading mechanisms (Init Container, Sidecar), configuration examples, and deployment snippets.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Documentation - Model Storage Guide `docs/en/model_inference/model_management/functions/model_storage.mdx` | Adds a new comprehensive guide covering S3 (auth, Kubernetes Secret/ServiceAccount, Init Container example), OCI Model-as-Image (packaging, Sidecar/native OCI examples), and PVC (upload, storageUri, verification). Includes prerequisites, code blocks, and deployment examples. |

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Poem

🐇 I hopped through docs to store a dream,
S3 baskets and OCI cream,
PVC burrows snug and neat,
Models safe where cloud and ground meet. ✨

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title 'Add Model Storage' clearly and directly describes the main change: introducing new documentation for model storage options. It is specific enough to convey the primary purpose of the changeset. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |

coderabbitai

Actionable comments posted: 5

🤖 Fix all issues with AI agents

In `@docs/en/model_inference/model_management/functions/model_storage.mdx`:
- Line 22: Replace the hyphenated S3 endpoint placeholders in the table
("your-s3-service-ip:your-s3-port") with the underscore-formatted placeholders
used in the YAML ("your_s3_service_ip:your_s3_port") so the example strings
match; update all occurrences (e.g., the cell at the shown table row and the
other instance around line 36) to use the underscore format for consistency.
- Around line 5-6: The opening sentence under the "Model Storage" heading
currently lists only S3 and OCI; update that sentence to also mention PVC
(Persistent Volume Claim) so it reflects all storage options documented on the
page (e.g., "You can store a model in an S3 bucket, Open Container Initiative
(OCI) containers, or a Persistent Volume Claim (PVC)."). Locate and edit the
initial paragraph following the "Model Storage" heading and ensure terminology
matches other sections that reference PVC (use "Persistent Volume Claim (PVC)"
on first mention).
- Around line 216-220: Steps 3 and 4 repeat the sentence starter "In…" — reword
them to avoid repetition by merging into one instruction: replace the two lines
starting "In your workbench IDE, navigate to the file browser:" and "In the file
browser, navigate to the home directory." with a single line like "Open the file
browser (Files tab in JupyterLab or Explorer view in code-server) and navigate
to the home directory, which represents the root of your attached PVC." This
keeps the referenced UI elements ("Files tab", "Explorer view", "home
directory") but removes the repeated "In…" sentence starts.
- Around line 125-127: Update the prerequisite text that currently reads "PSA
(Pod Security Admission) Enforce set to Privilege" to use the correct lowercase
Kubernetes Pod Security Admission level: change it to "PSA (Pod Security
Admission) Enforce set to privileged"; ensure the rest of the prerequisite (the
Enable Modelcar line with uidModelcar set to 0) remains unchanged.
- Line 272: Update the storageUri example and add a short note clarifying the
optional model path and namespace behavior: state that the PVC URI format is
pvc://<pvc-name>/<model-path-within-pvc>, that the example storageUri:
pvc://model-pvc refers to the PVC root, and show an example with a specific path
(e.g., pvc://model-pvc/models/my-model); also add a sentence that the PVC must
exist in the same Kubernetes namespace as the InferenceService (the URI itself
does not include a namespace).
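
For reviewers checking the last item, the `pvc://<pvc-name>/<model-path-within-pvc>` format can be illustrated with a small parser. This is only a sketch of the URI shape described in the comment above, not the code the serving runtime actually uses; the helper name `parse_pvc_uri` is hypothetical, and it assumes that an empty path means the PVC root.

```python
from typing import Tuple
from urllib.parse import urlparse


def parse_pvc_uri(storage_uri: str) -> Tuple[str, str]:
    """Split a pvc:// storage URI into (pvc_name, path_within_pvc).

    Hypothetical helper for illustration only. An empty path is
    treated as the root of the PVC.
    """
    parsed = urlparse(storage_uri)
    if parsed.scheme != "pvc" or not parsed.netloc:
        raise ValueError(f"not a PVC storage URI: {storage_uri!r}")
    # The URI has no namespace component: the PVC is looked up in the
    # namespace of the resource that references it.
    return parsed.netloc, parsed.path.strip("/")


if __name__ == "__main__":
    print(parse_pvc_uri("pvc://model-pvc"))
    print(parse_pvc_uri("pvc://model-pvc/models/my-model"))
```

The first call corresponds to the current `storageUri: pvc://model-pvc` example (PVC root), the second to the suggested example with an explicit model path.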

🧹 Nitpick comments (1)

docs/en/model_inference/model_management/functions/model_storage.mdx (1)
24-24: Add a production safety note for HTTP.
This row reads like a recommendation; consider explicitly warning against HTTP in production.
✏️ Suggested edit
-| HTTPS Enabled | 0 | Encryption disabled for internal test/Demo environment |
+| HTTPS Enabled | 0 | Use only for internal test/demo; use HTTPS (1) in production |

coderabbitai · 2026-01-29T08:19:44Z