rfc: mutability & encryption for forge by fforbeck · Pull Request #84 · storacha/RFC

fforbeck · 2026-03-04T16:47:27Z

hannahhoward

Overall, this is good, but there are some critical missing bits that I think will emerge from working closely with @Peeja, @alanshaw, and the existing Go devs.

Specifically,

"Service layer" -- is this a server? A secondary process on the dev machine? I don't think either of these is a good idea, and the server approach would break a number of design principles of our system (namely that all CIDs should be generated on the client). Personally I think a language port is gonna be WAY faster (especially with AI-aided dev) and produce a way less complex system, so I'd argue strongly for a full port. These aren't complex libraries, and I think we could have them ported in a week or two with the AI helping. And then we have a single process for a single machine, way simpler to maintain and reason about.
There's a bit of unspecified confusion about how Pail works in the Forge context, which I think you might need to embed with @Peeja on Guppy to really grok. So Guppy has a notion of "sources" -- i.e. data sources (usually large, deep directories) that get uploaded within a space. Each space has 1..n sources, and when you upload within a space, after the first upload of a source, only the "delta" gets updated -- Guppy knows how to upload just the blocks needed to make a new updated UnixFS root. So with mutability:

You have the list of sources which get updated, and you DEFINITELY want that to be represented by Pail + UCN.
You have the directory tree structure within the sources itself. This is currently UnixFS and is updated properly each incremental upload.
So the real question is whether to use Pail for the whole directory tree, and I think that's a complicated question that merits further examination.

Reasons not to use Pail:

These are extremely big, complicated directories, and Pail hasn't been tested at a scale even remotely close to what these directories need.
The retrieval patterns and general usage for Pail are totally different from those for UnixFS -- so the downstream implications of using Pail for the whole directory tree structure are unknown.

Reasons to use Pail:

Much more fine-grained "multi-writer" capabilities are unlocked if you use Pail for everything. If you used Pail for just the sources list, then you'd essentially have last-writer-wins on a per-source level -- if source X is in state A, and two different Guppies make several changes to the directory tree structure, written as UnixFS, then the directory structure would by default ONLY get the changes of the last client to write. Note: we could apply a smarter merge outside of Pail, similar to the way I merge Markdown files in Clawracha. I actually believe this wouldn't be TOO hard.
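To make the last-writer-wins concern concrete, here is a minimal sketch of what a "smarter merge outside of Pail" might look like for a directory listing. It is only an illustration under assumptions: the `Tree` type, the `three_way_merge` function, and the plain-string "CIDs" are all hypothetical and are not Guppy, Pail, or Clawracha APIs. Real UnixFS trees are nested DAGs, not flat path maps.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical flat model of a source: path -> content CID (plain strings here).
Tree = Dict[str, str]


def three_way_merge(base: Tree, ours: Tree, theirs: Tree) -> Tuple[Tree, List[str]]:
    """Merge two edited copies of `base`, path by path.

    Returns the merged tree plus the list of paths both writers changed
    differently. For those paths only, this sketch falls back to
    last-writer-wins (it keeps `theirs`).
    """
    merged: Tree = {}
    conflicts: List[str] = []
    for path in sorted(set(base) | set(ours) | set(theirs)):
        b: Optional[str] = base.get(path)
        o: Optional[str] = ours.get(path)
        t: Optional[str] = theirs.get(path)
        if o == t:
            choice = o          # both writers agree (including both deleting)
        elif o == b:
            choice = t          # only `theirs` changed this path
        elif t == b:
            choice = o          # only `ours` changed this path
        else:
            conflicts.append(path)
            choice = t          # true conflict: last writer wins for this path
        if choice is not None:  # None means the path was deleted
            merged[path] = choice
    return merged, conflicts


# State A, then two clients edit different files concurrently.
state_a = {"a.txt": "cid-a1", "b.txt": "cid-b1"}
client_1 = {"a.txt": "cid-a2", "b.txt": "cid-b1"}
client_2 = {"a.txt": "cid-a1", "b.txt": "cid-b1", "c.txt": "cid-c1"}
merged, conflicts = three_way_merge(state_a, client_1, client_2)
```

With plain last-writer-wins the result would simply be `client_2`, and the edit to `a.txt` would be lost. The per-path merge keeps both edits and narrows last-writer-wins to paths that both clients actually changed.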

Final sidebar: Current Guppy is also smart enough to only upload diff blocks for Files when they change. Encryption will kill that ability I believe, unless there's some useful way to encode only changes that works for encrypted data. Worth a google.
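A small sketch of why this is a concern, under stated assumptions: files are split into fixed-size blocks, each block is identified by its SHA-256 hash, and the whole file is encrypted with a fresh nonce on every upload. The `toy_encrypt` function below is a hypothetical stand-in for a real cipher and is NOT secure. It exists only to show the effect on block identity, and none of these names come from Guppy.

```python
import hashlib

BLOCK = 8  # tiny block size so the example is easy to follow


def block_ids(data: bytes) -> list:
    """Hash each fixed-size block; stands in for per-block CIDs."""
    return [hashlib.sha256(data[i:i + BLOCK]).hexdigest()
            for i in range(0, len(data), BLOCK)]


def toy_encrypt(key: bytes, nonce: bytes, data: bytes) -> bytes:
    """XOR data with a SHA-256-derived keystream. Illustration only, not secure."""
    out = bytearray()
    for i in range(0, len(data), 32):
        pad = hashlib.sha256(key + nonce + i.to_bytes(8, "big")).digest()
        out.extend(x ^ y for x, y in zip(data[i:i + 32], pad))
    return bytes(out)


def changed_blocks(old: bytes, new: bytes) -> int:
    """Count blocks of `new` that were not already present in `old`."""
    old_ids = set(block_ids(old))
    return sum(1 for b in block_ids(new) if b not in old_ids)


v1 = b"aaaaaaaabbbbbbbbccccccccdddddddd"
v2 = b"aaaaaaaabbbbbbbbXXXXXXXXdddddddd"  # only the third block differs

key = b"k" * 32
# Fixed, distinct nonces stand in for "fresh random nonce per upload".
e1 = toy_encrypt(key, b"\x00" * 12, v1)
e2 = toy_encrypt(key, b"\x01" * 12, v2)

plain_delta = changed_blocks(v1, v2)      # blocks to upload without encryption
encrypted_delta = changed_blocks(e1, e2)  # blocks to upload after whole-file encryption
```

In this model the plaintext delta is a single block, while the encrypted delta covers every block, because a new nonce changes the whole ciphertext even where the plaintext is unchanged.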

alanshaw · 2026-03-05T15:55:28Z

rfc/forge-mutability-encryption.md

+│          │                                                   │
+│          ▼                                                   │
+│  9. Publish to UCN: Name.publish(pailRootCID)                │
+│     → mutable name now points to updated index               │


This is "pail without CRDT" - in the case of multiple concurrent updates to the same name, the UCN resolution is to just use the first of the alphabetically sorted CIDs (IIRC). It means if 2 users start with the same pail, and both make an update, only 1 wins.

The Pail CRDT library allows the two updates to be applied, only resorting to alphabetically sorted CIDs when the two updates have the same causal order and touch the same key.
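The difference between the two behaviours can be sketched roughly as follows. This is a simplified model, not the UCN or Pail API: `resolve_without_crdt`, `merge_concurrent`, and the `Update` tuple are hypothetical names, "CIDs" are plain strings, and it assumes the tie-break sorts on the CID of the update event, which the comment above does not specify.

```python
from typing import Dict, List, Tuple

# Hypothetical update record: (event CID, key, value CID), all plain strings.
Update = Tuple[str, str, str]


def resolve_without_crdt(candidate_roots: List[str]) -> str:
    """'Pail without CRDT': of the concurrent roots, keep only the
    alphabetically first one. Every other writer's update is dropped."""
    return sorted(candidate_roots)[0]


def merge_concurrent(base: Dict[str, str], updates: List[Update]) -> Dict[str, str]:
    """CRDT-style merge of updates that share the same causal order.

    Updates to different keys are all applied. When two updates touch the
    same key, the one with the alphabetically first event CID wins, so it
    is written last.
    """
    merged = dict(base)
    for _event_cid, key, value in sorted(updates, reverse=True):
        merged[key] = value
    return merged


base = {"docs": "v0"}

# Two users start from the same pail and update different keys.
disjoint = [("bafy-b", "photos", "v1"), ("bafy-a", "docs", "v2")]
both_kept = merge_concurrent(base, disjoint)

# Two users update the same key concurrently: the tie-break applies.
same_key = [("bafy-b", "docs", "vB"), ("bafy-a", "docs", "vA")]
tie_broken = merge_concurrent(base, same_key)

# Without the CRDT, only one whole root survives.
winner = resolve_without_crdt(["bafy-root-b", "bafy-root-a"])
```

In the first case both writers' changes survive. Only in the second case does the sorted-CID rule decide anything, and even then it decides a single key rather than the whole index.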

alanshaw · 2026-03-05T16:05:46Z

rfc/forge-mutability-encryption.md

+│     - KMS info                                               │
+│          │                                                   │
+│          ▼                                                   │
+│  5. Extract encrypted content from CAR using encryptedDataCID│


Why don't we encrypt each block?
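One possible reading of this question, sketched under assumptions: each block is encrypted independently, with a nonce derived deterministically from the block's plaintext, so unchanged blocks produce identical ciphertext and can still be skipped on upload. The `keystream_xor` cipher is a toy stand-in and is NOT secure, and `encrypt_blocks` and `decrypt_blocks` are hypothetical names, not part of the RFC or any Storacha library.

```python
import hashlib
import hmac

BLOCK = 8  # tiny block size so the example is easy to follow


def keystream_xor(key: bytes, nonce: bytes, data: bytes) -> bytes:
    """XOR data with a SHA-256-derived keystream. Illustration only, not secure."""
    out = bytearray()
    for i in range(0, len(data), 32):
        pad = hashlib.sha256(key + nonce + i.to_bytes(8, "big")).digest()
        out.extend(x ^ y for x, y in zip(data[i:i + 32], pad))
    return bytes(out)


def encrypt_blocks(key: bytes, data: bytes) -> list:
    """Encrypt each block on its own, prefixing the nonce used for it."""
    blocks = []
    for i in range(0, len(data), BLOCK):
        plain = data[i:i + BLOCK]
        # Nonce depends only on key and plaintext: same block, same ciphertext.
        nonce = hmac.new(key, plain, hashlib.sha256).digest()[:12]
        blocks.append(nonce + keystream_xor(key, nonce, plain))
    return blocks


def decrypt_blocks(key: bytes, blocks: list) -> bytes:
    """Reverse encrypt_blocks: split off each nonce and XOR again."""
    out = bytearray()
    for blob in blocks:
        nonce, body = blob[:12], blob[12:]
        out.extend(keystream_xor(key, nonce, body))
    return bytes(out)


key = b"k" * 32
v1 = b"aaaaaaaabbbbbbbbccccccccdddddddd"
v2 = b"aaaaaaaabbbbbbbbXXXXXXXXdddddddd"  # only the third block differs

c1 = encrypt_blocks(key, v1)
c2 = encrypt_blocks(key, v2)
to_upload = [blk for old, blk in zip(c1, c2) if old != blk]
```

Here only the one changed block needs re-uploading. The trade-off is that deterministic per-block encryption reveals which blocks are equal to anyone who can see the ciphertext, so whether this is acceptable depends on the threat model.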

rfc: mutability & encryption for forge

b6a46df

fforbeck requested a review from a team March 4, 2026 16:47

fforbeck self-assigned this Mar 4, 2026

fforbeck mentioned this pull request Mar 4, 2026

Plan privacy/mutability story storacha/project-tracking#663

Open

hannahhoward reviewed Mar 5, 2026

alanshaw reviewed Mar 5, 2026

fforbeck wants to merge 1 commit into main from rfc/privacy-mutability-forge
