gossipsub: experimental draft for large message segmentation extension by theUtkarshRaj · Pull Request #2 · seetadev/specs

theUtkarshRaj · 2026-05-06T18:23:01Z

Summary

This is a draft exploratory spec note for transparent segmentation of large Gossipsub payloads, plus minimal protobuf registry hooks so implementations can experiment in a consistent way.

Adds pubsub/gossipsub/extensions/experimental/large-message-segmentation.md (discussion-oriented; non-normative).
Extends pubsub/gossipsub/extensions/extensions.proto with an experimental capability flag, an RPC extension field, and LargeMessageSegmentationExtension.

Context

The goal is to sketch how segmentation could sit alongside the existing extension framework and how it differs from Partial Messages (complementary use cases: “no one has the full blob yet” vs “most of the message is already held”). Peer scoring is discussed at the reconstructed message level only; retry/retransmission is explicitly out of scope here.

Notes

Intended for early review and iteration, not as a finalized protocol.
Field numbers follow the existing experimental range convention in this file.
Related:
[DMP 2026]: Gossipsub 1.4 Large Message Handling, Specification PR and Candidate Recommendation #1

Co-authored-by: Cursor <cursoragent@cursor.com>

theUtkarshRaj · 2026-05-08T13:04:00Z

Pushed a follow-up commit (f909f55) addressing review-readiness gaps in the draft:

Security Considerations — covers reassembly buffer exhaustion, segment flooding under forged messageID, last-segment withholding, and inconsistent totalSegments, each with mitigations.
Wire format clarification — checksum is now specified as SHA-256 over messageID || segmentIndex || payload for per-segment integrity before reassembly.
Interaction with Existing Gossipsub Mechanics — clarifies that segments are themselves gossipsub messages subject to standard MessageID/IHAVE/IWANT/IDONTWANT semantics, with the extension's messageID identifying only the parent payload.
Tentative answers to the two open questions — protocol-generated messageID via SHA-256(publisherPeerID || topic || nonce)[:16], and a 1 MiB max segment size with publisher choice below. Original questions are kept visible for discussion.
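
For readers following along, the two hash constructions above can be sketched in Python roughly as follows. This is a minimal illustration and not the spec's wire encoding: the function names are hypothetical, and the byte encodings (UTF-8 for the topic, a 4-byte big-endian segmentIndex, raw bytes for the peer ID and nonce) are assumptions, since the summary above does not state them.

```python
import hashlib
import hmac


def derive_message_id(publisher_peer_id: bytes, topic: str, nonce: bytes) -> bytes:
    """SHA-256(publisherPeerID || topic || nonce), truncated to 16 bytes."""
    # Assumption: the topic is concatenated as its UTF-8 encoding.
    digest = hashlib.sha256(publisher_peer_id + topic.encode("utf-8") + nonce).digest()
    return digest[:16]


def segment_checksum(message_id: bytes, segment_index: int, payload: bytes) -> bytes:
    """SHA-256 over messageID || segmentIndex || payload."""
    # Assumption: segmentIndex is serialized as a 4-byte big-endian unsigned int.
    index_bytes = segment_index.to_bytes(4, "big")
    return hashlib.sha256(message_id + index_bytes + payload).digest()


def verify_segment(message_id: bytes, segment_index: int,
                   payload: bytes, checksum: bytes) -> bool:
    """Check one segment's integrity before it enters the reassembly buffer."""
    expected = segment_checksum(message_id, segment_index, payload)
    return hmac.compare_digest(expected, checksum)
```

Binding the checksum to both messageID and segmentIndex means a valid segment cannot be replayed at a different position or under a different parent payload without the check failing.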

Also worth flagging: a py-libp2p reference implementation tracking this draft was opened today at libp2p/py-libp2p#1323 by @shivv23. There are two minor divergences (RPC field number, default segment size) — I've responded over there with my thoughts and we should be able to converge quickly. Cross-implementation activity at this stage is exactly what the experimental-extension lifecycle is meant to surface.

cc @MarcoPolo @cskiraly — would appreciate early signal on whether the experimental-extension framing is the right entry point here, or whether segmentation should target a separate protocol ID. Related to #1.

- Change largeMessageSegmentation field from 8473921 to 6492435 in both ControlExtensions and RPC to match py-libp2p (PR libp2p/py-libp2p#1323)
- Rename RPC.largeSegmentation to RPC.largeMessageSegmentation for consistency
- Note py-libp2p's 256 KiB default segment size under Open Question 2

align field number and naming with py-libp2p reference implementation

theUtkarshRaj · 2026-05-08T17:59:10Z

Quick status update: merged shivv23/specs#align-field-number into the spec branch. The protobuf field number is now 6492435 (matching py-libp2p), the RPC field is consistently named largeMessageSegmentation, and Open Question 2 in the spec text now notes the py-libp2p reference default of 256 KiB segment payload under the 1 MiB ceiling. Spec and reference implementation are now aligned on field number, segment-size policy, and (per shivv23's matching update on libp2p/py-libp2p#1323) messageID derivation.
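
As a rough illustration of the segment-size policy described here (a 256 KiB default with a 1 MiB ceiling), a publisher-side split might look like the sketch below. The constant and function names are hypothetical and are not taken from the spec text or from py-libp2p.

```python
from typing import List

DEFAULT_SEGMENT_SIZE = 256 * 1024   # py-libp2p reference default noted in the draft
MAX_SEGMENT_SIZE = 1024 * 1024      # 1 MiB ceiling from the tentative answer


def split_payload(payload: bytes, segment_size: int = DEFAULT_SEGMENT_SIZE) -> List[bytes]:
    """Split a payload into consecutive segments of at most segment_size bytes."""
    if not 0 < segment_size <= MAX_SEGMENT_SIZE:
        raise ValueError("segment_size must be in (0, MAX_SEGMENT_SIZE]")
    # The final segment may be shorter; an empty payload yields no segments.
    return [payload[i:i + segment_size] for i in range(0, len(payload), segment_size)]
```

With the default size, a 600 KiB payload splits into three segments of 256 KiB, 256 KiB, and 88 KiB, and totalSegments would be the length of the returned list.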

…tion) Addresses the implementer-raised gap from py-libp2p#1323. Defines normative MUSTs/SHOULDs for per-peer caps, per-messageID memory bounds, timeouts, inconsistency handling, successful reassembly, and eviction. Promotes existing security mitigations from inferred to normative.
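
To make the kinds of bounds listed in that commit message concrete, here is a small receiver-side sketch of a bounded reassembly buffer. Everything in it is an assumption for illustration: the limit values, the class and method names, and the choice to key pending messages by (peer, messageID) are hypothetical and are not the normative rules of the draft.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional, Tuple

MAX_PENDING_PER_PEER = 8                    # hypothetical per-peer cap
MAX_BYTES_PER_MESSAGE = 16 * 1024 * 1024    # hypothetical per-messageID memory bound
REASSEMBLY_TIMEOUT_S = 30.0                 # hypothetical timeout


@dataclass
class _Pending:
    total_segments: int
    started_at: float
    segments: Dict[int, bytes] = field(default_factory=dict)
    size: int = 0


class ReassemblyBuffer:
    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        self._pending: Dict[Tuple[bytes, bytes], _Pending] = {}

    def _evict_expired(self) -> None:
        now = self._clock()
        expired = [k for k, p in self._pending.items()
                   if now - p.started_at > REASSEMBLY_TIMEOUT_S]
        for key in expired:
            del self._pending[key]

    def add_segment(self, peer: bytes, message_id: bytes, index: int,
                    total: int, payload: bytes) -> Optional[bytes]:
        """Store one segment; return the full payload once all segments arrive."""
        self._evict_expired()
        key = (peer, message_id)
        entry = self._pending.get(key)
        if entry is None:
            if total <= 0 or not 0 <= index < total:
                return None  # malformed header, nothing is buffered
            if sum(1 for p, _ in self._pending if p == peer) >= MAX_PENDING_PER_PEER:
                return None  # per-peer cap reached
            entry = self._pending[key] = _Pending(total, self._clock())
        if total != entry.total_segments or not 0 <= index < entry.total_segments:
            del self._pending[key]  # inconsistent totalSegments: drop the whole message
            return None
        if index in entry.segments:
            return None  # duplicate segment, ignore
        if entry.size + len(payload) > MAX_BYTES_PER_MESSAGE:
            del self._pending[key]  # memory bound exceeded
            return None
        entry.segments[index] = payload
        entry.size += len(payload)
        if len(entry.segments) == entry.total_segments:
            del self._pending[key]
            return b"".join(entry.segments[i] for i in range(entry.total_segments))
        return None
```

The injectable clock is only there so the timeout path can be exercised deterministically in tests; a real implementation would also need to decide how these drops feed into peer scoring, which the PR description says is evaluated at the reconstructed message level.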

gossipsub: add experimental large message segmentation draft

05a41ca

Co-authored-by: Cursor <cursoragent@cursor.com>

theUtkarshRaj mentioned this pull request May 6, 2026

[DMP 2026]: Gossipsub 1.4 Large Message Handling, Specification PR and Candidate Recommendation #1

Open

7 tasks

shivv23 mentioned this pull request May 8, 2026

feat: add Large Message Segmentation extension for GossipSub v1.3 libp2p/py-libp2p#1323

Open

4 tasks

spec: add security considerations, wire format details, and gossipsub…

f909f55

… interaction

theUtkarshRaj force-pushed the experimental/large-message-segmentation branch from 27507cd to f909f55 Compare May 8, 2026 11:48

shivv23 and others added 2 commits May 8, 2026 22:47

Merge pull request #1 from shivv23/align-field-number

afb88e1

align field number and naming with py-libp2p reference implementation

theUtkarshRaj wants to merge 5 commits into seetadev:master from theUtkarshRaj:experimental/large-message-segmentation

2 participants
