Streaming Serialization Methods for Memory-Efficient Large Subtree Processing #73

freemans13 · 2025-12-12T23:14:08Z

Add Streaming Serialization Methods for Memory-Efficient Large Subtree Processing

Summary

Adds two new methods to SubtreeData that enable memory-efficient streaming serialization and deserialization of transaction data. This allows applications to process very large subtrees (1M+ transactions) without loading all transactions into memory at once.

Motivation

When processing large subtrees with millions of transactions, the existing Serialize() and serializeFromReader() methods require all transactions to be in memory simultaneously. For production workloads with 1M-transaction subtrees, this results in multi-GB memory usage per subtree.

With multiple subtrees being processed concurrently, memory consumption becomes prohibitive, leading to OOM issues in production.

Changes

New Methods

WriteTransactionsToWriter(w io.Writer, startIdx, endIdx int) error

Writes a specific range of transactions directly to a writer
Enables streaming serialization without buffering all transactions in memory
Transactions are written sequentially in the specified range
Validates that transactions are non-nil before writing
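
A minimal sketch of how a range writer of this kind might behave. It is not the code from this PR: `txStore`, `writeRange`, and `writeRangeToString` are hypothetical names, transactions are modeled as raw byte slices, and the range is assumed to be half-open (`startIdx` inclusive, `endIdx` exclusive), which the description above does not specify.

```go
package main

import (
	"bytes"
	"errors"
	"fmt"
	"io"
)

// txStore is a hypothetical stand-in for SubtreeData: a slice of raw
// transaction bytes in which a nil entry means "not loaded".
type txStore struct {
	txs [][]byte
}

var errNilTx = errors.New("transaction is nil")

// writeRange writes txs[startIdx:endIdx] to w in index order.
// Assumption: the range is half-open, [startIdx, endIdx).
func (s *txStore) writeRange(w io.Writer, startIdx, endIdx int) error {
	if startIdx < 0 || endIdx > len(s.txs) || startIdx > endIdx {
		return fmt.Errorf("invalid range [%d, %d) for %d transactions",
			startIdx, endIdx, len(s.txs))
	}
	for i := startIdx; i < endIdx; i++ {
		// Reject missing transactions before anything is written for them.
		if s.txs[i] == nil {
			return fmt.Errorf("index %d: %w", i, errNilTx)
		}
		if _, err := w.Write(s.txs[i]); err != nil {
			return fmt.Errorf("index %d: %w", i, err)
		}
	}
	return nil
}

// writeRangeToString is a demo helper that captures the written bytes.
func writeRangeToString(s *txStore, startIdx, endIdx int) (string, error) {
	var buf bytes.Buffer
	err := s.writeRange(&buf, startIdx, endIdx)
	return buf.String(), err
}

func main() {
	s := &txStore{txs: [][]byte{[]byte("aa"), []byte("bb"), nil, []byte("dd")}}

	out, err := writeRangeToString(s, 0, 2)
	fmt.Println(out, err)

	// Index 2 is nil, so this range is rejected part-way through.
	_, err = writeRangeToString(s, 1, 4)
	fmt.Println(err)
}
```

Because the writer is an `io.Writer`, the same call works against a file, a network connection, or a compressing wrapper without buffering the whole subtree first.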

ReadTransactionsFromReader(r io.Reader, startIdx, endIdx int) (int, error)

Reads a specific range of transactions from a reader
Enables chunked deserialization by reading only part of the data at a time
Validates that transaction hashes match the expected values from the subtree structure
Returns the number of transactions successfully read
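
The read side can be sketched in the same spirit. This is an illustration only, not the PR's implementation: `readRange`, `encodeRecords`, and `demoRead` are hypothetical names, records are assumed to be length-prefixed (real transactions use their own wire format), a single SHA-256 stands in for the transaction hash, and the range is assumed to be half-open.

```go
package main

import (
	"bytes"
	"crypto/sha256"
	"encoding/binary"
	"fmt"
	"io"
)

// readRange reads endIdx-startIdx length-prefixed records from r into
// txs[startIdx:endIdx], checking each against expected[i]. It returns the
// number of records read and verified before any error.
func readRange(r io.Reader, expected [][32]byte, txs [][]byte, startIdx, endIdx int) (int, error) {
	if startIdx < 0 || endIdx > len(expected) || endIdx > len(txs) || startIdx > endIdx {
		return 0, fmt.Errorf("invalid range [%d, %d)", startIdx, endIdx)
	}
	read := 0
	var lenBuf [4]byte
	for i := startIdx; i < endIdx; i++ {
		if _, err := io.ReadFull(r, lenBuf[:]); err != nil {
			return read, fmt.Errorf("index %d: reading length: %w", i, err)
		}
		// A production reader should cap this length before allocating.
		raw := make([]byte, binary.LittleEndian.Uint32(lenBuf[:]))
		if _, err := io.ReadFull(r, raw); err != nil {
			return read, fmt.Errorf("index %d: reading body: %w", i, err)
		}
		if sha256.Sum256(raw) != expected[i] {
			return read, fmt.Errorf("index %d: hash mismatch", i)
		}
		txs[i] = raw
		read++
	}
	return read, nil
}

// encodeRecords builds the length-prefixed stream that readRange consumes.
func encodeRecords(records [][]byte) []byte {
	var buf bytes.Buffer
	var lenBuf [4]byte
	for _, rec := range records {
		binary.LittleEndian.PutUint32(lenBuf[:], uint32(len(rec)))
		buf.Write(lenBuf[:])
		buf.Write(rec)
	}
	return buf.Bytes()
}

// demoRead reads three records, optionally corrupting the last one after
// the expected hashes have been computed.
func demoRead(corruptLast bool) (int, error) {
	records := [][]byte{[]byte("tx-0"), []byte("tx-1"), []byte("tx-2")}
	expected := make([][32]byte, len(records))
	for i, rec := range records {
		expected[i] = sha256.Sum256(rec)
	}
	if corruptLast {
		records[2] = []byte("bad!")
	}
	txs := make([][]byte, len(records))
	return readRange(bytes.NewReader(encodeRecords(records)), expected, txs, 0, len(records))
}

func main() {
	fmt.Println(demoRead(false))
	fmt.Println(demoRead(true))
}
```

Returning the count alongside the error lets a caller tell how far a chunk got before a short read or a hash mismatch, which is what makes a partial result usable.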

Use Case

These methods enable a chunked processing pattern:

// Writing (serialization)
for chunk in chunks {
    LoadTransactionsIntoChunk()
    WriteTransactionsToWriter(writer, chunkStart, chunkEnd)
    ProcessAndReleaseChunk()  // Free memory
}

// Reading (deserialization)
for chunk in chunks {
    ReadTransactionsFromReader(reader, chunkStart, chunkEnd)
    ProcessChunk()
    ReleaseChunk()  // Free memory
}
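
The chunked pattern above could be exercised end to end roughly as follows. This is a sketch under stated assumptions rather than the library's API: `chunkBounds` and `streamRoundTrip` are hypothetical helpers, records have a made-up fixed size of 8 bytes, and a `bytes.Buffer` stands in for the file or network stream.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
)

const txSize = 8 // hypothetical fixed record size, for illustration only

// chunkBounds splits [0, total) into half-open ranges of at most chunkSize.
func chunkBounds(total, chunkSize int) [][2]int {
	var bounds [][2]int
	if chunkSize <= 0 {
		return bounds
	}
	for start := 0; start < total; start += chunkSize {
		end := start + chunkSize
		if end > total {
			end = total
		}
		bounds = append(bounds, [2]int{start, end})
	}
	return bounds
}

// streamRoundTrip writes total records chunk by chunk, then reads them back
// chunk by chunk. It returns the number of records read and the largest
// number of records held in memory at once.
func streamRoundTrip(total, chunkSize int) (int, int, error) {
	if chunkSize <= 0 {
		return 0, 0, fmt.Errorf("chunkSize must be positive, got %d", chunkSize)
	}
	var sink bytes.Buffer // stands in for a file or network stream
	read, peak := 0, 0

	// Writing: build one chunk, write it out, then let it be collected.
	for _, b := range chunkBounds(total, chunkSize) {
		chunk := make([][]byte, 0, b[1]-b[0])
		for i := b[0]; i < b[1]; i++ {
			chunk = append(chunk, []byte(fmt.Sprintf("tx%06d", i)))
		}
		if len(chunk) > peak {
			peak = len(chunk)
		}
		for _, tx := range chunk {
			if _, err := sink.Write(tx); err != nil {
				return read, peak, err
			}
		}
	}

	// Reading: pull back exactly one chunk's worth of bytes at a time.
	for _, b := range chunkBounds(total, chunkSize) {
		buf := make([]byte, (b[1]-b[0])*txSize)
		if _, err := io.ReadFull(&sink, buf); err != nil {
			return read, peak, err
		}
		read += b[1] - b[0]
	}
	return read, peak, nil
}

func main() {
	fmt.Println(chunkBounds(10, 4))
	fmt.Println(streamRoundTrip(10, 4))
}
```

The point of the pattern is that peak memory tracks the chunk size rather than the subtree size, so the chunk size becomes the knob for trading memory against the number of write and read calls.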


github-actions · 2025-12-12T23:14:28Z

👋 Thanks, @freemans13!

This pull request comes from a fork. For security, our CI runs in a restricted mode.
A maintainer will triage this shortly and run any additional checks as needed.

🏷️ Labeled: fork-pr, requires-manual-review
👀 We'll review and follow up here if anything else is needed.

Thanks for contributing to bsv-blockchain/go-subtree! 🚀

mrz1836

Woot!

mrz1836

Few linter issues, see CI

mrz1836

LGTM - however since this was a fork, tests and dep audit checks were not run. They will be run upon merging this branch.

sonarqubecloud · 2025-12-12T23:51:31Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

freemans13 added 2 commits December 12, 2025 23:05

allow partial/chunked serialise/deserialise subtree using io.Reader a…

6d4a06e

…nd io.Writer

tests

9f821d9

freemans13 requested a review from mrz1836 as a code owner December 12, 2025 23:14

github-actions bot added the fork-pr (PR originated from a forked repository) and requires-manual-review (PR or issue requires manual review by a maintainer or security team) labels Dec 12, 2025

github-actions bot assigned mrz1836 Dec 12, 2025

freemans13 changed the title ~~Stu/subtree streaming~~ Streaming Serialization Methods for Memory-Efficient Large Subtree Processing Dec 12, 2025

mrz1836 approved these changes Dec 12, 2025


mrz1836 enabled auto-merge (squash) December 12, 2025 23:17

mrz1836 self-requested a review December 12, 2025 23:20

mrz1836 requested changes Dec 12, 2025


mrz1836 added the feature (Any new significant addition) label Dec 12, 2025

mrz1836 requested a review from galt-tr December 12, 2025 23:21

reduce code duplication

6270545

auto-merge was automatically disabled December 12, 2025 23:21
Head branch was pushed to by a user without write access

linting fixes

7aedb5a

mrz1836 self-requested a review December 12, 2025 23:33

mrz1836 approved these changes Dec 12, 2025


simpler implementation

ee99359

mrz1836 merged commit dd70219 into bsv-blockchain:master Dec 13, 2025
37 checks passed
