Skip to content

docs: Add payload size limit and deserialization timeout troubleshooting#597

Merged
muhsinking merged 1 commit intomainfrom
promptless/flash-payload-size-limit
Mar 26, 2026
Merged

docs: Add payload size limit and deserialization timeout troubleshooting#597
muhsinking merged 1 commit intomainfrom
promptless/flash-payload-size-limit

Conversation

@promptless
Copy link
Copy Markdown
Contributor

@promptless promptless bot commented Mar 26, 2026

Open this suggestion in Promptless to view citations and reasoning process

Documents the 10 MB payload size limit and 30-second deserialization timeout for Flash endpoints. Includes workarounds for large data (network volumes, compression) and explains base64 encoding overhead (~33% expansion).

Trigger Events

runpod/flash PR #289: fix: enforce payload size limit and timeout on deserialization

Promptless Research (5 files, 1 GitHub PR)

  • Reviewed runpod/flash#289
  • Viewed .long_term_context/product_knowledge/product_overview.md
  • Viewed .long_term_context/doc_workflow/client_instructions.md
  • Viewed flash/troubleshooting.mdx
  • Viewed flash/apps/requests.mdx
  • Viewed .long_term_context/style/client_style_guide.md

Agent Response

I've reviewed PR #289 and created a documentation suggestion.

Summary

PR #289 enforces two runtime protections for Flash endpoints:

  1. 10 MB payload size limit - Rejects oversized base64-encoded payloads before decoding
  2. 30-second deserialization timeout - Prevents malformed pickle streams from hanging workers

Documentation changes

I added two new troubleshooting entries to flash/troubleshooting.mdx:

Payload too large - Documents the 10 MB limit, explains that base64 encoding expands data by ~33% (so ~7.5 MB raw data hits the limit), and provides three workarounds:

  • Use network volumes for large data
  • Compress data before sending
  • Split large requests into chunks

Deserialization timeout - Documents the 30-second timeout for cloudpickle deserialization and explains that the Flash SDK handles serialization automatically.

These entries help users understand the limits before encountering errors and provide actionable solutions for working with large data like tensors, images, or model weights.

Suggestion: https://app.gopromptless.ai/suggestions/37b40b3d-c9db-459c-b510-940dd4f1bd5c


Tip: Planning a big docs refactor? Use Deep Analysis to get help with the heavy lifting 🔍


3. **Check argument types**: Input arguments must also be serializable.

### Payload too large
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Citation: Documents the MAX_PAYLOAD_SIZE (10 MB) limit and DESERIALIZE_TIMEOUT_SECONDS (30s) timeout enforced in src/runpod_flash/runtime/serialization.py. The new PayloadTooLargeError and DeserializeTimeoutError exceptions are defined in src/runpod_flash/runtime/exceptions.py.
View source

@mintlify
Copy link
Copy Markdown

mintlify bot commented Mar 26, 2026

Preview deployment for your docs. Learn more about Mintlify Previews.

Project Status Preview Updated (UTC)
runpod-docs 🟢 Ready View Preview Mar 26, 2026, 1:38 PM

@promptless
Copy link
Copy Markdown
Contributor Author

promptless bot commented Mar 26, 2026

Just a reminder: If you'd like me to act on any feedback you have via Github comments, just type @Promptless in your suggestion and I'll get right on it! (I won't show up in the user dropdown, but I'll process any request that has @Promptless in the comment body.)

@muhsinking muhsinking merged commit a245371 into main Mar 26, 2026
3 checks passed
@muhsinking muhsinking deleted the promptless/flash-payload-size-limit branch March 26, 2026 13:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant