Feat/image input compression by afjcjsbx · Pull Request #2964 · sipeed/picoclaw

afjcjsbx · 2026-05-28T22:31:33Z

📝 Description

This PR adds configurable inbound image compression for PicoClaw's vision pipeline.

Previously, inbound images from channels were only constrained by max_media_size, with no configurable multi-level compression policy before building the model payload. This could lead to oversized inline image payloads and unnecessary pressure on multimodal providers.

With this change, PicoClaw now supports a new agents.defaults.image_input configuration block that allows:

- enabling or disabling automatic inline attachment of user images
- choosing a compression preset (off, low, balanced, aggressive, extreme)
- bounding inline payload size
- resizing images with max width / height limits
- tuning JPEG quality
- selecting the target output format (auto, jpeg, png)

The implementation also preserves the existing local path-tag behavior, so images remain accessible through file references while vision-capable providers can receive a compressed inline image payload when enabled.

🗣️ Type of Change

🐞 Bug fix (non-breaking change which fixes an issue)
✨ New feature (non-breaking change which adds functionality)
📖 Documentation update
⚡ Code refactoring (no functional changes, no api changes)

🤖 AI Code Generation

🤖 Fully AI-generated (100% AI, 0% Human)
🛠️ Mostly AI-generated (AI draft, Human verified/modified)
👨‍💻 Mostly Human-written (Human lead, AI assisted or none)

🔗 Related Issue

N/A

📚 Technical Context (Skip for Docs)

Reference URL: N/A
Reasoning: The previous inbound media flow did not provide a configurable compression strategy for user-supplied images before sending them to multimodal models. This PR introduces a production-ready, config-driven image optimization layer that reduces the risk of oversized payloads while preserving compatibility with the existing media resolution flow.

🧪 Test Environment

Hardware:
OS:
Model/Provider:
Channels:

📸 Evidence (Optional)


☑️ Checklist

My code/docs follow the style of this project.
I have performed a self-review of my own changes.
I have updated the documentation accordingly.


afjcjsbx added 11 commits May 23, 2026 09:23

Merge branch 'fix/seahorse-fresh-tail-budget'

848bf77

# Conflicts: # pkg/agent/pipeline_llm.go # pkg/agent/pipeline_setup.go # pkg/agent/turn_state.go

chore: move resolved upstream merge off main

fbea699

Merge pull request #1 from afjcjsbx/codex/resolve-main-upstream-merge

e95bcaf

Merge upstream/main into main

Merge remote-tracking branch 'upstream/main'

d48fa2e

Merge remote-tracking branch 'upstream/main'

239a98e

Merge remote-tracking branch 'upstream/main'

7be20bf

Merge remote-tracking branch 'upstream/main'

cfbddcd

Merge remote-tracking branch 'origin/main'

f5f6fdc

Merge remote-tracking branch 'upstream/main'

65c09d4

Merge remote-tracking branch 'upstream/main'

8e0964b

feat(agent): add configurable image input compression for vision payloads

1e371ab

afjcjsbx requested a review from alexhoshina May 28, 2026 22:32

afjcjsbx added 2 commits May 29, 2026 00:35

docs(agent): document configurable inbound image compression

05f5269

fix: resolve golangci-lint predeclared warning

0127cfe

This was referenced May 29, 2026

🦞 OpenClaw Ecosystem Daily 2026-05-29 zx0828/big_model_radar#76

Open

🦞 OpenClaw Ecosystem Daily 2026-05-29 JohnGao818/big_model_radar#26

Open

afjcjsbx added 9 commits May 29, 2026 12:32

fix(config): persist explicit image_input.attach_user_images=false across save/load

5275a2e

fix(agent): skip inline upload for invalid decoded images

a988651

fix(config): keep safe default caps for image_input compression off

b9dd3ea

fix(agent): allow compressing oversized source images for inline vision

6c56ca9

Merge remote-tracking branch 'upstream/main' into feat/image-input-compression

2449541

fix(agent): preserve original inline PNGs when auto mode already fits

474aa56

fix(lint): resolve golines and shadow warnings in image input config

ba0931c

fix lint

293800a

fix(config): restructure image input env tags to satisfy golines

26ae76f

afjcjsbx wants to merge 22 commits into sipeed:main from afjcjsbx:feat/image-input-compression
