feat(agui): Add multimodal input support (image/video/audio) to AguiM… #1380

Open
NoiAI wants to merge 1 commit into agentscope-ai:main from NoiAI:feat/agui-multimodal-v3

Conversation

NoiAI commented May 12, 2026

…essage and AguiMessageConverter

Adds support for multimodal input (image, video, audio, document) in the AG-UI extension, aligning with the AG-UI Protocol InputContent specification.

  • AguiMessage.content: String -> Object (backward compatible)
  • AguiMessageConverter: InputContent[] -> ContentBlock conversion
  • Supports both url and data (base64) source types
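The conversion described in the list above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the `Block` class is a hypothetical stand-in for the project's ContentBlock types, and the map keys (`type`, `text`, `source`, `value`) are assumed wire-format names that have not been checked against the AG-UI InputContent specification. It shows how a content field typed as `Object` might keep accepting a plain `String` while also accepting a list of typed parts with a `url` or `data` (base64) source.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class MultimodalContentSketch {

    /** Hypothetical stand-in for a converted block; not the AgentScope ContentBlock type. */
    static final class Block {
        final String kind;       // "text", "image", "video", "audio", ...
        final String sourceType; // "inline", "url", or "base64"
        final String value;      // text, URL, or base64 payload

        Block(String kind, String sourceType, String value) {
            this.kind = kind;
            this.sourceType = sourceType;
            this.value = value;
        }

        @Override
        public String toString() {
            return kind + "[" + sourceType + "]=" + value;
        }
    }

    /** Accepts a legacy String or a List of Map parts (assumed wire shape). */
    static List<Block> convert(Object content) {
        List<Block> blocks = new ArrayList<>();
        if (content == null) {
            return blocks;
        }
        if (content instanceof String) {
            // Backward-compatible path: plain text content still works.
            blocks.add(new Block("text", "inline", (String) content));
            return blocks;
        }
        if (content instanceof List) {
            for (Object part : (List<?>) content) {
                if (!(part instanceof Map)) {
                    throw new IllegalArgumentException("Unsupported content part: " + part);
                }
                blocks.add(convertPart((Map<?, ?>) part));
            }
            return blocks;
        }
        throw new IllegalArgumentException("Unsupported content type: " + content.getClass());
    }

    /** Maps one part to a Block; the key names here are assumptions. */
    private static Block convertPart(Map<?, ?> part) {
        String kind = String.valueOf(part.get("type"));
        if ("text".equals(kind)) {
            return new Block("text", "inline", String.valueOf(part.get("text")));
        }
        Object source = part.get("source");
        if (!(source instanceof Map)) {
            throw new IllegalArgumentException("Missing source for part of type " + kind);
        }
        Map<?, ?> src = (Map<?, ?>) source;
        String sourceType = String.valueOf(src.get("type"));
        String value = String.valueOf(src.get("value"));
        if ("url".equals(sourceType)) {
            return new Block(kind, "url", value);
        }
        if ("data".equals(sourceType)) {
            // "data" carries a base64-encoded payload.
            return new Block(kind, "base64", value);
        }
        throw new IllegalArgumentException("Unsupported source type: " + sourceType);
    }

    public static void main(String[] args) {
        Map<String, Object> source = new HashMap<>();
        source.put("type", "url");
        source.put("value", "https://example.com/cat.png");
        Map<String, Object> image = new HashMap<>();
        image.put("type", "image");
        image.put("source", source);

        List<Object> parts = new ArrayList<>();
        parts.add(image);

        System.out.println(convert("hello")); // legacy String content
        System.out.println(convert(parts));   // multimodal list content
    }
}
```

Branching on the runtime type of the content is one way to keep existing String-only callers working; the real converter may handle deserialization and validation differently.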

AgentScope-Java Version

[The version of AgentScope-Java you are working on, e.g. 1.0.12, check your pom.xml dependency version or run mvn dependency:tree | grep agentscope-parent:pom(only mac/linux)]

Description

[Please describe the background, purpose, changes made, and how to test this PR]

Checklist

Please check the following items before code is ready to be reviewed.

  • Code has been formatted with mvn spotless:apply
  • All tests are passing (mvn test)
  • Javadoc comments are complete and follow project conventions
  • Related documentation has been updated (e.g. links, examples, etc.)
  • Code is ready for review

NoiAI requested a review from a team May 12, 2026 04:38

CLAassistant commented May 12, 2026

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

NoiAI force-pushed the feat/agui-multimodal-v3 branch 2 times, most recently from 88419b9 to 2e87d78 on May 12, 2026 04:51

codecov Bot commented May 12, 2026

Codecov Report

❌ Patch coverage is 84.61538% with 8 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
| ...cope/core/agui/converter/AguiMessageConverter.java | 83.67% | 3 Missing and 5 partials ⚠️ |

NoiAI force-pushed the feat/agui-multimodal-v3 branch from 2e87d78 to 5d6cb4b on May 12, 2026 06:39
@zhengxudong

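This feature is very important to our project. We have tested it and it works correctly. We hope it can be merged as soon as possible. Thank you very much!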

3 participants