
Add inworld router #530

Draft

maxkahan wants to merge 101 commits into chore/audio-processing-v2 from add-inworld-router

Conversation

Contributor

@maxkahan maxkahan commented May 5, 2026

This pull request introduces asynchronous backend persistence for Stream Chat message syncing in the StreamConversation class, improving latency for voice and LLM-driven pipelines by dispatching REST calls as background tasks. It also ensures test and shutdown reliability by allowing callers to await completion of all in-flight syncs. Additionally, the changes update tests to await these background tasks and improve error handling and documentation.

Asynchronous backend persistence and ordering:

  • Refactored StreamConversation to dispatch Stream Chat syncs as fire-and-forget background tasks, serializing them with an asyncio.Lock to preserve message ordering and reduce latency on the critical path. Added a wait_for_pending_syncs() method to allow draining of in-flight tasks during tests and shutdown.
  • Updated the agent shutdown path to await pending conversation syncs, ensuring no in-flight writes are dropped.
  • Implemented a no-op wait_for_pending_syncs() in the base conversation class, with an override in StreamConversation for actual draining.
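
The pattern described above can be sketched roughly as follows. This is a minimal illustration, not the actual SDK code: the helper names (`_locked_sync`, `_sync_message`) and the message payload shape are assumptions; only `wait_for_pending_syncs()` is named in this PR.

```python
import asyncio


class StreamConversation:
    """Sketch of fire-and-forget syncing serialized by an asyncio.Lock."""

    def __init__(self) -> None:
        self._sync_lock = asyncio.Lock()           # serializes syncs to keep message order
        self._pending: set[asyncio.Task] = set()   # in-flight background sync tasks

    def add_message(self, message: dict) -> None:
        # Dispatch the REST call off the critical path instead of awaiting it.
        task = asyncio.create_task(self._locked_sync(message))
        self._pending.add(task)
        task.add_done_callback(self._pending.discard)

    async def _locked_sync(self, message: dict) -> None:
        # Tasks are created in call order and the lock wakes waiters FIFO,
        # so syncs reach the backend in the order messages were added.
        async with self._sync_lock:
            await self._sync_message(message)

    async def _sync_message(self, message: dict) -> None:
        await asyncio.sleep(0)  # stand-in for the Stream Chat REST call

    async def wait_for_pending_syncs(self) -> None:
        # Drain in-flight tasks; used by tests and the shutdown path.
        await asyncio.gather(*self._pending, return_exceptions=True)
```

Tests and the agent shutdown path call `await conversation.wait_for_pending_syncs()` so no in-flight write is dropped.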

Test reliability improvements:

  • Updated tests to await the in-flight background sync tasks (via wait_for_pending_syncs()) before making assertions, so results no longer depend on task scheduling.

Robustness and documentation:

  • Improved error handling for Stream API exceptions during sync.
  • Enhanced docstrings and comments for clarity on asynchronous behavior and usage.

Bug fixes and pipeline improvements:

  • Prevented empty final LLM responses from being persisted, avoiding conversation history corruption and downstream provider errors.
  • Ensured TTS pipeline always signals turn boundaries, preventing hangs in streaming scenarios.

Documentation update:

  • Expanded inworld/README.md to better describe plugin capabilities.

dangusev added 30 commits May 4, 2026 10:53
Stream is a sort of queue with __aiter__ which can be cleared or closed.
Clearing the Stream keeps iterators running but drops the queued data.
Closing it signals the running iterators to stop.
Also:
- openrouter llm no longer inherits from openai.LLM
dangusev and others added 24 commits May 4, 2026 10:56
- replace assert with ValueError in collect_simple_response
- add todos to fix some undefined behavior later
send_nowait() was infinitely accumulating the carryover buffer because of incorrect handling of 2-D NumPy arrays
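
A shape-agnostic way to split audio into frames, sketched below under the assumption that 2-D audio arrives as (channels, samples); the function name and shapes are illustrative, not the actual `send_nowait()` code. Slicing along the wrong axis of a 2-D array can leave the carryover the same size on every call, so it grows without bound; slicing along the last axis keeps it strictly smaller than one frame.

```python
import numpy as np


def split_frames(buffer: np.ndarray, frame_size: int):
    """Split a 1-D (samples,) or 2-D (channels, samples) buffer into
    whole frames plus a carryover remainder along the sample axis."""
    n = buffer.shape[-1]                      # sample count, regardless of rank
    usable = (n // frame_size) * frame_size   # largest whole-frame prefix
    frames = buffer[..., :usable]
    carryover = buffer[..., usable:]          # always < frame_size samples
    return frames, carryover
```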