[FEATURE] WebSocket-based Concurrency Architecture #239

rycerzes · 2025-12-07T20:28:21Z

Add WebSocket support with concurrent session management

Adds WebSocket endpoints for persistent environment sessions with configurable concurrency limits #194

High-level Diff

On the server side, the change looks like this:

- env = MyEnvironment()
  app = create_app(
-     env,
+     MyEnvironment,              # Pass class, not instance
      MyAction,
      MyObservation,
+     max_concurrent_envs=4,      # Allow 4 concurrent WebSocket sessions
  )

On the client side, only the URL needs to change:

from envs.echo_env import EchoEnv, EchoAction

- client = EchoEnv(base_url="http://localhost:8000")
+ client = EchoEnv(base_url="ws://localhost:8000/ws")

result = client.reset()
result = client.step(EchoAction(message="Hello!"))

# or async with
+ result = await client.reset()
+ result = await client.step(EchoAction(message="Hello!"))

This allows high concurrency with limited resources.

Changes

- WebSocket endpoint at /ws with message protocol for reset/step/state/close
- Factory pattern support: pass environment class instead of instance to create per-session environments
- ConcurrencyConfig for setting max concurrent sessions, timeout, and capacity behavior
- CONCURRENCY_SAFE flag on environments (defaults to False) with startup validation
- Session capacity tracking and error handling
- New client: WebSocketEnvClient for persistent connections
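To illustrate the reset/step/state/close protocol listed above, here is a minimal sketch of how a per-session handler might route incoming frames. The JSON shape (`type` and `data` keys), the `ToyEnv` class, and the `handle_message` function are all assumptions made for illustration; the PR's actual wire format and message classes may differ.

```python
import json
from typing import Any, Dict


class ToyEnv:
    """Hypothetical stand-in for an environment; not part of the PR."""

    def __init__(self) -> None:
        self.steps = 0

    def reset(self) -> Dict[str, Any]:
        self.steps = 0
        return {"observation": "ready", "done": False}

    def step(self, action: Dict[str, Any]) -> Dict[str, Any]:
        self.steps += 1
        return {"observation": action.get("message", ""), "done": False}

    def state(self) -> Dict[str, Any]:
        return {"step_count": self.steps}


def _error(message: str) -> Dict[str, Any]:
    return {"type": "error", "data": {"message": message}}


def handle_message(env: ToyEnv, raw: str) -> Dict[str, Any]:
    """Route one incoming text frame to the matching environment call."""
    try:
        msg = json.loads(raw)
    except json.JSONDecodeError:
        return _error("invalid JSON")
    if not isinstance(msg, dict):
        return _error("message must be a JSON object")

    kind = msg.get("type")  # assumed discriminator field
    if kind == "reset":
        return {"type": "observation", "data": env.reset()}
    if kind == "step":
        return {"type": "observation", "data": env.step(msg.get("data") or {})}
    if kind == "state":
        return {"type": "state", "data": env.state()}
    if kind == "close":
        return {"type": "closed", "data": {}}
    return _error(f"unknown message type: {kind!r}")
```

Because each session owns its own environment object, the handler needs no locking; the server would call it once per received frame and send the returned dict back as JSON.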

API

New types:

- ConcurrencyConfig(max_concurrent_envs, session_timeout_seconds, reject_on_capacity)
- SessionInfo and ServerCapacityStatus for session metadata
- WebSocket message types: WSResetMessage, WSStepMessage, WSStateMessage, WSCloseMessage
- Response types: WSObservationResponse, WSStateResponse, WSErrorResponse
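As a rough picture of the configuration type, the sketch below models the three fields named above as a frozen dataclass. Only the field names and the `max_concurrent_envs=1` default come from this PR description; the class name `ConcurrencyConfigSketch`, the field types, the other defaults, and the validation rules are assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ConcurrencyConfigSketch:
    """Illustrative stand-in for ConcurrencyConfig; not the PR's class."""

    max_concurrent_envs: int = 1                     # default stated in the PR
    session_timeout_seconds: Optional[float] = None  # assumed: None means no timeout
    reject_on_capacity: bool = True                  # assumed default

    def __post_init__(self) -> None:
        # Assumed sanity checks so a bad config fails at construction time.
        if self.max_concurrent_envs < 1:
            raise ValueError("max_concurrent_envs must be >= 1")
        timeout = self.session_timeout_seconds
        if timeout is not None and timeout <= 0:
            raise ValueError("session_timeout_seconds must be positive")
```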

Usage:

# Factory mode for concurrent sessions
app = create_app(
    env=MyEnvironment,  # Pass class, not instance
    max_concurrent_envs=4
)

Defaults to max_concurrent_envs=1 for backward compatibility. Environments must set CONCURRENCY_SAFE=True to allow higher concurrency.
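The startup check described here could look roughly like the following. This is a sketch under assumptions: `validate_concurrency`, `UnsafeEnv`, and `SafeEnv` are hypothetical names, and the real validation in the PR may raise a different exception or run at a different point.

```python
class UnsafeEnv:
    """Hypothetical environment that has not opted in."""
    CONCURRENCY_SAFE = False  # the default, per the PR description


class SafeEnv:
    """Hypothetical environment that declares itself safe."""
    CONCURRENCY_SAFE = True


def validate_concurrency(env_cls: type, max_concurrent_envs: int = 1) -> None:
    """Fail fast if more than one session is requested for an env that has not opted in."""
    if max_concurrent_envs < 1:
        raise ValueError("max_concurrent_envs must be >= 1")
    # Treat a missing attribute like the default (False).
    safe = getattr(env_cls, "CONCURRENCY_SAFE", False)
    if max_concurrent_envs > 1 and not safe:
        raise ValueError(
            f"{env_cls.__name__} does not set CONCURRENCY_SAFE=True; "
            f"refusing max_concurrent_envs={max_concurrent_envs}"
        )
```

With this shape, `validate_concurrency(UnsafeEnv)` passes because a single session is always allowed, while `validate_concurrency(UnsafeEnv, 4)` raises before the server accepts any connection.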

TODO

- Session timeout enforcement (tracked but not implemented)
- openenv init needs the WebSocket code integrated into the template
- Resource monitoring (memory/CPU per session)
- Connection queueing when reject_on_capacity=False
- Mark safe environments as CONCURRENCY_SAFE=True
- Update envs to support concurrency
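For the connection-queueing item, one possible approach is a small gate built on `asyncio.Semaphore`: reject immediately when full if `reject_on_capacity` is set, otherwise wait for a free slot. This is only a sketch of the idea, not the PR's implementation; `SessionGate` and `CapacityError` are hypothetical names.

```python
import asyncio


class CapacityError(RuntimeError):
    """Raised when the server is full and configured to reject."""


class SessionGate:
    """Limits the number of live sessions; create it inside a running event loop."""

    def __init__(self, max_concurrent_envs: int, reject_on_capacity: bool = True) -> None:
        self._sem = asyncio.Semaphore(max_concurrent_envs)
        self._reject = reject_on_capacity

    async def acquire(self) -> None:
        # locked() is True when no slot can be taken immediately.
        if self._reject and self._sem.locked():
            raise CapacityError("server at capacity")
        # In queueing mode this suspends until another session releases a slot.
        await self._sem.acquire()

    def release(self) -> None:
        self._sem.release()
```

A WebSocket handler would call `await gate.acquire()` before creating the per-session environment and `gate.release()` in a `finally` block when the connection closes.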

…erver capabilities
- Introduced WebSocketEnvClient for persistent sessions with multi-step interactions.
- Updated HTTPEnvServer to support WebSocket connections and manage multiple concurrent environments.
- Added WebSocket message types and responses for better communication.
- Enhanced Environment interface with concurrency safety attributes.

rycerzes · 2025-12-07T20:35:13Z

@burtenshaw here is the draft PR for the WebSocket and concurrency work. I have merged #238 into this as well.

A few notes before #232 gets merged:

- ~~openenv init generates boilerplate template according to the old structure~~
- openenv init needs the WebSocket code integrated into the template:
  - Add WebSocket client example/template
  - Update server templates to show WebSocket endpoint usage
  - Include documentation on CONCURRENCY_SAFE flag and concurrent sessions
- VectorEnv abstraction for batched operations inspired by Gymnasium

burtenshaw · 2025-12-08T12:07:36Z

Amazing work, @rycerzes. Thanks!

openenv init generates boilerplate template according to the old structure.

I'll integrate this in a new PR for you to merge here.

VectorEnv abstraction for batched operations inspired by Gymnasium

I think we can leave this for a subsequent PR.

Also, this env might be useful to you. It's basically just a benchmarking env that lets you test concurrency asynchronously like this.

burtenshaw · 2025-12-08T12:52:48Z

@rycerzes could you help me to understand this please:

openenv init generates boilerplate template according to the old structure.

What do you mean by old structure? AFAIK, in #232 openenv init generates a template whose structure corresponds to the branch, i.e. with imports from:

from openenv.core.env_server.interfaces import Environment
from openenv.core.env_server.types import State

rycerzes · 2025-12-08T13:22:34Z

@burtenshaw

Thanks for the clarification! You're absolutely right - I need to correct my earlier comment.

What do you mean by old structure? afaik #232 openenv init generates a template with a corresponding structure to the branch. i.e. from:
from openenv.core.env_server.interfaces import Environment
from openenv.core.env_server.types import State

I must have run openenv init from the main branch when I was testing, which would explain the confusion. The openenv init command on both the impl/concurrency branch and in #232 does generate the correct new structure with openenv.core imports.

I just verified this by running uv run openenv init test_env -o tests/ on the current branch, and it correctly generates all files with the new import structure. I have updated my above comment accordingly 👍

Also, this env might be useful to you. It's basically just a benchmarking env that lets you test concurrency asynchronously like this.

Thanks! That benchmark env would be perfect for testing the concurrency implementation. I'll take a look at it.
Apologies for the confusion on point 1!

Wauplin

Thanks for working on this very important piece @rycerzes! I've left quite a few comments on how I would do things, but some parts are left to the maintainers' decisions 🤗 Especially:

- Should we allow "instantiate a server by passing an env instead of an env factory" to keep backward compatibility? => I would say "no", since the project is still in an early phase.
- Should we maintain both an "HTTP-based interface" and a "WebSocket-based interface"? => Same, I would say "no", as it means doubling the amount of work (two paths in the HTTP server and two very similar clients to maintain, with the same interface but different internal logic). Better to keep only one interface that is more robust for the future. End users should not be impacted by this decision (except for the breaking change they need to adapt to).

Apart from that, I usually advise simplifying the logic by not adding too many optional features at first. More options usually mean more internal logic and a bigger maintenance burden in the long run. So if something is not explicitly required, let's keep it for later.

Note that I haven't run the code myself. Will give it a try soon!

Wauplin · 2025-12-08T13:54:54Z