Support for Redis-based session store to run across multiple nodes by dangusev · Pull Request #374 · GetStream/Vision-Agents

dangusev · 2026-02-25T14:54:00Z

Added SessionRegistry to store agent sessions
Added in-memory and Redis-based backends for SessionRegistry
[breaking] updated HTTP API urls to always include call_id for easy access in permission callbacks
[breaking] removed "get_current_user" permission callback as redundant

Summary by CodeRabbit

New Features
- Session registry for distributed session management with session metrics and TTL.
- Optional Redis-backed session persistence and in-memory session store.
Breaking Changes
- Session endpoints moved to /calls/{call_id}/sessions; call_id is now a path parameter.
- start session no longer expects call_id in the request body.
- Close operations now return HTTP 202.
- Permission callbacks are per-call (receive call_id); readiness endpoint returns 400 when not ready.

coderabbitai · 2026-02-25T14:54:24Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds a stateless SessionRegistry with TTL-backed SessionKVStore implementations (in-memory and Redis), integrates the registry into AgentLauncher with a maintenance loop and registry-driven session lifecycle, moves HTTP session endpoints under /calls/{call_id}/..., and updates permission/dependency APIs to be call-scoped.

Changes

Cohort / File(s)	Summary
Dependencies `agents-core/pyproject.toml`, `pyproject.toml`	Added optional `redis[hiredis]>=5.0.0` and dev dependency `testcontainers[redis]>=4.0.0`.
Session Registry Core `agents-core/vision_agents/core/agents/session_registry/store.py`, `.../types.py`, `.../in_memory_store.py`, `.../redis_store.py`, `.../registry.py`, `.../__init__.py`	New abstract `SessionKVStore`, `SessionInfo` dataclass, in-memory and Redis store implementations, and `SessionRegistry` providing register/remove/refresh/request_close/get/get_for_call/update_metrics. Redis export is conditional.
AgentLauncher Integration `agents-core/vision_agents/core/agents/agent_launcher.py`, `agents-core/vision_agents/core/__init__.py`, `agents-core/vision_agents/core/agents/__init__.py`	AgentLauncher now accepts an optional `SessionRegistry`, renamed cleanup->maintenance interval, adds `_maintenance_loop` with TTL refresh and registry-driven close handling, updates start/stop/start_session/close_session flows, and exposes request_close_session/get_session_info APIs; package exports updated.
HTTP API Restructuring `agents-core/vision_agents/core/runner/http/api.py`, `dependencies.py`, `models.py`, `options.py`, `runner.py`	Session endpoints moved under `/calls/{call_id}/...`; endpoint signatures accept `call_id`; removed `get_current_user` and per-request session dependency; permission callbacks became call-scoped; `StartSessionRequest` no longer requires `call_id`; readiness behavior adjusted.
Examples `examples/08_agent_server_example/README.md`	Example updated to use path-scoped `call_id`, replaced `get_current_user` with `verify_token`, and adjusted permission callback signatures and example endpoints.
Tests — Runner & Launcher `tests/test_agents/test_runner.py`, `tests/test_agents/test_agent_launcher.py`	Tests updated for new `/calls/{call_id}/...` routing and per-call permissions; added storage-backed AgentLauncher tests exercising both in-memory and Redis registries and maintenance_interval naming.
Tests — Session Registry Stores `tests/test_agents/test_session_registry/test_in_memory_store.py`, `.../test_redis_store.py`, `.../test_registry.py`	Added comprehensive tests for InMemorySessionKVStore, RedisSessionKVStore, and SessionRegistry behavior (set/get/mset/expire/keys/delete, TTL, close-requests, metrics, registry lifecycle).

Sequence Diagram

sequenceDiagram
    participant Client as Client
    participant API as HTTP API
    participant Launcher as AgentLauncher
    participant Registry as SessionRegistry
    participant Store as SessionKVStore

    Client->>API: POST /calls/{call_id}/sessions
    API->>Launcher: start_session(call_id, ...)
    Launcher->>Launcher: create AgentSession
    Launcher->>Registry: register(call_id, session_id)
    Registry->>Store: set sessions/{call_id}/{session_id} (ttl)
    Registry->>Store: set call_sessions/{call_id}/{session_id} (ttl)
    Store-->>Registry: OK
    Registry-->>Launcher: OK
    Launcher-->>API: session_id
    API-->>Client: 201 Created

    loop Maintenance Loop
        Launcher->>Registry: get_close_requests({call_id: session_id...})
        Registry->>Store: scan close_requests/{call_id}/...
        Store-->>Registry: [session_ids]
        Registry-->>Launcher: [session_ids]
        Launcher->>Launcher: close local sessions for returned ids
        Launcher->>Registry: refresh({call_id: session_id...})
        Registry->>Store: expire(...keys, ttl)
        Store-->>Registry: OK
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

Agent HTTP server #284 — touches session-management surfaces (AgentLauncher and HTTP runner) and appears to be an earlier related refactor of session flows and endpoints.

Suggested reviewers

Nash0x7E2

Poem

I set the registry like a pulse beneath the skin—
small keys that hold the names of rooms and hours.
Nodes exhale their heartbeats into cold storage,
a ledger of arrivals and the slow evaporation of time,
the idle sessions folding like paper in a drawer.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 37.67% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title clearly summarizes the main change: adding Redis-based session store support for multi-node operation, which aligns with the substantial refactoring of SessionRegistry, HTTP API paths, and session management across the codebase.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feature/agent-coordinator

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 7

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

agents-core/vision_agents/core/runner/http/options.py (1)
7-8: ⚠️ Potential issue | 🟡 Minor

Add explicit type parameters to Callable annotations.

Lines 44–47 use bare Callable without type parameters, which violates the coding guideline requiring full Callable signatures. Change to Callable[..., Any]:
Typing fix
-    can_start_session: Callable = allow_all
-    can_close_session: Callable = allow_all
-    can_view_session: Callable = allow_all
-    can_view_metrics: Callable = allow_all
+    can_start_session: Callable[..., Any] = allow_all
+    can_close_session: Callable[..., Any] = allow_all
+    can_view_session: Callable[..., Any] = allow_all
+    can_view_metrics: Callable[..., Any] = allow_all
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/runner/http/options.py` around lines 7 - 8,
Update any bare Callable annotations in this module to include explicit type
parameters (use Callable[..., Any]); locate functions/types that currently use
plain Callable (e.g., annotations referenced around allow_all and the Callable
usages on lines ~44–47) and change them to Callable[..., Any], ensuring Any is
imported from typing if not already present so the annotations are fully typed.
agents-core/vision_agents/core/runner/http/api.py (1)
75-86: ⚠️ Potential issue | 🟠 Major

Remove broad exception catch and avoid exposing exception details in HTTP responses.

The except Exception as e: clause violates the Python coding guideline (catch specific exceptions only) and leaks internal error details to clients. Since start_session() documents that it raises only SessionLimitExceeded and its subclasses—which are already caught on line 80—the broad catch is unnecessary. Remove it and let unexpected errors bubble up as 500 responses without exposing exception text.
🛠️ Proposed fix
-    except Exception as e:
-        logger.exception("Failed to start agent")
-        raise HTTPException(
-            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
-            detail=f"Failed to start agent: {str(e)}",
-        ) from e
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/runner/http/api.py` around lines 75 - 86,
Remove the broad "except Exception as e" block so unexpected errors from
launcher.start_session bubble up instead of being caught and returning internal
exception text to clients; keep the specific "except SessionLimitExceeded as e"
handler. In practice, delete the general except block that calls
logger.exception and raises HTTPException with f"Failed to start agent:
{str(e)}", leaving only the try + SessionLimitExceeded except so Python/ASGI
will produce a 500 without exposing the exception details.

🧹 Nitpick comments (14)

examples/08_agent_server_example/README.md (1)

181-184: Use call_id in the auth example to avoid overly broad authorization.

On Line 181, call_id is accepted but never enforced. In a multi-call setup, this example can imply global permission is enough. Please show call-level authorization in the sample.

Suggested doc example adjustment

 async def can_start_session(call_id: str, token_payload=Depends(verify_token)):
     """Check if the caller has permission to start agent sessions."""
-    if "agents:start" not in token_payload.get("permissions", []):
+    allowed_calls = token_payload.get("call_ids", [])
+    if "agents:start" not in token_payload.get("permissions", []) or call_id not in allowed_calls:
         raise HTTPException(status_code=403, detail="Insufficient permissions")

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@examples/08_agent_server_example/README.md` around lines 181 - 184, The
can_start_session function accepts call_id but doesn't enforce call-level
authorization; update can_start_session to check that the token_payload grants
permission for this specific call_id (e.g., look for a scoped permission like
f"agents:start:{call_id}" or verify call_id is present in
token_payload.get("allowed_call_ids", [])) instead of only checking the global
"agents:start" permission; use the existing token_payload and call_id parameters
and raise HTTPException(403) if the scoped check fails so the example
demonstrates per-call authorization.

agents-core/vision_agents/core/agents/session_registry/redis_storage.py (2)

48-52: Move private helpers after public feature methods.

_prefixed and _strip_prefix are private helpers but are placed before the public lifecycle methods start/close. Per the project's method-ordering convention (__init__ → public lifecycle → properties → public feature methods → private helpers → dunders), they should move after delete/publish/subscribe.

As per coding guidelines: "Order class methods as: __init__, public lifecycle methods, properties, public feature methods, private helpers, dunder methods."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`
around lines 48 - 52, Move the private helper methods _prefixed and
_strip_prefix so they appear after the public feature methods (delete, publish,
subscribe) and after any properties, but before any dunder methods; ensure the
class method order follows the project's convention: __init__, public lifecycle
methods (start/close), properties, public feature methods
(delete/publish/subscribe), then private helpers (_prefixed/_strip_prefix).
54-64: Replace the inspect.iscoroutine guard with a plain await.

redis.asyncio.Redis.ping() always returns a coroutine in the asyncio context; the guard is redundant and obscures intent. Remove the now-unused import inspect as well.
♻️ Proposed simplification
 async def start(self) -> None:
     """Open the Redis connection and verify it with a PING."""
-    # Handle non-specific Union return type here
-    ping = self._redis.ping()
-    if inspect.iscoroutine(ping):
-        await ping
+    await self._redis.ping()

     connection_kwargs = self._redis.connection_pool.connection_kwargs
Remove the now-unused import inspect from line 1.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`
around lines 54 - 64, In the start method of RedisSessionKVStore, replace the
inspect.iscoroutine guard around self._redis.ping() with a direct await (i.e.,
await self._redis.ping()) and remove the now-unused import inspect; update the
start function to always await the ping coroutine and delete the inspect import
from the module top.

agents-core/vision_agents/core/agents/session_registry/storage.py (1)

27-46: Move __aenter__/__aexit__ to the end of the class body.

Per the project's ordering convention, dunder methods come last. Currently __aenter__/__aexit__ appear between the public lifecycle methods and the abstract feature methods, which violates the guideline.

As per coding guidelines: "Order class methods as: __init__, public lifecycle methods, properties, public feature methods, private helpers, dunder methods."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/storage.py` around
lines 27 - 46, Move the async context-manager dunders to the end of the class:
relocate __aenter__ and __aexit__ so they appear after the public lifecycle
methods (start, close), properties, public feature methods and private helpers —
i.e., place __aenter__ and __aexit__ at the end of the class body to follow the
project's ordering convention; keep their implementations unchanged and ensure
they still call await self.start() and await self.close() respectively.

agents-core/vision_agents/core/runner/http/dependencies.py (1)

7-16: Add return-type annotations and docstrings to the permission stubs.

All four stubs are public API hooks that users override; they currently have no return type annotation (implicitly None) and no docstrings. Per Google-style guidelines, public callables should carry at minimum a one-liner docstring and an explicit return type.

✏️ Suggested addition

-def can_start_session(call_id: str): ...
+def can_start_session(call_id: str) -> None:
+    """Permission hook called before a session is started for the given call."""
+    ...


-def can_close_session(call_id: str): ...
+def can_close_session(call_id: str) -> None:
+    """Permission hook called before a session is closed for the given call."""
+    ...


-def can_view_session(call_id: str): ...
+def can_view_session(call_id: str) -> None:
+    """Permission hook called before session info is retrieved for the given call."""
+    ...


-def can_view_metrics(call_id: str): ...
+def can_view_metrics(call_id: str) -> None:
+    """Permission hook called before metrics are retrieved for the given call."""
+    ...

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/runner/http/dependencies.py` around lines 7 -
16, The four permission stub functions (can_start_session, can_close_session,
can_view_session, can_view_metrics) lack return-type annotations and docstrings;
update each signature to return bool (e.g. def can_start_session(call_id: str)
-> bool:) and add a one-line Google-style docstring describing the purpose, the
call_id parameter, and that it returns True if the action is permitted (False
otherwise), so users overriding these public hooks have explicit types and
minimal documentation.

tests/test_agents/test_session_registry/test_in_memory_store.py (1)

9-16: Add a return type annotation to the async fixture.

✏️ Suggested annotation

+from collections.abc import AsyncIterator
+
 `@pytest.fixture`()
-async def store():
+async def store() -> AsyncIterator[InMemorySessionKVStore]:

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_in_memory_store.py` around lines
9 - 16, The async pytest fixture "store" should declare its return type to make
typing clear: change "async def store()" to "async def store() ->
AsyncIterator[InMemorySessionKVStore]"; add the needed import "from typing
import AsyncIterator" at the top of the test file; keep the body using
InMemorySessionKVStore, await s.start(), yield s, and await s.close() as-is.

agents-core/vision_agents/core/agents/agent_launcher.py (3)

306-317: Silent RuntimeError swallow in task finalizer is acceptable but worth a debug log.

When the event loop is gone, registry.remove() will never fire — the session lingers until TTL expiry. Consider adding a logger.debug so this isn't completely invisible.

♻️ Suggested improvement

                 try:
                     asyncio.get_running_loop().create_task(
                         self._registry.remove(session_id_)
                     )
                 except RuntimeError:
-                    pass
+                    logger.debug(
+                        "No running loop; registry removal of %s deferred to TTL",
+                        session_id_,
+                    )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 306 -
317, The finalizer currently swallows RuntimeError silently when
asyncio.get_running_loop() fails; update the except block in _finalizer to log a
debug message including session_id_ and call_id_ (and optionally the exception
message) to indicate that self._registry.remove(session_id_) couldn't be
scheduled and the session will expire by TTL; use the module/class logger (e.g.,
logger.debug) so this situation is visible during debugging while preserving the
existing behavior.

462-478: Broad except Exception in maintenance path.

Lines 476 and 498 catch all exceptions. The guideline calls for specific exception handling. For a background resilience loop, this is a common tradeoff, but consider catching a narrower set (e.g., OSError, ConnectionError, redis.RedisError) to avoid swallowing unexpected programming errors like TypeError or KeyError.

As per coding guidelines, "Never write except Exception as e - use specific exception handling".

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 462 -
478, Replace the broad "except Exception" in the maintenance loop around the
call to self._registry.get_close_requests and the processing of self._sessions
with a narrow exception handler that only catches expected runtime/IO errors
(for example: OSError, ConnectionError and your Redis client error type such as
redis.RedisError) rather than all exceptions; update the except clause to
"except (OSError, ConnectionError, redis.RedisError) as e:" (add the redis
import if needed) and keep the existing logger.exception call to record the
error, so programming errors like TypeError/KeyError still surface while
transient registry/IO errors are handled gracefully.

493-499: Same broad catch pattern in _refresh_active_sessions.

Same guideline concern as above — consider narrowing the exception type to connection/IO errors.

As per coding guidelines, "Never write except Exception as e - use specific exception handling".

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 493 -
499, Narrow the broad except in _refresh_active_sessions: catch only
connection/IO-related errors that can occur when calling self._registry.refresh
(e.g., ConnectionError, asyncio.TimeoutError, and any HTTP client errors used in
this project such as aiohttp.ClientError), log the caught exception with
logger.exception including the exception variable (e) and context, and let
unexpected exceptions propagate (or re-raise) instead of swallowing all
Exception types; keep sessions_map and the call to self._registry.refresh as-is.

tests/test_agents/test_session_registry/test_redis_store.py (2)

7-9: Prefer importing from the public re-export.

Same as noted in test_agent_launcher.py — RedisSessionKVStore is re-exported from vision_agents.core.agents.session_registry.

♻️ Suggested fix

-from vision_agents.core.agents.session_registry.redis_storage import (
-    RedisSessionKVStore,
-)
+from vision_agents.core.agents.session_registry import RedisSessionKVStore

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_redis_store.py` around lines 7 -
9, The test currently imports RedisSessionKVStore from its internal module;
update the import to use the public re-export instead by importing
RedisSessionKVStore from vision_agents.core.agents.session_registry so the test
relies on the public API rather than the internal module.

188-194: Accessing private registry._store couples the test to internal implementation.

Consider creating a standalone SessionRegistry with ttl=1.0 using the redis_store fixture directly, rather than reaching into the private _store attribute.

♻️ Suggested refactor

     async def test_session_expires_without_refresh(
-        self, registry: SessionRegistry
+        self, redis_store: RedisSessionKVStore
     ) -> None:
-        short_registry = SessionRegistry(store=registry._store, ttl=1.0)
+        short_registry = SessionRegistry(store=redis_store, ttl=1.0)
         await short_registry.register("sess-expire", "call-e")
         await asyncio.sleep(1.5)
         assert await short_registry.get("sess-expire") is None

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_redis_store.py` around lines 188
- 194, The test test_session_expires_without_refresh currently accesses the
private attribute registry._store; instead, instantiate a standalone
SessionRegistry using the provided redis_store fixture (e.g., short_registry =
SessionRegistry(store=redis_store, ttl=1.0)) so the test depends only on the
public API; update the test to register `"sess-expire"` on short_registry, await
asyncio.sleep(1.5), and assert short_registry.get("sess-expire") is None,
removing any access to registry._store.

tests/test_agents/test_agent_launcher.py (2)

15-21: Prefer importing from the public re-export rather than internal submodules.

InMemorySessionKVStore, RedisSessionKVStore, and SessionKVStore are all re-exported from vision_agents.core.agents.session_registry (and even from vision_agents.core). Using the public path keeps tests resilient to internal restructuring.
♻️ Suggested imports
-from vision_agents.core.agents.session_registry.in_memory_storage import (
-    InMemorySessionKVStore,
-)
-from vision_agents.core.agents.session_registry.redis_storage import (
-    RedisSessionKVStore,
-)
-from vision_agents.core.agents.session_registry.storage import SessionKVStore
+from vision_agents.core.agents.session_registry import (
+    InMemorySessionKVStore,
+    RedisSessionKVStore,
+    SessionKVStore,
+)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_agent_launcher.py` around lines 15 - 21, The test
imports use internal submodule paths for InMemorySessionKVStore,
RedisSessionKVStore, and SessionKVStore; update the import statements to use the
public re-export (e.g., import these three symbols from
vision_agents.core.agents.session_registry or the top-level vision_agents.core
re-export) so the test relies on the public API rather than internal modules
(replace the three imports referencing session_registry.in_memory_storage,
session_registry.redis_storage, and session_registry.storage with a single
public import of InMemorySessionKVStore, RedisSessionKVStore, and
SessionKVStore).
63-68: Consider marking Redis-dependent fixtures/tests with @pytest.mark.integration.

The redis_url fixture spins up a real Redis container. Per coding guidelines, integration tests should be decorated with @pytest.mark.integration. Since the storage fixture unconditionally depends on redis_url, even the "memory" param triggers a container startup. Consider either:

Adding @pytest.mark.integration to TestAgentLauncherWithStorage, or

Splitting the fixture so the memory variant doesn't require redis_url.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_agent_launcher.py` around lines 63 - 68, The redis_url
fixture launches a real Redis container and causes Redis to start even for
memory-backed runs; update tests so Redis-dependent runs are marked or the
fixture is split: either add `@pytest.mark.integration` to
TestAgentLauncherWithStorage (so the whole class is treated as integration) or
refactor the storage fixture to conditionally require redis_url only when
storage param != "memory" (ensure the storage fixture checks the param and only
references redis_url when needed, leaving memory cases free of container
startup).

agents-core/vision_agents/core/agents/session_registry/registry.py (1)

81-95: update_metrics uses a non-atomic read-modify-write.

In a multi-node Redis deployment, concurrent update_metrics calls for the same session could cause a lost update. This is likely safe if only the owning node updates metrics (as the docstring implies), but worth noting for future awareness if the contract changes.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py` around
lines 81 - 95, The current update_metrics function performs a non-atomic
read-modify-write using self._store.get and self._store.set which can cause lost
updates under concurrent calls; change update_metrics (the method named
update_metrics that reads the key f"sessions/{session_id}" and writes metrics
and metrics_updated_at) to perform the update atomically — either use your store
client's optimistic locking (WATCH/MULTI/EXEC) around the key, or run the update
in a single server-side operation (e.g., a Redis EVAL/Lua script or an atomic
JSON/SET command) so that writing "metrics" and "metrics_updated_at" to the
sessions/{session_id} value cannot be interleaved by other updaters.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py`:
- Around line 115-124: In subscribe(), the generator finalizer can raise
KeyError if close() cleared self._subscribers; modify the finally block in
subscribe(self, channel: str) so it checks for the channel and queue presence
before removing (e.g., use self._subscribers.get(channel) and remove if present,
or guard with "if channel in self._subscribers and queue in
self._subscribers[channel]" and only then remove and possibly delete the empty
list) to avoid accessing a missing key on cleanup.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`:
- Around line 75-80: In mset, remove the erroneous await on pipeline command
calls so commands are queued instead of awaited: inside the async def mset(self,
items...) loop call pipe.set(self._prefixed(key), value, px=int(ttl * 1000)) (no
await) for each item and leave only a single await pipe.execute() after the
loop; this fixes improper use of await with redis-py asyncio pipeline methods.

In `@agents-core/vision_agents/core/agents/session_registry/storage.py`:
- Around line 45-46: Change the async context-manager exit method to use the
standard three-argument signature rather than a catch-all tuple: update the
method definition for __aexit__ to async def __aexit__(self, exc_type, exc_val,
exc_tb) -> None so static analyzers and editors can recognize the expected
parameters; keep calling await self.close() inside the method and preserve the
existing return behavior (None/False) so exceptions continue to propagate as
before.
- Around line 154-171: The abstract method subscribe is declared as a sync def
but all implementations (InMemorySessionKVStore.subscribe and
RedisSessionKVStore.subscribe) are async generator functions; change the
abstract signature to "async def subscribe(self, channel: str) ->
AsyncIterator[bytes]:" so the abstract method matches the implementations and
type checkers (mypy/pyright) recognize it as an async generator returning
AsyncIterator[bytes]; update the docstring if needed but keep the same semantics
and return type.

In `@agents-core/vision_agents/core/runner/http/api.py`:
- Around line 110-117: Handlers call launcher.get_session_info(session_id) then
operate on the session without verifying the session actually belongs to the
provided call_id; update each handler (the ones that call
launcher.get_session_info and then call launcher.request_close_session or
launcher.request_read_session) to check that the returned session info has the
same call_id as the path param (e.g., compare info.call_id or
info.session_call_id to the call_id arg) and if it doesn’t match raise an
HTTPException (404 or 403) before calling
launcher.request_close_session/request_read_session; apply this check to the
close and read handlers that use
get_session_info/request_close_session/request_read_session so callers cannot
close/read sessions from other calls.

In `@tests/test_agents/test_session_registry/test_in_memory_store.py`:
- Around line 32-35: Tests like test_ttl_expiry (and the related
test_expire_refreshes_ttl and test_expire_multiple_keys) are timing-sensitive
and can flake on busy CI; update these tests in
tests/test_agents/test_session_registry/test_in_memory_store.py to widen the
expiry margin by reducing the TTL and increasing the post-set sleep (for example
change ttl from 0.5 to 0.1 and sleep from 0.7 to 0.3) or otherwise increase the
sleep-to-ttl ratio (ensure sleep >= ttl * 2–3) so the assertion checking
InMemorySessionKVStore.get("...") being None is robust on loaded runners.

In `@tests/test_agents/test_session_registry/test_redis_store.py`:
- Line 44: The two test classes that require Redis, TestRedisSessionKVStore and
TestSessionRegistryWithRedis, must be marked as integration tests; add the
`@pytest.mark.integration` decorator above each class definition and ensure pytest
is imported (add "import pytest" at the top if missing) so the decorator is
available.

---

Outside diff comments:
In `@agents-core/vision_agents/core/runner/http/api.py`:
- Around line 75-86: Remove the broad "except Exception as e" block so
unexpected errors from launcher.start_session bubble up instead of being caught
and returning internal exception text to clients; keep the specific "except
SessionLimitExceeded as e" handler. In practice, delete the general except block
that calls logger.exception and raises HTTPException with f"Failed to start
agent: {str(e)}", leaving only the try + SessionLimitExceeded except so
Python/ASGI will produce a 500 without exposing the exception details.

In `@agents-core/vision_agents/core/runner/http/options.py`:
- Around line 7-8: Update any bare Callable annotations in this module to
include explicit type parameters (use Callable[..., Any]); locate
functions/types that currently use plain Callable (e.g., annotations referenced
around allow_all and the Callable usages on lines ~44–47) and change them to
Callable[..., Any], ensuring Any is imported from typing if not already present
so the annotations are fully typed.

---

Nitpick comments:
In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 306-317: The finalizer currently swallows RuntimeError silently
when asyncio.get_running_loop() fails; update the except block in _finalizer to
log a debug message including session_id_ and call_id_ (and optionally the
exception message) to indicate that self._registry.remove(session_id_) couldn't
be scheduled and the session will expire by TTL; use the module/class logger
(e.g., logger.debug) so this situation is visible during debugging while
preserving the existing behavior.
- Around line 462-478: Replace the broad "except Exception" in the maintenance
loop around the call to self._registry.get_close_requests and the processing of
self._sessions with a narrow exception handler that only catches expected
runtime/IO errors (for example: OSError, ConnectionError and your Redis client
error type such as redis.RedisError) rather than all exceptions; update the
except clause to "except (OSError, ConnectionError, redis.RedisError) as e:"
(add the redis import if needed) and keep the existing logger.exception call to
record the error, so programming errors like TypeError/KeyError still surface
while transient registry/IO errors are handled gracefully.
- Around line 493-499: Narrow the broad except in _refresh_active_sessions:
catch only connection/IO-related errors that can occur when calling
self._registry.refresh (e.g., ConnectionError, asyncio.TimeoutError, and any
HTTP client errors used in this project such as aiohttp.ClientError), log the
caught exception with logger.exception including the exception variable (e) and
context, and let unexpected exceptions propagate (or re-raise) instead of
swallowing all Exception types; keep sessions_map and the call to
self._registry.refresh as-is.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`:
- Around line 48-52: Move the private helper methods _prefixed and _strip_prefix
so they appear after the public feature methods (delete, publish, subscribe) and
after any properties, but before any dunder methods; ensure the class method
order follows the project's convention: __init__, public lifecycle methods
(start/close), properties, public feature methods (delete/publish/subscribe),
then private helpers (_prefixed/_strip_prefix).
- Around line 54-64: In the start method of RedisSessionKVStore, replace the
inspect.iscoroutine guard around self._redis.ping() with a direct await (i.e.,
await self._redis.ping()) and remove the now-unused import inspect; update the
start function to always await the ping coroutine and delete the inspect import
from the module top.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py`:
- Around line 81-95: The current update_metrics function performs a non-atomic
read-modify-write using self._store.get and self._store.set which can cause lost
updates under concurrent calls; change update_metrics (the method named
update_metrics that reads the key f"sessions/{session_id}" and writes metrics
and metrics_updated_at) to perform the update atomically — either use your store
client's optimistic locking (WATCH/MULTI/EXEC) around the key, or run the update
in a single server-side operation (e.g., a Redis EVAL/Lua script or an atomic
JSON/SET command) so that writing "metrics" and "metrics_updated_at" to the
sessions/{session_id} value cannot be interleaved by other updaters.

In `@agents-core/vision_agents/core/agents/session_registry/storage.py`:
- Around line 27-46: Move the async context-manager dunders to the end of the
class: relocate __aenter__ and __aexit__ so they appear after the public
lifecycle methods (start, close), properties, public feature methods and private
helpers — i.e., place __aenter__ and __aexit__ at the end of the class body to
follow the project's ordering convention; keep their implementations unchanged
and ensure they still call await self.start() and await self.close()
respectively.

In `@agents-core/vision_agents/core/runner/http/dependencies.py`:
- Around line 7-16: The four permission stub functions (can_start_session,
can_close_session, can_view_session, can_view_metrics) lack return-type
annotations and docstrings; update each signature to return bool (e.g. def
can_start_session(call_id: str) -> bool:) and add a one-line Google-style
docstring describing the purpose, the call_id parameter, and that it returns
True if the action is permitted (False otherwise), so users overriding these
public hooks have explicit types and minimal documentation.

In `@examples/08_agent_server_example/README.md`:
- Around line 181-184: The can_start_session function accepts call_id but
doesn't enforce call-level authorization; update can_start_session to check that
the token_payload grants permission for this specific call_id (e.g., look for a
scoped permission like f"agents:start:{call_id}" or verify call_id is present in
token_payload.get("allowed_call_ids", [])) instead of only checking the global
"agents:start" permission; use the existing token_payload and call_id parameters
and raise HTTPException(403) if the scoped check fails so the example
demonstrates per-call authorization.

In `@tests/test_agents/test_agent_launcher.py`:
- Around line 15-21: The test imports use internal submodule paths for
InMemorySessionKVStore, RedisSessionKVStore, and SessionKVStore; update the
import statements to use the public re-export (e.g., import these three symbols
from vision_agents.core.agents.session_registry or the top-level
vision_agents.core re-export) so the test relies on the public API rather than
internal modules (replace the three imports referencing
session_registry.in_memory_storage, session_registry.redis_storage, and
session_registry.storage with a single public import of InMemorySessionKVStore,
RedisSessionKVStore, and SessionKVStore).
- Around line 63-68: The redis_url fixture launches a real Redis container and
causes Redis to start even for memory-backed runs; update tests so
Redis-dependent runs are marked or the fixture is split: either add
`@pytest.mark.integration` to TestAgentLauncherWithStorage (so the whole class is
treated as integration) or refactor the storage fixture to conditionally require
redis_url only when storage param != "memory" (ensure the storage fixture checks
the param and only references redis_url when needed, leaving memory cases free
of container startup).

In `@tests/test_agents/test_session_registry/test_in_memory_store.py`:
- Around line 9-16: The async pytest fixture "store" should declare its return
type to make typing clear: change "async def store()" to "async def store() ->
AsyncIterator[InMemorySessionKVStore]"; add the needed import "from typing
import AsyncIterator" at the top of the test file; keep the body using
InMemorySessionKVStore, await s.start(), yield s, and await s.close() as-is.

In `@tests/test_agents/test_session_registry/test_redis_store.py`:
- Around line 7-9: The test currently imports RedisSessionKVStore from its
internal module; update the import to use the public re-export instead by
importing RedisSessionKVStore from vision_agents.core.agents.session_registry so
the test relies on the public API rather than the internal module.
- Around line 188-194: The test test_session_expires_without_refresh currently
accesses the private attribute registry._store; instead, instantiate a
standalone SessionRegistry using the provided redis_store fixture (e.g.,
short_registry = SessionRegistry(store=redis_store, ttl=1.0)) so the test
depends only on the public API; update the test to register `"sess-expire"` on
short_registry, await asyncio.sleep(1.5), and assert
short_registry.get("sess-expire") is None, removing any access to
registry._store.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 94c8a33 and b387971.

⛔ Files ignored due to path filters (1)

uv.lock is excluded by !**/*.lock

📒 Files selected for processing (22)

agents-core/pyproject.toml
agents-core/vision_agents/core/__init__.py
agents-core/vision_agents/core/agents/__init__.py
agents-core/vision_agents/core/agents/agent_launcher.py
agents-core/vision_agents/core/agents/session_registry/__init__.py
agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py
agents-core/vision_agents/core/agents/session_registry/redis_storage.py
agents-core/vision_agents/core/agents/session_registry/registry.py
agents-core/vision_agents/core/agents/session_registry/storage.py
agents-core/vision_agents/core/agents/session_registry/types.py
agents-core/vision_agents/core/runner/http/api.py
agents-core/vision_agents/core/runner/http/dependencies.py
agents-core/vision_agents/core/runner/http/models.py
agents-core/vision_agents/core/runner/http/options.py
agents-core/vision_agents/core/runner/runner.py
examples/08_agent_server_example/README.md
pyproject.toml
tests/test_agents/test_agent_launcher.py
tests/test_agents/test_runner.py
tests/test_agents/test_session_registry/__init__.py
tests/test_agents/test_session_registry/test_in_memory_store.py
tests/test_agents/test_session_registry/test_redis_store.py

💤 Files with no reviewable changes (2)

agents-core/vision_agents/core/runner/http/models.py
agents-core/vision_agents/core/runner/runner.py

agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py

agents-core/vision_agents/core/agents/session_registry/redis_store.py

agents-core/vision_agents/core/agents/session_registry/storage.py

agents-core/vision_agents/core/runner/http/api.py

tests/test_agents/test_session_registry/test_in_memory_store.py

tests/test_agents/test_session_registry/test_redis_store.py

- Added SessionRegistry to store agent sessions - Added in-memory and Redis-based backends for SessionRegistry - [breaking] updated HTTP API urls to always include call_id for easy access in permission callbacks - [breaking] removed "get_current_user" permission callback as redundant

coderabbitai

Actionable comments posted: 7

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

agents-core/vision_agents/core/agents/agent_launcher.py (2)
306-328: ⚠️ Potential issue | 🟡 Minor

Fire-and-forget registry.remove in _finalizer has no exception handler.

The create_task(self._registry.remove(session_id_)) at lines 313-315 fires and forgets the coroutine. If registry.remove raises (e.g., Redis connection error during shutdown), the unhandled task exception will surface as an asyncio warning. Consider adding a done callback or using asyncio.shield with error logging.

This is a minor reliability concern — the session will still be removed by TTL expiry, but operational noise is worth avoiding.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 306 -
328, The _finalizer currently fire-and-forgets the coroutine created by
asyncio.get_running_loop().create_task(self._registry.remove(session_id_)) which
can produce unhandled-task exceptions; wrap that call so exceptions are
observed—either create a small async wrapper that awaits
self._registry.remove(session_id_) inside try/except and logs errors, or attach
a done callback to the created Task to catch and log exceptions; update the code
in _finalizer (referencing _finalizer, self._registry.remove, and the
create_task call) to ensure any exception from registry.remove is caught and
logged rather than left unhandled.
186-190: ⚠️ Potential issue | 🟠 Major

Broad except Exception violates the "catch specific exceptions" guideline.

cancel_and_wait can only realistically raise task-related errors. Prefer catching asyncio.CancelledError (or BaseException if truly needed) and use logger.exception() instead of logger.error() so the traceback is preserved.
🐛 Proposed fix
         for result in asyncio.as_completed(coros):
             try:
                 await result
-            except Exception as exc:
-                logger.error(f"Failed to cancel the agent task: {exc}")
+            except BaseException:
+                logger.exception("Failed to cancel the agent task")
As per coding guidelines, "Never write except Exception as e; catch specific exceptions instead" and "Prefer logger.exception() when logging an error with a traceback instead of logger.error('Error: {exc}')".
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 186 -
190, Replace the broad "except Exception as exc" around "await result" with a
targeted handler for cancellation-related errors (e.g., "except
asyncio.CancelledError") and use logger.exception() to preserve the traceback;
if you truly need to catch everything (rare), use "except BaseException"
intentionally and document why. Update the block that currently calls
logger.error(...) (referencing "result" and the surrounding cancel_and_wait
logic) to call logger.exception(...) in the cancellation-specific except handler
so tracebacks are retained.

♻️ Duplicate comments (7)

agents-core/vision_agents/core/agents/session_registry/storage.py (2)

45-46: Non-standard __aexit__ signature — unchanged from prior review.

The three exception arguments are still collapsed into *exc rather than named (exc_type, exc_val, exc_tb).

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/storage.py` around
lines 45 - 46, The asynchronous context manager method __aexit__ currently
collapses exception args into *exc; change its signature to the standard async
def __aexit__(self, exc_type, exc_val, exc_tb) -> None in the SessionRegistry
storage class so the interpreter provides the three exception parameters
correctly, and keep the body to await self.close() (returning None implicitly).

154-155: subscribe abstract method is still declared def instead of async def — unchanged from prior review.

Both InMemorySessionKVStore.subscribe and RedisSessionKVStore.subscribe are async generators; the ABC contract still doesn't match.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/storage.py` around
lines 154 - 155, The abstract method subscribe in the SessionKVStore ABC is
declared with a regular def but the concrete implementations
InMemorySessionKVStore.subscribe and RedisSessionKVStore.subscribe are async
generators; change the ABC declaration to "async def subscribe(self, channel:
str) -> AsyncIterator[bytes]" and keep the `@abc.abstractmethod` decorator so the
signature matches the concrete async implementations; update any type imports if
needed to use typing.AsyncIterator.

agents-core/vision_agents/core/agents/session_registry/redis_storage.py (1)

75-80: await on pipeline commands in mset — unchanged from prior review.

Individual pipeline command calls should not be awaited; only pipe.execute() should be awaited.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`
around lines 75 - 80, In mset, the pipeline command calls are being awaited
incorrectly; inside async def mset (function mset) remove the await before
pipe.set within the for loop so pipeline commands are queued (use
pipe.set(self._prefixed(key), value, px=int(ttl * 1000)) without await), and
keep only a single await on pipe.execute() to run the pipeline; reference the
async with self._redis.pipeline() as pipe block and ensure _prefixed(key) is
still used as the key.

agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py (1)

122-124: KeyError in subscribe finalizer — unchanged from prior review.

self._subscribers[channel].remove(queue) will raise KeyError if close() already called self._subscribers.clear().

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py`
around lines 122 - 124, The finalizer in the subscribe flow can raise KeyError
when self._subscribers has been cleared; update the removal to first fetch the
subscriber collection with self._subscribers.get(channel) (or check "if channel
in self._subscribers") and only remove the queue if present, then delete the
channel key if the collection is empty, or wrap the removal in a try/except
KeyError to safely ignore races; target the finalizer inside the subscribe logic
that references self._subscribers[channel].remove(queue) and the subsequent
deletion so it no longer assumes the channel or queue always exists.

tests/test_agents/test_session_registry/test_in_memory_store.py (1)

32-35: ⚠️ Potential issue | 🟡 Minor

Harden TTL assertions against scheduler jitter.

These expiry checks still use tight sleep margins and can intermittently fail on loaded runners. Increase the sleep-to-TTL ratio (≥2–3x) to make assertions deterministic.

♻️ Proposed stabilization

-        await store.set("ephemeral", b"bye", ttl=0.5)
-        await asyncio.sleep(0.7)
+        await store.set("ephemeral", b"bye", ttl=0.2)
+        await asyncio.sleep(0.6)
         assert await store.get("ephemeral") is None
@@
-        await store.set("refresh_me", b"val", ttl=1.0)
-        await asyncio.sleep(0.5)
-        await store.expire("refresh_me", ttl=2.0)
-        await asyncio.sleep(1.0)
+        await store.set("refresh_me", b"val", ttl=0.3)
+        await asyncio.sleep(0.2)
+        await store.expire("refresh_me", ttl=0.8)
+        await asyncio.sleep(0.4)
         assert await store.get("refresh_me") == b"val"
@@
-        await store.mset([("m1", b"a", 1.0), ("m2", b"b", 1.0)])
-        await asyncio.sleep(0.5)
-        await store.expire("m1", "m2", ttl=2.0)
-        await asyncio.sleep(1.0)
+        await store.mset([("m1", b"a", 0.3), ("m2", b"b", 0.3)])
+        await asyncio.sleep(0.2)
+        await store.expire("m1", "m2", ttl=0.8)
+        await asyncio.sleep(0.4)
         assert await store.get("m1") == b"a"
         assert await store.get("m2") == b"b"

Also applies to: 56-61, 66-72

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_in_memory_store.py` around lines
32 - 35, Increase the sleep margins in the TTL-related tests to avoid flakiness:
for tests that call InMemorySessionKVStore.set(..., ttl=...), e.g.,
test_ttl_expiry (and the other cases around lines 56-61 and 66-72), multiply the
sleep duration to at least 2–3× the TTL (for example change asyncio.sleep(0.7)
after ttl=0.5 to asyncio.sleep(1.1–1.5) or equivalent) so assertions using
store.get("...") reliably observe expiry; update the sleeps wherever ttl-based
expiry is asserted to use the higher ratio.

tests/test_agents/test_session_registry/test_redis_store.py (1)

44-44: ⚠️ Potential issue | 🟠 Major

Mark Redis-dependent suites as integration tests.

These classes depend on Testcontainers/Redis and should be explicitly marked integration so they don’t run in unit-only jobs.
♻️ Proposed fix
+@pytest.mark.integration
 class TestRedisSessionKVStore:
@@
+@pytest.mark.integration
 class TestSessionRegistryWithRedis:
As per coding guidelines, "Mark integration tests with @pytest.mark.integration decorator".

Also applies to: 144-144
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_redis_store.py` at line 44, The
Redis-dependent test class TestRedisSessionKVStore should be marked as an
integration test by adding the pytest.mark.integration decorator above the class
definition; locate class TestRedisSessionKVStore and prepend
`@pytest.mark.integration` (and do the same for the other Redis-dependent test
class in this file) so these suites are excluded from unit-only jobs.

agents-core/vision_agents/core/runner/http/api.py (1)

110-117: ⚠️ Potential issue | 🔴 Critical

Enforce call_id ownership before close/read/metrics operations.

These handlers check existence only, not that the session belongs to the path call_id. That enables cross-call operations with a known session_id.

🔒 Proposed fix

-    info = await launcher.get_session_info(session_id)
-    if info is None:
+    info = await launcher.get_session_info(session_id)
+    if info is None or info.call_id != call_id:
         raise HTTPException(
             status_code=status.HTTP_404_NOT_FOUND,
             detail=f"Session with id '{session_id}' not found",
         )
@@
-    info = await launcher.get_session_info(session_id)
-    if info is None:
+    info = await launcher.get_session_info(session_id)
+    if info is None or info.call_id != call_id:
         raise HTTPException(
             status_code=status.HTTP_404_NOT_FOUND,
             detail=f"Session with id '{session_id}' not found",
         )
@@
-    info = await launcher.get_session_info(session_id)
-    if info is None:
+    info = await launcher.get_session_info(session_id)
+    if info is None or info.call_id != call_id:
         raise HTTPException(
             status_code=status.HTTP_404_NOT_FOUND,
             detail=f"Session with id '{session_id}' not found",
         )
@@
-    info = await launcher.get_session_info(session_id)
-    if info is None:
+    info = await launcher.get_session_info(session_id)
+    if info is None or info.call_id != call_id:
         raise HTTPException(
             status_code=status.HTTP_404_NOT_FOUND,
             detail=f"Session with id '{session_id}' not found",
         )

Also applies to: 137-144, 160-170, 187-202

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/runner/http/api.py` around lines 110 - 117,
Handler code calls launcher.get_session_info(session_id) and only checks for
existence; change each handler (the close/read/metrics HTTP handlers that call
launcher.get_session_info and then launcher.request_close_session /
launcher.request_read_session / launcher.request_session_metrics) to also verify
that the returned session info belongs to the provided path call_id (e.g.,
compare info.call_id or info["call_id"] to the request path call_id) and raise
HTTPException(status_code=404, detail="Session with id '...' not found") when it
does not match; do this check before calling
launcher.request_close_session/request_read_session/request_session_metrics so
cross-call session access is prevented.

🧹 Nitpick comments (3)

agents-core/vision_agents/core/agents/session_registry/redis_storage.py (1)

54-64: Simplify ping — inspect.iscoroutine branch is always true in redis.asyncio.

In the redis.asyncio namespace, Redis.ping() always returns a coroutine. The conditional dance exists to work around a Union return type in redis-py's stubs, but the sync branch is dead code. A direct await self._redis.ping() is cleaner and equally correct.
♻️ Proposed simplification
 async def start(self) -> None:
     """Open the Redis connection and verify it with a PING."""
-    # Handle non-specific Union return type here
-    ping = self._redis.ping()
-    if inspect.iscoroutine(ping):
-        await ping
+    await self._redis.ping()

     connection_kwargs = self._redis.connection_pool.connection_kwargs
If inspect is not used elsewhere, remove it from the imports too.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`
around lines 54 - 64, In RedisSessionKVStore.start(), remove the unnecessary
inspect.iscoroutine check and simply await the coroutine returned by
self._redis.ping() (i.e., replace the ping = ... / inspect.iscoroutine branch
with await self._redis.ping()); also remove the unused inspect import if it
isn’t used elsewhere so the module no longer contains dead code.

agents-core/vision_agents/core/__init__.py (1)

25-29: Silent ImportError swallowing may hide development-time bugs.

Any ImportError inside redis_storage.py itself (e.g., a missing sub-import) will be silently swallowed. A logger.debug or logger.warning call in the except branch would surface such issues during development without affecting production.

♻️ Proposed refinement

 try:
     from vision_agents.core.agents.session_registry import RedisSessionKVStore

     __all__ += ["RedisSessionKVStore"]
 except ImportError:
-    pass
+    import logging as _logging
+    _logging.getLogger(__name__).debug(
+        "RedisSessionKVStore not available (install redis[hiredis] to enable)"
+    )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/__init__.py` around lines 25 - 29, The except
block currently swallows ImportError when importing RedisSessionKVStore; update
the import in __init__.py to catch ImportError as e and call the module logger
(e.g., logger.debug or logger.warning) to emit the exception details (including
e or exc_info) while still allowing fallback behavior, referencing the symbol
RedisSessionKVStore and the import line from
vision_agents.core.agents.session_registry to locate the change.

tests/test_agents/test_runner.py (1)

405-406: Add response-status assertions in call_id propagation tests.

These tests currently verify callback invocation only. Add status assertions so they also fail on endpoint behavior regressions.

♻️ Proposed fix

-            await client.delete(f"/calls/my-call-456/sessions/{session_id}")
+            resp = await client.delete(f"/calls/my-call-456/sessions/{session_id}")
+            assert resp.status_code == 202
             assert received_call_ids == ["my-call-456"]
@@
-            await client.get(f"/calls/my-call-789/sessions/{session_id}")
+            resp = await client.get(f"/calls/my-call-789/sessions/{session_id}")
+            assert resp.status_code == 200
             assert received_call_ids == ["my-call-789"]
@@
-            await client.get(f"/calls/my-call-abc/sessions/{session_id}/metrics")
+            resp = await client.get(f"/calls/my-call-abc/sessions/{session_id}/metrics")
+            assert resp.status_code == 200
             assert received_call_ids == ["my-call-abc"]

Also applies to: 428-429, 451-452

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_runner.py` around lines 405 - 406, Add assertions that
check the HTTP response status when calling the delete endpoint so the tests
fail on endpoint regressions; specifically, capture the result of the call to
client.delete(f"/calls/my-call-456/sessions/{session_id}") (and the two other
similar calls at the other locations) into a variable like response and assert
response.status_code == 200 before asserting on received_call_ids to ensure both
callback invocation and endpoint status are validated.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 493-499: Replace the broad "except Exception" in
_refresh_active_sessions with a specific registry/storage-backend exception (the
same specific exception used in _close_expired_sessions), e.g., catch
RegistryError or StorageBackendError thrown by self._registry.refresh; import
that exception symbol and handle only it (log with logger.exception to record
the exception), and let unexpected exceptions propagate (or re-raise) so they
aren't silently swallowed; reference symbols: _refresh_active_sessions,
self._registry.refresh, sessions_map, logger.exception, and the specific
exception type used in _close_expired_sessions.
- Around line 462-477: The broad except in _close_expired_sessions should be
replaced with a specific storage-related exception handler: change the catch
around the call to registry.get_close_requests to only catch your storage error
(e.g., redis.exceptions.RedisError) or a new domain-level StorageError that all
session-store implementations (like RedisSessionKVStore) raise; log the
exception with logger.exception when that storage error occurs and allow other
exceptions to propagate (or re-raise) so logical bugs aren’t suppressed. Ensure
registry.get_close_requests and the storage backend implementations raise the
chosen StorageError type (or the specific backend exception) so the except
targets a concrete error class instead of Exception.
- Around line 486-491: The loop over to_close and result currently logs
exceptions using logger.error(..., exc_info=r); replace that call with
logger.exception so the traceback is logged idiomatically — e.g., in the block
that checks isinstance(r, Exception) for each agent (refer to the
to_close/result loop and agent.agent_user.id), call logger.exception(f"Failed to
close agent with user_id {agent.agent_user.id}", exc_info=r) (or simply
logger.exception(f"Failed to close agent with user_id {agent.agent_user.id}") if
you ensure r is the active exception) to conform to logging guidelines.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`:
- Around line 82-89: The expire method is awaiting non-awaitable pipeline
queueing calls which causes a TypeError and prevents commands from being
buffered; in the expire function remove the spurious await when queuing pexpire
on the redis pipeline (i.e. change await pipe.pexpire(...) to pipe.pexpire(...))
so that the commands are enqueued and only executed when await pipe.execute() is
called; ensure you still call self._prefixed(key) for each key and keep the
final await pipe.execute() intact.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py`:
- Around line 123-128: The current get method uses
SessionInfo(**json.loads(raw)) which will raise TypeError if stored JSON
contains extra fields; change both get and get_for_call to parse raw with
json.loads(raw), then filter the resulting dict to only keys present in
SessionInfo's defined fields (e.g., using SessionInfo.__dataclass_fields__ or
dataclasses.fields to build the allowed key set) before instantiating
SessionInfo, so unknown/newer fields are ignored and construction succeeds
during rolling deployments.
- Around line 40-46: Add async context manager support to SessionRegistry by
implementing __aenter__ and __aexit__: have __aenter__ await self.start() and
return self, and have __aexit__ await self.stop(); ensure __aexit__ accepts
(exc_type, exc, tb) and always attempts to stop (so resources are cleaned up
even on errors) and re-raise or return False per current semantics. Update the
class where start and stop are defined (SessionRegistry, methods start/stop) to
include these two new async dunder methods.

In `@tests/test_agents/test_agent_launcher.py`:
- Around line 71-72: The parameterized fixture storage currently runs both
"memory" and "redis" for every test; update the storage fixture to mark the
"redis" variant as an integration-only case so it only runs when integration
tests are requested. Concretely, change the parametrization in the storage
fixture to use pytest.param("redis", marks=pytest.mark.integration) for the
Redis variant (keeping "memory" unmarked), ensuring the fixture signature
(storage(request, redis_url) -> AsyncIterator[SessionKVStore]) and existing use
of request/redis_url remain intact.

---

Outside diff comments:
In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 306-328: The _finalizer currently fire-and-forgets the coroutine
created by
asyncio.get_running_loop().create_task(self._registry.remove(session_id_)) which
can produce unhandled-task exceptions; wrap that call so exceptions are
observed—either create a small async wrapper that awaits
self._registry.remove(session_id_) inside try/except and logs errors, or attach
a done callback to the created Task to catch and log exceptions; update the code
in _finalizer (referencing _finalizer, self._registry.remove, and the
create_task call) to ensure any exception from registry.remove is caught and
logged rather than left unhandled.
- Around line 186-190: Replace the broad "except Exception as exc" around "await
result" with a targeted handler for cancellation-related errors (e.g., "except
asyncio.CancelledError") and use logger.exception() to preserve the traceback;
if you truly need to catch everything (rare), use "except BaseException"
intentionally and document why. Update the block that currently calls
logger.error(...) (referencing "result" and the surrounding cancel_and_wait
logic) to call logger.exception(...) in the cancellation-specific except handler
so tracebacks are retained.

---

Duplicate comments:
In `@agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py`:
- Around line 122-124: The finalizer in the subscribe flow can raise KeyError
when self._subscribers has been cleared; update the removal to first fetch the
subscriber collection with self._subscribers.get(channel) (or check "if channel
in self._subscribers") and only remove the queue if present, then delete the
channel key if the collection is empty, or wrap the removal in a try/except
KeyError to safely ignore races; target the finalizer inside the subscribe logic
that references self._subscribers[channel].remove(queue) and the subsequent
deletion so it no longer assumes the channel or queue always exists.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`:
- Around line 75-80: In mset, the pipeline command calls are being awaited
incorrectly; inside async def mset (function mset) remove the await before
pipe.set within the for loop so pipeline commands are queued (use
pipe.set(self._prefixed(key), value, px=int(ttl * 1000)) without await), and
keep only a single await on pipe.execute() to run the pipeline; reference the
async with self._redis.pipeline() as pipe block and ensure _prefixed(key) is
still used as the key.

In `@agents-core/vision_agents/core/agents/session_registry/storage.py`:
- Around line 45-46: The asynchronous context manager method __aexit__ currently
collapses exception args into *exc; change its signature to the standard async
def __aexit__(self, exc_type, exc_val, exc_tb) -> None in the SessionRegistry
storage class so the interpreter provides the three exception parameters
correctly, and keep the body to await self.close() (returning None implicitly).
- Around line 154-155: The abstract method subscribe in the SessionKVStore ABC
is declared with a regular def but the concrete implementations
InMemorySessionKVStore.subscribe and RedisSessionKVStore.subscribe are async
generators; change the ABC declaration to "async def subscribe(self, channel:
str) -> AsyncIterator[bytes]" and keep the `@abc.abstractmethod` decorator so the
signature matches the concrete async implementations; update any type imports if
needed to use typing.AsyncIterator.

In `@agents-core/vision_agents/core/runner/http/api.py`:
- Around line 110-117: Handler code calls launcher.get_session_info(session_id)
and only checks for existence; change each handler (the close/read/metrics HTTP
handlers that call launcher.get_session_info and then
launcher.request_close_session / launcher.request_read_session /
launcher.request_session_metrics) to also verify that the returned session info
belongs to the provided path call_id (e.g., compare info.call_id or
info["call_id"] to the request path call_id) and raise
HTTPException(status_code=404, detail="Session with id '...' not found") when it
does not match; do this check before calling
launcher.request_close_session/request_read_session/request_session_metrics so
cross-call session access is prevented.

In `@tests/test_agents/test_session_registry/test_in_memory_store.py`:
- Around line 32-35: Increase the sleep margins in the TTL-related tests to
avoid flakiness: for tests that call InMemorySessionKVStore.set(..., ttl=...),
e.g., test_ttl_expiry (and the other cases around lines 56-61 and 66-72),
multiply the sleep duration to at least 2–3× the TTL (for example change
asyncio.sleep(0.7) after ttl=0.5 to asyncio.sleep(1.1–1.5) or equivalent) so
assertions using store.get("...") reliably observe expiry; update the sleeps
wherever ttl-based expiry is asserted to use the higher ratio.

In `@tests/test_agents/test_session_registry/test_redis_store.py`:
- Line 44: The Redis-dependent test class TestRedisSessionKVStore should be
marked as an integration test by adding the pytest.mark.integration decorator
above the class definition; locate class TestRedisSessionKVStore and prepend
`@pytest.mark.integration` (and do the same for the other Redis-dependent test
class in this file) so these suites are excluded from unit-only jobs.

---

Nitpick comments:
In `@agents-core/vision_agents/core/__init__.py`:
- Around line 25-29: The except block currently swallows ImportError when
importing RedisSessionKVStore; update the import in __init__.py to catch
ImportError as e and call the module logger (e.g., logger.debug or
logger.warning) to emit the exception details (including e or exc_info) while
still allowing fallback behavior, referencing the symbol RedisSessionKVStore and
the import line from vision_agents.core.agents.session_registry to locate the
change.

In `@agents-core/vision_agents/core/agents/session_registry/redis_storage.py`:
- Around line 54-64: In RedisSessionKVStore.start(), remove the unnecessary
inspect.iscoroutine check and simply await the coroutine returned by
self._redis.ping() (i.e., replace the ping = ... / inspect.iscoroutine branch
with await self._redis.ping()); also remove the unused inspect import if it
isn’t used elsewhere so the module no longer contains dead code.

In `@tests/test_agents/test_runner.py`:
- Around line 405-406: Add assertions that check the HTTP response status when
calling the delete endpoint so the tests fail on endpoint regressions;
specifically, capture the result of the call to
client.delete(f"/calls/my-call-456/sessions/{session_id}") (and the two other
similar calls at the other locations) into a variable like response and assert
response.status_code == 200 before asserting on received_call_ids to ensure both
callback invocation and endpoint status are validated.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between b387971 and 221c226.

⛔ Files ignored due to path filters (1)

uv.lock is excluded by !**/*.lock

📒 Files selected for processing (22)

agents-core/pyproject.toml
agents-core/vision_agents/core/__init__.py
agents-core/vision_agents/core/agents/__init__.py
agents-core/vision_agents/core/agents/agent_launcher.py
agents-core/vision_agents/core/agents/session_registry/__init__.py
agents-core/vision_agents/core/agents/session_registry/in_memory_storage.py
agents-core/vision_agents/core/agents/session_registry/redis_storage.py
agents-core/vision_agents/core/agents/session_registry/registry.py
agents-core/vision_agents/core/agents/session_registry/storage.py
agents-core/vision_agents/core/agents/session_registry/types.py
agents-core/vision_agents/core/runner/http/api.py
agents-core/vision_agents/core/runner/http/dependencies.py
agents-core/vision_agents/core/runner/http/models.py
agents-core/vision_agents/core/runner/http/options.py
agents-core/vision_agents/core/runner/runner.py
examples/08_agent_server_example/README.md
pyproject.toml
tests/test_agents/test_agent_launcher.py
tests/test_agents/test_runner.py
tests/test_agents/test_session_registry/__init__.py
tests/test_agents/test_session_registry/test_in_memory_store.py
tests/test_agents/test_session_registry/test_redis_store.py

💤 Files with no reviewable changes (2)

agents-core/vision_agents/core/runner/runner.py
agents-core/vision_agents/core/runner/http/models.py

🚧 Files skipped from review as they are similar to previous changes (5)

agents-core/vision_agents/core/agents/session_registry/init.py
agents-core/vision_agents/core/runner/http/options.py
agents-core/vision_agents/core/agents/session_registry/types.py
agents-core/pyproject.toml
agents-core/vision_agents/core/agents/init.py

agents-core/vision_agents/core/agents/agent_launcher.py

agents-core/vision_agents/core/agents/session_registry/redis_store.py

agents-core/vision_agents/core/agents/session_registry/registry.py

tests/test_agents/test_agent_launcher.py

cursor · 2026-02-26T10:57:52Z

You have run out of free Bugbot PR reviews for this billing cycle. This will reset on March 7.

To receive reviews on all of your PRs, visit the Cursor dashboard to activate Pro and start your 14-day free trial.

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (5)

tests/test_agents/test_agent_launcher.py (1)

71-72: ⚠️ Potential issue | 🟡 Minor

Mark the Redis fixture variant as integration-only.

The "redis" parameter requires external infrastructure and should be marked pytest.mark.integration to avoid forcing it in default test runs.

💡 Proposed fix

-@pytest.fixture(params=["memory", "redis"])
+@pytest.fixture(
+    params=[
+        "memory",
+        pytest.param("redis", marks=pytest.mark.integration),
+    ]
+)
 async def storage(request, redis_url) -> AsyncIterator[SessionKVStore]:

As per coding guidelines, "Use `@pytest.mark.integration` to mark integration tests".

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_agent_launcher.py` around lines 71 - 72, The
parametrized storage fixture currently includes "redis" unconditionally; change
the params list to mark the Redis variant as integration-only by replacing the
plain "redis" entry with pytest.param("redis", marks=pytest.mark.integration) so
only the Redis case is selected for integration runs; ensure pytest is imported
and leave the fixture signature async def storage(request, redis_url) ->
AsyncIterator[SessionKVStore] unchanged.

tests/test_agents/test_session_registry/test_in_memory_store.py (1)

32-35: ⚠️ Potential issue | 🟡 Minor

TTL expiry assertion window is a bit tight for CI jitter.

This test can intermittently fail under load. Consider widening the sleep-to-TTL margin to make expiry checks deterministic.

💡 Proposed fix

     async def test_ttl_expiry(self, store: InMemorySessionKVStore) -> None:
-        await store.set("ephemeral", b"bye", ttl=0.5)
-        await asyncio.sleep(0.7)
+        await store.set("ephemeral", b"bye", ttl=0.2)
+        await asyncio.sleep(0.8)
         assert await store.get("ephemeral") is None

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_in_memory_store.py` around lines
32 - 35, The TTL expiry assertion window in test_ttl_expiry is too tight; update
the test to give more buffer by increasing the sleep duration after calling
store.set("ephemeral", b"bye", ttl=0.5) (for example change the sleep from 0.7s
to ~1.0s or increase the TTL+margin) so that awaiting store.get("ephemeral")
reliably returns None; touch the test_ttl_expiry function in
tests/test_agents/test_session_registry/test_in_memory_store.py and only adjust
the asyncio.sleep timing (or TTL+sleep pair) to widen the margin.

agents-core/vision_agents/core/agents/session_registry/redis_store.py (1)

81-88: ⚠️ Potential issue | 🔴 Critical

Do not await queued pipeline commands in expire().

await pipe.pexpire(...) is incorrect for redis-py pipeline queuing. Queue the command, then await only pipe.execute().

💡 Proposed fix

     async def expire(self, *keys: str, ttl: float) -> None:
         """Refresh TTL on one or more keys via a transactional PEXPIRE pipeline."""
         if not keys:
             return
         async with self._redis.pipeline() as pipe:
             for key in keys:
-                await pipe.pexpire(self._prefixed(key), int(ttl * 1000))
+                pipe.pexpire(self._prefixed(key), int(ttl * 1000))
             await pipe.execute()

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/redis_store.py` around
lines 81 - 88, In expire(), stop awaiting queued pipeline commands: when
iterating keys call pipe.pexpire(self._prefixed(key), int(ttl * 1000)) without
await to queue the commands, then await pipe.execute() once after the loop;
update the expire method (symbol: expire and usage of self._redis.pipeline(),
pipe.pexpire, pipe.execute) so only pipe.execute() is awaited.

tests/test_agents/test_session_registry/test_redis_store.py (1)

44-44: ⚠️ Potential issue | 🟡 Minor

Mark Redis-backed test classes as integration tests.

Both classes require containerized Redis and should be explicitly marked for integration test runs.
💡 Proposed fix
+@pytest.mark.integration
 class TestRedisSessionKVStore:
@@
+@pytest.mark.integration
 class TestSessionRegistryWithRedis:
As per coding guidelines, "Use `@pytest.mark.integration` to mark integration tests".
Also applies to: 125-125
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_redis_store.py` at line 44, Mark
the Redis-backed test classes as integration tests by decorating the class
TestRedisSessionKVStore (and the other Redis-backed test class in this file)
with pytest's integration marker: add `@pytest.mark.integration` above each class
or set pytestmark = pytest.mark.integration at module/class scope, and ensure
pytest is imported (import pytest) so the marker resolves.

agents-core/vision_agents/core/agents/session_registry/registry.py (1)

123-139: ⚠️ Potential issue | 🟠 Major

Harden SessionInfo deserialization for rolling-deploy compatibility.

SessionInfo(**json.loads(raw)) will fail on unknown fields from newer writers. Filter payload keys before constructing SessionInfo.

💡 Proposed fix

-import json
+import json
 import logging
 import time
-from dataclasses import asdict
+from dataclasses import asdict, fields
@@
 logger = logging.getLogger(__name__)
+_SESSION_INFO_FIELDS = {f.name for f in fields(SessionInfo)}
@@
     async def get(self, session_id: str) -> SessionInfo | None:
         """Look up a session by ID from shared storage."""
         raw = await self._store.get(f"sessions/{session_id}")
         if raw is None:
             return None
-        return SessionInfo(**json.loads(raw))
+        data = json.loads(raw)
+        return SessionInfo(**{k: v for k, v in data.items() if k in _SESSION_INFO_FIELDS})
@@
     async def get_for_call(self, call_id: str) -> list[SessionInfo]:
@@
         session_ids = [k.rsplit("/", 1)[-1] for k in index_keys]
         session_keys = [f"sessions/{sid}" for sid in session_ids]
         values = await self._store.mget(session_keys)
-        return [SessionInfo(**json.loads(raw)) for raw in values if raw is not None]
+        result: list[SessionInfo] = []
+        for raw in values:
+            if raw is None:
+                continue
+            data = json.loads(raw)
+            result.append(
+                SessionInfo(**{k: v for k, v in data.items() if k in _SESSION_INFO_FIELDS})
+            )
+        return result

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py` around
lines 123 - 139, The deserialization calls in get and get_for_call currently
pass raw JSON directly to SessionInfo which will raise on unknown fields;
instead, json.loads(raw) should be filtered to only the allowed SessionInfo
fields before constructing the model. For both get (inside
SessionInfo(**json.loads(raw))) and get_for_call (inside the list comprehension
over values), parse raw into a dict, compute the allowed keys from SessionInfo
(e.g., SessionInfo.__fields__.keys() for a pydantic model or dataclasses.fields
for a dataclass), create a filtered dict = {k: v for k, v in parsed.items() if k
in allowed_keys}, and then call SessionInfo(**filtered); also skip/ignore
entries where raw is None or parsing fails.

🧹 Nitpick comments (2)

agents-core/vision_agents/core/agents/session_registry/in_memory_store.py (1)

32-35: Make start() idempotent to avoid duplicate cleanup tasks.

Calling start() twice currently overwrites _cleanup_task while leaving the old task running.

💡 Proposed fix

     async def start(self) -> None:
         """Start the background cleanup task."""
+        if self._cleanup_task is not None and not self._cleanup_task.done():
+            return
         self._cleanup_task = asyncio.create_task(self._cleanup_loop())

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/in_memory_store.py`
around lines 32 - 35, The start() method currently always assigns a new asyncio
task to _cleanup_task and can spawn duplicates; make start() idempotent by
checking the existing _cleanup_task before creating a new one: if
self._cleanup_task is not None and not self._cleanup_task.done(), simply return;
if it is None or done(), create a fresh task via
asyncio.create_task(self._cleanup_loop()) and assign it to _cleanup_task. Update
start() to use this guard (referencing start(), _cleanup_task, and
_cleanup_loop()) so repeated calls won't leave old cleanup tasks running.

tests/test_agents/test_session_registry/test_redis_store.py (1)

169-173: Prefer fixture-level store injection over registry._store access.

Using the private _store attribute makes this test brittle to internal refactors.

💡 Proposed fix

-    async def test_session_expires_without_refresh(
-        self, registry: SessionRegistry
-    ) -> None:
-        short_registry = SessionRegistry(store=registry._store, ttl=1.0)
+    async def test_session_expires_without_refresh(
+        self, redis_store: RedisSessionKVStore
+    ) -> None:
+        short_registry = SessionRegistry(store=redis_store, ttl=1.0)
         await short_registry.register("sess-expire", "call-e")
         await asyncio.sleep(1.5)
         assert await short_registry.get("sess-expire") is None

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_redis_store.py` around lines 169
- 173, The test test_session_expires_without_refresh is accessing the private
attribute registry._store to construct a short-lived SessionRegistry, which is
brittle; update the test to use a public/fixture-provided store instead — e.g.,
accept the store fixture (or use a public accessor like registry.store if
available) and pass that into SessionRegistry(store=store, ttl=1.0), then call
short_registry.register("sess-expire", "call-e") as before; ensure you remove
any direct references to registry._store in the test.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@agents-core/vision_agents/core/agents/session_registry/__init__.py`:
- Around line 13-18: The current blanket "except ImportError: pass" hides
legitimate import errors; change it to only ignore the optional Redis dependency
by catching ModuleNotFoundError (or ImportError) and verifying the missing
module is 'redis' before swallowing—otherwise re-raise the exception; update the
import block that references RedisSessionKVStore and __all__ to use this
selective handling so real errors in redis_store.py surface.

In `@agents-core/vision_agents/core/agents/session_registry/in_memory_store.py`:
- Around line 22-31: In the InMemoryStore.__init__ add validation for the
cleanup_interval parameter: if cleanup_interval is not a positive number (<= 0)
raise a ValueError with a descriptive message (e.g. "cleanup_interval must be >
0"); ensure this check runs before assigning to self._cleanup_interval and
before starting any cleanup task so the invalid value cannot create a
pathological cleanup loop.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py`:
- Around line 25-35: In the SessionRegistry.__init__ initializer (registry.py)
add validation for the ttl parameter: if ttl is less than or equal to 0 raise a
ValueError with a clear message (e.g. "ttl must be > 0") before assigning
self._ttl to prevent invalid expiry behavior across backends; keep the existing
defaults and assignment logic for self._store and self._node_id but perform the
ttl check first and raise the domain-appropriate ValueError when invalid.

---

Duplicate comments:
In `@agents-core/vision_agents/core/agents/session_registry/redis_store.py`:
- Around line 81-88: In expire(), stop awaiting queued pipeline commands: when
iterating keys call pipe.pexpire(self._prefixed(key), int(ttl * 1000)) without
await to queue the commands, then await pipe.execute() once after the loop;
update the expire method (symbol: expire and usage of self._redis.pipeline(),
pipe.pexpire, pipe.execute) so only pipe.execute() is awaited.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py`:
- Around line 123-139: The deserialization calls in get and get_for_call
currently pass raw JSON directly to SessionInfo which will raise on unknown
fields; instead, json.loads(raw) should be filtered to only the allowed
SessionInfo fields before constructing the model. For both get (inside
SessionInfo(**json.loads(raw))) and get_for_call (inside the list comprehension
over values), parse raw into a dict, compute the allowed keys from SessionInfo
(e.g., SessionInfo.__fields__.keys() for a pydantic model or dataclasses.fields
for a dataclass), create a filtered dict = {k: v for k, v in parsed.items() if k
in allowed_keys}, and then call SessionInfo(**filtered); also skip/ignore
entries where raw is None or parsing fails.

In `@tests/test_agents/test_agent_launcher.py`:
- Around line 71-72: The parametrized storage fixture currently includes "redis"
unconditionally; change the params list to mark the Redis variant as
integration-only by replacing the plain "redis" entry with pytest.param("redis",
marks=pytest.mark.integration) so only the Redis case is selected for
integration runs; ensure pytest is imported and leave the fixture signature
async def storage(request, redis_url) -> AsyncIterator[SessionKVStore]
unchanged.

In `@tests/test_agents/test_session_registry/test_in_memory_store.py`:
- Around line 32-35: The TTL expiry assertion window in test_ttl_expiry is too
tight; update the test to give more buffer by increasing the sleep duration
after calling store.set("ephemeral", b"bye", ttl=0.5) (for example change the
sleep from 0.7s to ~1.0s or increase the TTL+margin) so that awaiting
store.get("ephemeral") reliably returns None; touch the test_ttl_expiry function
in tests/test_agents/test_session_registry/test_in_memory_store.py and only
adjust the asyncio.sleep timing (or TTL+sleep pair) to widen the margin.

In `@tests/test_agents/test_session_registry/test_redis_store.py`:
- Line 44: Mark the Redis-backed test classes as integration tests by decorating
the class TestRedisSessionKVStore (and the other Redis-backed test class in this
file) with pytest's integration marker: add `@pytest.mark.integration` above each
class or set pytestmark = pytest.mark.integration at module/class scope, and
ensure pytest is imported (import pytest) so the marker resolves.

---

Nitpick comments:
In `@agents-core/vision_agents/core/agents/session_registry/in_memory_store.py`:
- Around line 32-35: The start() method currently always assigns a new asyncio
task to _cleanup_task and can spawn duplicates; make start() idempotent by
checking the existing _cleanup_task before creating a new one: if
self._cleanup_task is not None and not self._cleanup_task.done(), simply return;
if it is None or done(), create a fresh task via
asyncio.create_task(self._cleanup_loop()) and assign it to _cleanup_task. Update
start() to use this guard (referencing start(), _cleanup_task, and
_cleanup_loop()) so repeated calls won't leave old cleanup tasks running.

In `@tests/test_agents/test_session_registry/test_redis_store.py`:
- Around line 169-173: The test test_session_expires_without_refresh is
accessing the private attribute registry._store to construct a short-lived
SessionRegistry, which is brittle; update the test to use a
public/fixture-provided store instead — e.g., accept the store fixture (or use a
public accessor like registry.store if available) and pass that into
SessionRegistry(store=store, ttl=1.0), then call
short_registry.register("sess-expire", "call-e") as before; ensure you remove
any direct references to registry._store in the test.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 221c226 and f2cddbb.

📒 Files selected for processing (8)

agents-core/vision_agents/core/agents/session_registry/__init__.py
agents-core/vision_agents/core/agents/session_registry/in_memory_store.py
agents-core/vision_agents/core/agents/session_registry/redis_store.py
agents-core/vision_agents/core/agents/session_registry/registry.py
agents-core/vision_agents/core/agents/session_registry/store.py
tests/test_agents/test_agent_launcher.py
tests/test_agents/test_session_registry/test_in_memory_store.py
tests/test_agents/test_session_registry/test_redis_store.py

agents-core/vision_agents/core/agents/session_registry/__init__.py

agents-core/vision_agents/core/agents/session_registry/in_memory_store.py

agents-core/vision_agents/core/agents/session_registry/registry.py

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

agents-core/vision_agents/core/runner/http/api.py (1)

75-86: ⚠️ Potential issue | 🟠 Major

Do not return raw exception text in 500 responses.

detail=f"Failed to start agent: {str(e)}" can expose internal runtime details to clients. Keep details in logs and return a generic error message.

🛡️ Proposed fix

-    except Exception as e:
+    except Exception:
         logger.exception("Failed to start agent")
         raise HTTPException(
             status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
-            detail=f"Failed to start agent: {str(e)}",
-        ) from e
+            detail="Failed to start agent",
+        )

As per coding guidelines, Never write except Exception as e; catch specific exceptions instead.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/runner/http/api.py` around lines 75 - 86, The
handler currently catches all exceptions from launcher.start_session (see
launcher.start_session) and returns the raw exception text in the 500 response;
change this to catch only known specific exceptions you expect (besides
SessionLimitExceeded) and remove the broad "except Exception as e" clause — for
unexpected errors keep logger.exception("Failed to start agent") to record full
details but return a generic HTTPException with
status.HTTP_500_INTERNAL_SERVER_ERROR and a non-sensitive message like "Failed
to start agent" (no str(e)); if you must handle unknown errors, re-raise or wrap
them so clients never see internal exception text while logs retain the full
stack trace.

agents-core/vision_agents/core/agents/agent_launcher.py (1)

175-182: ⚠️ Potential issue | 🟠 Major

Don’t set _running before startup succeeds.

If warmup or registry initialization fails, _running remains True, and subsequent start() calls will fail with RuntimeError even though startup never completed.

🐛 Proposed fix

 async def start(self) -> None:
@@
-    self._running = True
-    await self.warmup()
-    await self._registry.start()
+    await self.warmup()
+    await self._registry.start()
+    self._running = True
     self._maintenance_task = asyncio.create_task(self._maintenance_loop())

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 175 -
182, The code sets self._running = True before startup finishes, causing future
start() calls to raise RuntimeError if warmup() or self._registry.start() fail;
change AgentLauncher.start so that self._running is only set to True after await
self.warmup() and await self._registry.start() complete successfully and after
self._maintenance_task is created; additionally, ensure that if creating the
maintenance task raises, you do not leave _running True (i.e., set _running
after all startup steps succeed or roll back _running on exception) — see
AgentLauncher.start, warmup, _registry.start, _maintenance_task and _running for
locations to modify.

♻️ Duplicate comments (4)

tests/test_agents/test_session_registry/test_redis_store.py (1)
31-31: ⚠️ Potential issue | 🟠 Major

Mark Redis-dependent test class as integration.

TestRedisSessionKVStore requires Docker/Redis runtime infrastructure. It should be explicitly marked as an integration test.
♻️ Proposed fix
+@pytest.mark.integration
 class TestRedisSessionKVStore:
As per coding guidelines, Use @pytest.mark.integration to mark integration tests.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_session_registry/test_redis_store.py` at line 31, Mark
the Redis-dependent test class TestRedisSessionKVStore as an integration test by
adding the pytest integration marker: import pytest if missing and place
`@pytest.mark.integration` directly above the TestRedisSessionKVStore class
definition so the test suite treats it as an integration test requiring
Docker/Redis infrastructure.
agents-core/vision_agents/core/agents/session_registry/registry.py (1)
26-35: ⚠️ Potential issue | 🟠 Major

Validate ttl in the constructor.

ttl <= 0 is currently accepted. Rejecting invalid TTLs early prevents backend-specific expiry failures and inconsistent behavior.
🛡️ Proposed fix
 def __init__(
     self,
     store: SessionKVStore | None = None,
     *,
     node_id: str | None = None,
     ttl: float = 30.0,
 ) -> None:
+    if ttl <= 0:
+        raise ValueError("ttl must be > 0")
     self._store = store or InMemorySessionKVStore()
     self._node_id = node_id or str(uuid4())
     self._ttl = ttl
As per coding guidelines, Raise ValueError with a descriptive message for invalid constructor arguments.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py` around
lines 26 - 35, Constructor __init__ accepts non-positive ttl values which can
cause backend expiry issues; add validation in __init__ (class using
SessionKVStore/InMemorySessionKVStore and fields _ttl/_node_id) to check the ttl
argument is > 0 and raise ValueError with a clear message (e.g., "ttl must be >
0") when invalid; perform this check before assigning to self._ttl so invalid
objects are not created.
tests/test_agents/test_agent_launcher.py (1)
71-72: ⚠️ Potential issue | 🟠 Major

Mark the Redis fixture variant as integration-only.

The "redis" fixture branch uses RedisContainer, so it should be integration-scoped to keep default test runs isolated from external infra.
♻️ Proposed fix
-@pytest.fixture(params=["memory", "redis"])
+@pytest.fixture(
+    params=[
+        "memory",
+        pytest.param("redis", marks=pytest.mark.integration),
+    ]
+)
 async def storage(request, redis_url) -> AsyncIterator[SessionKVStore]:
As per coding guidelines, Use @pytest.mark.integration to mark integration tests.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_agent_launcher.py` around lines 71 - 72, The
parameterized fixture storage currently includes a "redis" branch that relies on
RedisContainer; mark that variant as integration-only by replacing the raw
"redis" string in the params list with a pytest.param entry that applies
pytest.mark.integration (e.g., pytest.param("redis",
marks=pytest.mark.integration)). Update the `@pytest.fixture`(params=[...])
declaration in the storage fixture and ensure pytest is imported so the mark is
available.
agents-core/vision_agents/core/agents/agent_launcher.py (1)
466-480: ⚠️ Potential issue | 🟠 Major

Narrow catch-all exception handlers in maintenance paths.

Both maintenance methods catch Exception, which can hide non-storage logic bugs and make failures hard to detect.

As per coding guidelines, Never write except Exception as e; catch specific exceptions instead.

Also applies to: 497-501
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/agent_launcher.py` around lines 466 -
480, The try/except around building sessions_map and calling
self._registry.get_close_requests is too broad; replace the bare "except
Exception" with narrowly targeted exception handlers for the errors the registry
call can raise (e.g., RegistryError, TimeoutError, ConnectionError or the
specific storage/client exceptions your registry library defines) and log those
with logger.exception including context (session id list) while allowing truly
unexpected exceptions to propagate (or re-raise after logging). Apply the same
change to the similar maintenance block that uses self._registry (the later
try/except around lines 497-501), referencing the same symbols: sessions_map,
self._registry.get_close_requests, flagged, to_close, and logger.exception.
Ensure you import or reference the specific exception classes used by the
registry client and do not swallow other exceptions.

🧹 Nitpick comments (1)

tests/test_agents/test_agent_launcher.py (1)

568-569: await asyncio.sleep(0) is a flaky synchronization point.

A single loop yield is not deterministic for async registry cleanup (especially Redis-backed). Prefer a bounded wait/poll on an explicit condition instead of tick-based timing.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/test_agents/test_agent_launcher.py` around lines 568 - 569, Replace the
flaky "await asyncio.sleep(0)" yield with a bounded wait/poll that checks the
explicit cleanup condition (e.g., that registry.remove() has completed or that
the item is no longer present) until success or a short timeout; implement this
by polling the registry state (for example using registry.contains(...),
registry.get(...), or whatever API verifies removal) in a small sleep loop or
wrapping the check with asyncio.wait_for to fail fast if the removal never
occurs, and use clear timeout and assertion on the final condition instead of a
single loop yield.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 316-335: The done-callback scheduling removal can run before
register completes, so move the callback registration to after the registry
registration to avoid races: create AgentSession and await
self._registry.register(call_id, session.id) first (ensuring
self._sessions[agent.id] = session is set), then attach
task.add_done_callback(partial(_finalizer, agent.id, call_id)); keep _finalizer/
remove logic unchanged but ensure the register completes before the callback can
schedule remove.

---

Outside diff comments:
In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 175-182: The code sets self._running = True before startup
finishes, causing future start() calls to raise RuntimeError if warmup() or
self._registry.start() fail; change AgentLauncher.start so that self._running is
only set to True after await self.warmup() and await self._registry.start()
complete successfully and after self._maintenance_task is created; additionally,
ensure that if creating the maintenance task raises, you do not leave _running
True (i.e., set _running after all startup steps succeed or roll back _running
on exception) — see AgentLauncher.start, warmup, _registry.start,
_maintenance_task and _running for locations to modify.

In `@agents-core/vision_agents/core/runner/http/api.py`:
- Around line 75-86: The handler currently catches all exceptions from
launcher.start_session (see launcher.start_session) and returns the raw
exception text in the 500 response; change this to catch only known specific
exceptions you expect (besides SessionLimitExceeded) and remove the broad
"except Exception as e" clause — for unexpected errors keep
logger.exception("Failed to start agent") to record full details but return a
generic HTTPException with status.HTTP_500_INTERNAL_SERVER_ERROR and a
non-sensitive message like "Failed to start agent" (no str(e)); if you must
handle unknown errors, re-raise or wrap them so clients never see internal
exception text while logs retain the full stack trace.

---

Duplicate comments:
In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 466-480: The try/except around building sessions_map and calling
self._registry.get_close_requests is too broad; replace the bare "except
Exception" with narrowly targeted exception handlers for the errors the registry
call can raise (e.g., RegistryError, TimeoutError, ConnectionError or the
specific storage/client exceptions your registry library defines) and log those
with logger.exception including context (session id list) while allowing truly
unexpected exceptions to propagate (or re-raise after logging). Apply the same
change to the similar maintenance block that uses self._registry (the later
try/except around lines 497-501), referencing the same symbols: sessions_map,
self._registry.get_close_requests, flagged, to_close, and logger.exception.
Ensure you import or reference the specific exception classes used by the
registry client and do not swallow other exceptions.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py`:
- Around line 26-35: Constructor __init__ accepts non-positive ttl values which
can cause backend expiry issues; add validation in __init__ (class using
SessionKVStore/InMemorySessionKVStore and fields _ttl/_node_id) to check the ttl
argument is > 0 and raise ValueError with a clear message (e.g., "ttl must be >
0") when invalid; perform this check before assigning to self._ttl so invalid
objects are not created.

In `@tests/test_agents/test_agent_launcher.py`:
- Around line 71-72: The parameterized fixture storage currently includes a
"redis" branch that relies on RedisContainer; mark that variant as
integration-only by replacing the raw "redis" string in the params list with a
pytest.param entry that applies pytest.mark.integration (e.g.,
pytest.param("redis", marks=pytest.mark.integration)). Update the
`@pytest.fixture`(params=[...]) declaration in the storage fixture and ensure
pytest is imported so the mark is available.

In `@tests/test_agents/test_session_registry/test_redis_store.py`:
- Line 31: Mark the Redis-dependent test class TestRedisSessionKVStore as an
integration test by adding the pytest integration marker: import pytest if
missing and place `@pytest.mark.integration` directly above the
TestRedisSessionKVStore class definition so the test suite treats it as an
integration test requiring Docker/Redis infrastructure.

---

Nitpick comments:
In `@tests/test_agents/test_agent_launcher.py`:
- Around line 568-569: Replace the flaky "await asyncio.sleep(0)" yield with a
bounded wait/poll that checks the explicit cleanup condition (e.g., that
registry.remove() has completed or that the item is no longer present) until
success or a short timeout; implement this by polling the registry state (for
example using registry.contains(...), registry.get(...), or whatever API
verifies removal) in a small sleep loop or wrapping the check with
asyncio.wait_for to fail fast if the removal never occurs, and use clear timeout
and assertion on the final condition instead of a single loop yield.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between f2cddbb and f8e4994.

📒 Files selected for processing (9)

agents-core/vision_agents/core/agents/agent_launcher.py
agents-core/vision_agents/core/agents/session_registry/redis_store.py
agents-core/vision_agents/core/agents/session_registry/registry.py
agents-core/vision_agents/core/agents/session_registry/store.py
agents-core/vision_agents/core/runner/http/api.py
tests/test_agents/test_agent_launcher.py
tests/test_agents/test_runner.py
tests/test_agents/test_session_registry/test_redis_store.py
tests/test_agents/test_session_registry/test_registry.py

✅ Files skipped from review due to trivial changes (1)

tests/test_agents/test_session_registry/test_registry.py

agents-core/vision_agents/core/agents/agent_launcher.py

coderabbitai

♻️ Duplicate comments (1)

agents-core/vision_agents/core/agents/session_registry/registry.py (1)
26-35: ⚠️ Potential issue | 🟠 Major

Validate ttl in __init__ before assigning state.

Non-positive TTL values can create immediately invalid expiry behavior. Reject them early with a descriptive ValueError.
As per coding guidelines, "Raise ValueError with a descriptive message for invalid constructor arguments; prefer custom domain exceptions over generic ones".
💡 Proposed fix
     def __init__(
         self,
         store: SessionKVStore | None = None,
         *,
         node_id: str | None = None,
         ttl: float = 30.0,
     ) -> None:
+        if ttl <= 0:
+            raise ValueError("ttl must be > 0")
         self._store = store or InMemorySessionKVStore()
         self._node_id = node_id or str(uuid4())
         self._ttl = ttl
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@agents-core/vision_agents/core/agents/session_registry/registry.py` around
lines 26 - 35, Validate the ttl argument in the __init__ constructor before
assigning instance state: check that ttl is a positive number (e.g., ttl > 0)
and if not raise a ValueError with a descriptive message like "ttl must be
positive, got {ttl}". Update the __init__ of the SessionRegistry (the method
setting self._ttl) to perform this validation prior to setting self._ttl (and
before any other state depends on it) so non-positive TTLs are rejected early.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@agents-core/vision_agents/core/agents/session_registry/registry.py`:
- Around line 26-35: Validate the ttl argument in the __init__ constructor
before assigning instance state: check that ttl is a positive number (e.g., ttl
> 0) and if not raise a ValueError with a descriptive message like "ttl must be
positive, got {ttl}". Update the __init__ of the SessionRegistry (the method
setting self._ttl) to perform this validation prior to setting self._ttl (and
before any other state depends on it) so non-positive TTLs are rejected early.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

Linear integration is disabled

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between f8e4994 and 028a5ad.

📒 Files selected for processing (4)

agents-core/vision_agents/core/agents/session_registry/registry.py
agents-core/vision_agents/core/agents/session_registry/types.py
tests/test_agents/test_session_registry/test_in_memory_store.py
tests/test_agents/test_session_registry/test_redis_store.py

🚧 Files skipped from review as they are similar to previous changes (1)

tests/test_agents/test_session_registry/test_redis_store.py

examples/08_agent_server_example/README.md

agents-core/vision_agents/core/agents/agent_launcher.py

agents-core/vision_agents/core/agents/session_registry/registry.py

agents-core/vision_agents/core/runner/http/api.py

agents-core/vision_agents/core/agents/agent_launcher.py

agents-core/vision_agents/core/agents/session_registry/registry.py

agents-core/vision_agents/core/runner/http/api.py

agents-core/vision_agents/core/agents/session_registry/in_memory_store.py

Comments were addressed

dangusev requested a review from aliev February 25, 2026 14:54

github-actions bot added dependencies agents-core examples config docs project-info labels Feb 25, 2026

coderabbitai bot reviewed Feb 25, 2026

View reviewed changes

dangusev force-pushed the feature/agent-coordinator branch from b387971 to 221c226 Compare February 25, 2026 15:09

coderabbitai bot reviewed Feb 25, 2026

View reviewed changes

dangusev added 4 commits February 25, 2026 23:43

Rename storage -> store

c5e49d9

Fix pipe.set() call

58b689c

Cleanup method signature

4a285cf

Remove publish() and subscribe() from SessionKVStore

f2cddbb

Fix logger.error call

ee8685f

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

dangusev added 5 commits February 26, 2026 12:22

Implement async context manager for SessionRegistry

36b9956

Parametrize SessionRegistry tests

6a68217

Update log message

c6073e4

Fix pipe.pexpire await

9bfc081

Require call_id to access session data on the shared storage

f8e4994

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

agents-core/vision_agents/core/agents/agent_launcher.py Show resolved Hide resolved

dangusev added 3 commits February 26, 2026 14:38

Increase sleep timeouts in expiry tests

13c2823

Ignore extra keys when reading SessionInfo from the storage

028a5ad

Validate ttl and cleanup interval

2f89eeb

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

More tests

17029da

dangusev added 2 commits February 26, 2026 14:54

Fix possible race condition on session start

813b8fe

Improve handling of the missing "redis" module

0592a7d

aliev previously requested changes Feb 27, 2026

View reviewed changes

dangusev added 10 commits February 27, 2026 17:43

Fix wrapping exceptions into str()

b318078

InMemorySessionKVStore: make start() idempotent

b86abd2

InMemorySessionKVStore: fix typo

eb1db6b

REST API: extract close_session into a helper func

98a8f13

REST API: use 503 for /ready error code

5f9116a

AgentLauncher: log on start_session failure

b971637

AgentLauncher: validate call ids on start_session

8a8b7d0

SessionRegistry: update metrics only if the session is not yet expired

39818b8

AgentLauncher: keep references for the session cleanup tasks

bde4d76

Fix tests

8b4a343

Nash0x7E2 approved these changes Feb 27, 2026

View reviewed changes

dangusev merged commit 10fdeb0 into main Feb 27, 2026
10 checks passed

dangusev deleted the feature/agent-coordinator branch February 27, 2026 20:12

Conversation

dangusev commented Feb 25, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cursor bot commented Feb 26, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dangusev commented Feb 25, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 25, 2026 •

edited

Loading