Add limits to AgentLauncher by dangusev · Pull Request #302 · GetStream/Vision-Agents

dangusev · 2026-01-21T13:30:51Z

What's changed

Added limits to AgentLauncher to control agents' behavior over the process lifetime:
- max_concurrent_sessions
- max_sessions_per_call
- max_session_duration_seconds
Passed these limits to REST API too
Wrapped AgentLauncher.start_session() into a lock to accurately check the limits

Summary by CodeRabbit

New Features
- Adds explicit session abstraction, configurable global/per-call concurrency limits, optional max session duration, and 429 responses when limits are exceeded.
- Launcher supports async context management to auto start/stop.
Refactor
- Moves to session-centric tracking and unified cleanup/timeout configuration for more reliable lifecycle management.
Bug Fixes
- More robust shutdown, cancellation and cleanup to ensure sessions are finalized.
Tests
- Expanded coverage for session lifecycle, limits, timeouts, and edge cases.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

…sions limits

coderabbitai · 2026-01-21T13:31:28Z

📝 Walkthrough

Walkthrough

Refactors AgentLauncher to session-based management: adds AgentSession dataclass and explicit _sessions/_calls tracking, introduces concurrency controls (max_concurrent_sessions, max_sessions_per_call, max_session_duration_seconds, cleanup_interval), new session-limit exceptions, updated lifecycle/finalization, async context support, and corresponding tests. (38 words)

Changes

Cohort / File(s)	Summary
Exception Definitions `agents-core/vision_agents/core/agents/exceptions.py`	New session-related exceptions: `SessionLimitExceeded` (base), `MaxConcurrentSessionsExceeded`, `MaxSessionsPerCallExceeded`.
Agent launcher & session model `agents-core/vision_agents/core/agents/agent_launcher.py`	Adds `AgentSession` dataclass; replaces weakref `_active_agents` with explicit `_sessions` and `_calls`; new init params `max_concurrent_sessions`, `max_sessions_per_call`, `max_session_duration_seconds`, `cleanup_interval` with validation; enforces limits in `start_session` (raises new exceptions); attaches finalizer to session tasks; updates `close_session`, `_cleanup_idle_sessions`, `start`/`stop`, and adds `__aenter__/__aexit__`.
Agent API `agents-core/vision_agents/core/agents/agents.py`	Adds `Agent.on_call_for()` to report seconds since join (0.0 if not joined).
Runner cancel handling `agents-core/vision_agents/core/runner/runner.py`	Catches `asyncio.CancelledError` in `_run()` to log and prevent propagation during shutdown.
HTTP runner surface `agents-core/vision_agents/core/runner/http/api.py`	Start-session endpoint documents 429 responses and maps `SessionLimitExceeded` to HTTP 429.
Tests (session-focused) `tests/test_agents/test_agent_launcher.py`, `tests/test_agents/test_runner.py`	Tests updated to use `start_session` / `AgentSession`; validate lifecycle, cleanup after finish/cancel, concurrency limits (including None as unlimited), limit enforcement and recovery, cleanup_interval semantics, and context-managed launcher usage.

Sequence Diagram(s)

sequenceDiagram
    autonumber
    participant Client as Client
    participant Launcher as AgentLauncher
    participant Agent as Agent
    participant Cleaner as CleanupTask

    Client->>Launcher: start_session(call_id, ...)
    Launcher->>Launcher: validate limits (max_concurrent, per-call)
    Launcher->>Agent: launch agent task (async)
    Launcher->>Launcher: create & store AgentSession in _sessions/_calls
    Agent->>Agent: run agent logic (task)
    loop periodic
        Cleaner->>Launcher: _cleanup_idle_sessions scan
        alt session idle or expired
            Cleaner->>Agent: request close/cancel
        end
    end
    Agent->>Launcher: task completes/cancelled
    Launcher->>Launcher: finalizer removes session from _sessions/_calls

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related PRs

Agent HTTP server #284 — Overlapping AgentLauncher session-management additions and HTTP wiring.
Close idle agents after some timeout & Agent clean up #279 — Related refactor of agent lifecycle and idle-cleanup logic into session semantics.
[AI-269] Eager turn taking #169 — Changes to Agent class (lifecycle/turn handling) that may intersect with new on_call_for additions.

Suggested reviewers

Nash0x7E2

Poem

I counted rooms where nothing answered,
The sessions hummed like trapped small birds;
A partial hand stitched their thin exits,
And one by one the lamps went black—then words.

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 27.38% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: introducing concurrency and session limits to AgentLauncher through new parameters like max_concurrent_sessions and max_sessions_per_call.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

tests/test_agents/test_agent_launcher.py (1)

41-45: Replace unittest.mock usage with fakes or test-specific subclasses.

Project guidelines forbid mocks in tests. Replace MagicMock at lines 41-45 and three instances of patch.object at lines 103, 133, and 164 with lightweight fakes: a simple DummyEdge class for the fixture, and an IdleAgent subclass that overrides idle_for(). This follows the existing pattern in the file (DummyTTS, DummyLLM).

🧩 Example pattern (apply similarly across tests)

-from unittest.mock import MagicMock, patch
+from dataclasses import dataclass
...
+@dataclass
+class DummyEdge:
+    events: EventManager
...
 `@pytest.fixture`()
 async def stream_edge_mock() -> MagicMock:
-    mock = MagicMock()
-    mock.events = EventManager()
-    return mock
+    return DummyEdge(events=EventManager())
...
+class IdleAgent(Agent):
+    def idle_for(self) -> float:
+        return 10.0
+
 class TestAgentLauncher:
     async def test_idle_sessions_stopped(self, stream_edge_mock):
         ...
         async def create_agent(**kwargs) -> Agent:
-            return Agent(
+            return IdleAgent(
                 llm=llm,
                 tts=tts,
                 edge=stream_edge_mock,
                 agent_user=User(name="test"),
             )
-        with patch.object(Agent, "idle_for", return_value=10):
-            async with launcher:
-                session1 = await launcher.start_session(call_id="1")
-                session2 = await launcher.start_session(call_id="2")
-                await asyncio.sleep(2)
+        async with launcher:
+            session1 = await launcher.start_session(call_id="1")
+            session2 = await launcher.start_session(call_id="2")
+            await asyncio.sleep(2)

agents-core/vision_agents/core/agents/agent_launcher.py (1)

62-94: max_session_duration_seconds is validated but never enforced.

The parameter is accepted and validated in __init__ (lines 88–93) but never applied. No code uses self._max_session_duration_seconds after assignment—the task created in start_session (lines 205–206) has no timeout wrapper. Implement the timeout enforcement (e.g., wrap join_call with asyncio.wait_for) or remove the parameter to prevent misleading API behavior.
✅ Example enforcement at task creation
-        task = asyncio.create_task(
-            self._join_call(agent, call_type, call_id), name=f"agent-{agent.id}"
-        )
+        join_coro = self._join_call(agent, call_type, call_id)
+        if self._max_session_duration_seconds is not None:
+            join_coro = asyncio.wait_for(
+                join_coro, timeout=self._max_session_duration_seconds
+            )
+        task = asyncio.create_task(join_coro, name=f"agent-{agent.id}")

🤖 Fix all issues with AI agents

In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 187-199: The session limit checks in start_session are race-prone
because they use == and perform checks before awaiting; change the comparisons
to >= and add explicit None guards for call lookup (use self._calls.get(call_id)
is not None or check call_set = self._calls.get(call_id, set()) and treat None
safely), and make the check-and-reserve sequence atomic by protecting it with an
async primitive (e.g., an asyncio.Lock or an asyncio.Semaphore) on the
AgentLauncher instance; specifically update the block referencing
self._sessions, sessions_total, self._max_concurrent_sessions, self._calls,
call_sessions_total and self._max_sessions_per_call so the code acquires the
lock, re-evaluates >= limits, then registers the new session before releasing
the lock to prevent concurrent over-allocation.

🧹 Nitpick comments (1)

agents-core/vision_agents/core/agents/agent_launcher.py (1)

212-218: Remove empty call entries from _calls to prevent growth.

After discarding the last session for a call, the empty set remains. Consider pruning the key when the set becomes empty.

♻️ Suggested cleanup

         def _finalizer(session_id_: str, call_id_: str, *_):
             session_ = self._sessions.pop(session_id_, None)
             if session_ is not None:
                 call_sessions = self._calls.get(call_id_, set())
                 if call_sessions:
                     call_sessions.discard(session_id_)
+                    if not call_sessions:
+                        self._calls.pop(call_id_, None)
...
         call_sessions = self._calls.get(session.call_id)
         if call_sessions:
             call_sessions.discard(session.id)
+            if not call_sessions:
+                self._calls.pop(session.call_id, None)

Also applies to: 228-229, 249-251

agents-core/vision_agents/core/agents/agent_launcher.py

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@agents-core/vision_agents/core/agents/agent_launcher.py`:
- Around line 397-413: The cleanup logic is closing sessions incorrectly because
it unconditionally compares idle_for and on_call_for against potentially
disabled timeout values; update the checks in the loop that calls
agent.on_call_for() and agent.idle_for() to first guard that the timeouts are
enabled: only perform the idle check if self._agent_idle_timeout > 0 (e.g., if
self._agent_idle_timeout > 0 and idle_for >= self._agent_idle_timeout) and only
perform the max-duration check if self._max_session_duration_seconds is not None
(e.g., if self._max_session_duration_seconds is not None and on_call_for >=
self._max_session_duration_seconds), preserving the existing logger.info
messages and adding the agent to to_close when those guarded conditions are met.

♻️ Duplicate comments (1)

agents-core/vision_agents/core/agents/agent_launcher.py (1)

264-276: Session limit checks remain race-prone.

This concern was raised in a previous review. The == comparisons occur before await, allowing concurrent calls to both pass before either registers. This is a known issue pending resolution.

🧹 Nitpick comments (2)

agents-core/vision_agents/core/agents/agent_launcher.py (1)
286-296: Minor: Comment claims GC-based finalization but only done callback is used.

The comment mentions "when the AgentSession is garbage-collected" but only add_done_callback is registered. The done callback suffices for cleanup, but the comment is misleading.
Suggested comment fix
         # Remove the session when the task is done
-        # or when the AgentSession is garbage-collected
-        # in case the done callback wasn't fired
+        # The done callback ensures cleanup when the task completes
         def _finalizer(session_id_: str, call_id_: str, *_):
tests/test_agents/test_agent_launcher.py (1)

3-3: Mocking usage violates coding guidelines.

The coding guidelines state "Never mock in tests" but unittest.mock is imported and used extensively (e.g., patch.object on Agent.idle_for and Agent.on_call_for). Based on coding guidelines, consider alternative approaches like injectable time providers or test-specific subclasses.

agents-core/vision_agents/core/agents/agent_launcher.py

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

agents-core/vision_agents/core/agents/agent_launcher.py (1)
176-182: Broad exception handling during shutdown.

The except Exception pattern violates the coding guideline requiring specific exception handling. Consider catching more specific exceptions (e.g., asyncio.CancelledError, RuntimeError) or at minimum document why broad catching is intentional here.
Suggested fix
             try:
                 await result
-            except Exception as exc:
+            except (asyncio.CancelledError, RuntimeError) as exc:
                 logger.error(f"Failed to cancel the agent task: {exc}")
Based on coding guidelines for **/*.py.

🤖 Fix all issues with AI agents

In `@tests/test_agents/test_agent_launcher.py`:
- Around line 635-665: Remove the patch.object call in
test_sessions_alive_with_max_session_duration_none and instead inject the
behavior via a pytest fixture: provide a fixture (used by the test) that
supplies create_agent which returns an Agent instance or subclass with
on_call_for implemented to return 10 (or use pytest's monkeypatch to set
Agent.on_call_for), so the test uses that fixture's create_agent when
constructing AgentLauncher; do not mock within the test body and do not patch
idle_for since it naturally returns 0.0 in this scenario.

♻️ Duplicate comments (2)

agents-core/vision_agents/core/agents/agent_launcher.py (2)

264-282: Session limit checks remain race-prone.

The comparisons still use == instead of >=, allowing concurrent calls to bypass limits. This was flagged in a previous review and remains unaddressed.

408-415: Idle timeout check still closes sessions when agent_idle_timeout=0.

When agent_idle_timeout=0 (intended to disable idle cleanup per docstring), but max_session_duration_seconds is set, the loop runs and idle_for >= 0 evaluates true for all sessions since idle_for() returns non-negative values. This was flagged in a previous review.

🧹 Nitpick comments (1)

tests/test_agents/test_agent_launcher.py (1)

3-3: Mocking usage violates coding guidelines.

The coding guidelines specify "Never mock in tests" for test files, but this file imports and uses MagicMock and patch extensively. Consider refactoring to use test doubles or configurable behavior in the production classes instead.

Based on coding guidelines for **/*test*.py.

tests/test_agents/test_agent_launcher.py

dangusev added 2 commits January 21, 2026 14:10

AgentLauncher: implement max_sessions_per_call and max_concurrent_ses…

ea64b93

…sions limits

AgentLauncher: update tests

ce889d3

github-actions bot added the agents-core label Jan 21, 2026

Fix ruff

4e94b3e

coderabbitai bot reviewed Jan 21, 2026

View reviewed changes

agents-core/vision_agents/core/agents/agent_launcher.py Outdated Show resolved Hide resolved

dangusev added 2 commits January 21, 2026 18:01

Fix cancellation handling in run mode

fdb5bcf

Add docstrings

0de6b68

coderabbitai bot reviewed Jan 21, 2026

View reviewed changes

agents-core/vision_agents/core/agents/agent_launcher.py Show resolved Hide resolved

Fix tests

567b25d

coderabbitai bot reviewed Jan 21, 2026

View reviewed changes

tests/test_agents/test_agent_launcher.py Show resolved Hide resolved

dangusev added 3 commits January 22, 2026 13:00

Add limits to Runner API

e5ccead

Fix possible limit overflow in AgentLauncher.start_session

b898fa9

Fix idleness check in AgentLauncher

f45964e

Nash0x7E2 approved these changes Jan 22, 2026

View reviewed changes

Nash0x7E2 merged commit dfd68db into main Jan 22, 2026
10 checks passed

Nash0x7E2 deleted the feature/agent-launcher-limits branch January 22, 2026 17:08

coderabbitai bot mentioned this pull request Jan 23, 2026

Fix Agent warnings #310

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add limits to AgentLauncher#302

Add limits to AgentLauncher#302
Nash0x7E2 merged 9 commits intomainfrom
feature/agent-launcher-limits

dangusev commented Jan 21, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Jan 21, 2026 •

edited

Loading

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dangusev commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's changed

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dangusev commented Jan 21, 2026 •

edited

Loading

coderabbitai bot commented Jan 21, 2026 •

edited

Loading