Runner._get_or_create_session loads all events ignoring EventsCompactionConfig, causing linear performance degradation

## Describe the bug

When `EventsCompactionConfig` is configured on an `App`, the compaction mechanism correctly summarizes old events and appends compaction events to the session. However, **`Runner._get_or_create_session` still calls `get_session` without any `GetSessionConfig`**, loading the **full event history** on every invocation.

This means compaction only reduces the LLM context window — it does **nothing** to reduce the session loading overhead. As conversations grow longer, `get_session` latency degrades linearly, even though compaction has already produced summaries that could make the old raw events unnecessary for the Runner.

### The architectural disconnect

```
EventsCompactionConfig (configured on App)
    → Compacts events for LLM context ✅
    → Does NOT reduce get_session load ❌

Runner._get_or_create_session (runners.py L375)
    → Calls get_session() WITHOUT GetSessionConfig
    → Loads ALL events every time
    → Latency grows linearly with conversation length
```

The `GetSessionConfig` class already supports `num_recent_events` and `after_timestamp` filtering, and all session service implementations (`InMemorySessionService`, `DatabaseSessionService`, `VertexAiSessionService`) honor these parameters. But the Runner never uses them.

## To Reproduce

1. Create an App with `EventsCompactionConfig`:
```python
app = App(
    name="my_app",
    root_agent=agent,
    events_compaction_config=EventsCompactionConfig(
        compaction_interval=5,
        overlap_size=1,
        summarizer=LlmEventSummarizer(llm=my_llm),
    ),
)
```

2. Use `DatabaseSessionService` (PostgreSQL) or any persistent session service
3. Have a multi-turn conversation (10+ rounds)
4. Observe that `get_session` becomes progressively slower on each new invocation

### Real-world numbers

In our production setup with a 6-stage sequential pipeline agent (Science Navigator), each conversation turn generates ~480 events (streaming chunks, tool calls, tool responses, text finals). After 10 turns:

- **~4,800 events** accumulated per session
- **`get_session` takes 70+ seconds** (via `HttpSessionService` → PostgreSQL `DatabaseSessionService`)
- Compaction events exist in the session but the Runner still loads all 4,800 raw events

## Expected behavior

When `EventsCompactionConfig` is configured, the Runner should be aware that compacted summaries exist and avoid loading the full event history. Possible approaches:

1. **Runner should pass `GetSessionConfig` to `get_session`** — After compaction has run, the Runner knows which events are summarized. It should only load the compaction event(s) + recent un-compacted events.

2. **`EventsCompactionConfig` should inform session loading** — The compaction config could derive appropriate `GetSessionConfig` parameters (e.g., only load events after the last compaction timestamp).

3. **At minimum, expose `GetSessionConfig` via `RunConfig`** — As proposed in #3562 and PR #3662, allow users to control how many events are loaded. But ideally the framework should do this automatically when compaction is configured.

## Environment

- **ADK version**: 1.26.0
- **Python**: 3.12
- **Session service**: `DatabaseSessionService` (PostgreSQL, accessed via HTTP microservice)
- **Agent type**: Multi-stage `SequentialAgent` with `ParallelAgent` sub-stages

## Related issues

- #3039 — `VertexAiSessionService.get_session` becomes super slow with many events
- #3562 — Allow limiting num. of Session events fetched when calling `Runner.run_async`
- PR #3662 — `feat(runners): Add get_session_config property to RunConfig` (still open)

## Analysis

The root cause is that compaction and session loading are designed as two independent systems:

1. **Compaction** (in `apps/compaction.py`) runs **post-invocation** and appends summary events
2. **Session loading** (in `runners.py`) runs **pre-invocation** and loads everything

There is no feedback loop between them. Even after compaction has run successfully, the next invocation still loads all original events + the compaction event.

PR #3662 would help as a workaround (letting users manually set `num_recent_events`), but the ideal fix is for the Runner to automatically leverage compaction metadata to optimize session loading — making `EventsCompactionConfig` a true end-to-end optimization rather than just an LLM context optimization.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Runner._get_or_create_session loads all events ignoring EventsCompactionConfig, causing linear performance degradation #4673

Describe the bug

The architectural disconnect

To Reproduce

Real-world numbers

Expected behavior

Environment

Related issues

Analysis

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Runner._get_or_create_session loads all events ignoring EventsCompactionConfig, causing linear performance degradation #4673

Description

Describe the bug

The architectural disconnect

To Reproduce

Real-world numbers

Expected behavior

Environment

Related issues

Analysis

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions