perf(agent_server)!: trim conversation skills by default (include_skills=false) by simonrosenberg · Pull Request #3316 · OpenHands/software-agent-sdk

simonrosenberg · 2026-05-20T08:06:12Z

Closes #3301. Companion to the closed PR #3302 (in-model field_serializer); this PR takes a simpler route-boundary approach.

⚠️ BREAKING CHANGE — see Breaking change below.

TL;DR

The conversation read endpoints now strip agent.agent_context.skills from the response by default. Callers that still need the legacy full-payload shape can opt back in with ?include_skills=true.

Persisted ConversationState on disk: untouched.
In-memory AgentContext.skills (what the prompt renderer reads): untouched.
Default HTTP response shape: agent.agent_context.skills == [] (BREAKING).
Legacy opt-in (?include_skills=true): the prior shape, with the full ~260 KB skills array.

Breaking change

GET /conversations* and the deprecated ACP equivalents previously returned the full skills list inline; they now return [] unless the caller passes ?include_skills=true.

Audit of consumers of agent.agent_context.skills from HTTP responses:

agent-canvas — does not read this field (only reads agent.kind and agent.llm.model from ConversationInfo).
OpenHands/OpenHands (app-server) — does not read it; the field is consumed in-process via the agent's runtime, not from HTTP.
SDK examples — use LocalConversation (in-process), not RemoteConversation.

No known caller is affected. Anyone with a custom integration that does read conversation.agent.agent_context.skills from a GET /conversations* response can append ?include_skills=true to preserve the prior shape — documented as a legacy escape hatch, not the steady-state path.

Why default-on (revised from the opt-in revision)

An earlier revision of this PR defaulted to include_skills=true and required callers to opt into the trim. Per maintainer feedback, since no client actually reads the field from HTTP, the opt-in shape was carrying ~260 KB of payload that nothing consumes. Flipping the default is the right cost/benefit: every consumer gets the slimmer payload for free, and the rare legacy caller pays a documented opt-in.

Empirical measurements (honest)

The wire-level reduction is rock-solid. The user-perceived cold-open improvement I claimed in earlier revisions (-42%) does not reproduce under careful measurement — it was a Vite cold-start artifact in the baseline, not the trim's effect.

Wire-level (reproducible, 5 GETs each)

Request	`?include_skills=true` (legacy)	Default (trimmed)	Delta
`GET /api/conversations?ids=…` (40-skill conv)	265 KB, 3.3 ms server	4.7 KB, 2.4 ms server	-98% bytes, -27% server time

Browser cold-cold-cold (3 trials; servers killed, Vite cache wiped between trials)

Trial	Baseline (full payload)	With trim	Per-trial delta
1	5144 ms	4889 ms	-255 ms
2	4956 ms	4990 ms	+34 ms
3	4970 ms	4883 ms	-87 ms
Median	4970 ms	4889 ms	-81 ms (-1.6%)
Mean	5023 ms	4921 ms	-102 ms (-2.0%)

Run-to-run variance is ~200 ms; the trim's effect is ~100 ms. That's at the edge of statistical significance from three trials — directionally consistent (2/3 trials favor the trim) but small enough that it's close to the noise floor.

Order-of-magnitude reconciliation: network transfer (~3 ms saved) + JSON.parse (~7 ms) + React Query structural-share (~7 ms) + server serialization (~1 ms) = ~20 ms expected. Observed ~100 ms — larger than back-of-envelope math, possibly amplified through React reconciliation / GC pressure.

Where the trim actually wins

Network bandwidth on cloud backends (100ms+ RTT) — full 260 KB per fetch is a real cost over the wire; 4.7 KB is not.
Sustained polling load — over a long session, the cumulative bytes drop ~98%.
Server-side CPU — 27% less serialization time per request.
Client memory — 260 KB × N conversations in React Query's cache → ~5 KB × N.

Where it doesn't win

Cold-open wall-clock on a fast localhost dev box. End-users on localhost won't feel a difference. The dev experience is the same.

Why this exists alongside #3302

#3302 attempted the same win via a @field_serializer("skills") on AgentContext. It works, has 20 tests, and addresses every edge case the bot review caught — but the patch grew to 5 commits / ~150 lines / multiple PrivateAttr snapshots / three context flags (preserve_full_skills, loading_from_snapshot, round_trip) / config-drift detection / deep copies / a model_copy-merge dance in ConversationState factory. Each guard fixes a real edge case; the underlying reason there are so many is that the model is trying to keep three different truths in sync (in-memory full, wire trimmed, persisted full).

This PR moves the optimization to where the bytes actually leave the server. Side-by-side:

Concern	#3302 (in-model serializer)	This PR (route default-on)
Backward-compatible public API	Default trim ⇒ `RemoteConversation.agent.agent_context.skills` returns `[]` from a server that has 40 skills	Same breaking semantics; explicit `?include_skills=true` opt-in to restore
Persistence (`_save_base_state`) keeps full snapshot	Opt-out via `context={"preserve_full_skills": True}`	Untouched — persistence path bypasses HTTP entirely
`model_dump(round_trip=True)` round-trips losslessly	Opt-out via `info.round_trip` check	Untouched
`model_copy(update={"skills": merged})` keeps replacement	Equality-based filter via snapshot	Untouched
Snapshot drift (marketplace adds skill `b`)	`loading_from_snapshot` context flag	Untouched
Lines of code	~150 in `agent_context.py` + ~30 in `state.py`	~50 in `models.py` + `conversation_router*.py`
Test count	20	14

What changes

openhands-agent-server/openhands/agent_server/models.py
- INCLUDE_SKILLS_PARAM_TITLE constant (used by both routers for the OpenAPI param docstring).
- trim_conversation_response_skills(info) -> ConversationInfo helper.
- Drive-by: migrates ConversationSortOrder / EventSortOrder to StrEnum to clear pre-existing UP042 lint that the pre-commit hook blocks on.
openhands-agent-server/openhands/agent_server/conversation_router.py — adds include_skills: bool = False query param at search, get_conversation, batch_get_conversations, start_conversation, fork_conversation. Default False (trimmed); True restores the legacy shape. search route handler is non-destructive: builds a fresh ConversationPage via model_copy instead of mutating page.items.
openhands-agent-server/openhands/agent_server/conversation_router_acp.py — same on the deprecated ACP equivalents.
tests/agent_server/test_conversation_router_skill_trim.py — 14 tests covering the helper, default-trim branch, include_skills=true opt-in branch, null-item handling, byte-count comparison.

Test plan

pytest tests/agent_server/test_conversation_router_skill_trim.py — 14 passed.
pytest tests/agent_server/ -q --deselect TestAutoTitle::test_autotitle_sets_title_on_first_user_message — 1038 passed (1 pre-existing failure unrelated).
ruff format + ruff check on the changed files — clean.
End-to-end against agent-canvas: default API response now 4.7 KB / 0 skills (was 265 KB / 40 skills on main); ?include_skills=true restores 265 KB.
Browser cold-cold-cold 3-trial measurement (servers killed + Vite cache wiped between trials): median 4970 → 4889 ms (-81 ms, -1.6%), mean 5023 → 4921 ms (-102 ms, -2.0%), high variance.

🤖 Generated with Claude Code

Agent Server images for this PR

• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant	Architectures	Base Image	Docs / Tags
java	amd64, arm64	`eclipse-temurin:17-jdk`	Link
python	amd64, arm64	`nikolaik/python-nodejs:python3.13-nodejs22-slim`	Link
golang	amd64, arm64	`golang:1.21-bookworm`	Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:23f0d9f-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-23f0d9f-python \
  ghcr.io/openhands/agent-server:23f0d9f-python

All tags pushed for this build

ghcr.io/openhands/agent-server:23f0d9f-golang-amd64
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-golang-amd64
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-golang-amd64
ghcr.io/openhands/agent-server:23f0d9f-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:23f0d9f-golang-arm64
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-golang-arm64
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-golang-arm64
ghcr.io/openhands/agent-server:23f0d9f-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:23f0d9f-java-amd64
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-java-amd64
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-java-amd64
ghcr.io/openhands/agent-server:23f0d9f-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:23f0d9f-java-arm64
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-java-arm64
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-java-arm64
ghcr.io/openhands/agent-server:23f0d9f-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:23f0d9f-python-amd64
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-python-amd64
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-python-amd64
ghcr.io/openhands/agent-server:23f0d9f-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim-amd64
ghcr.io/openhands/agent-server:23f0d9f-python-arm64
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-python-arm64
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-python-arm64
ghcr.io/openhands/agent-server:23f0d9f-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim-arm64
ghcr.io/openhands/agent-server:23f0d9f-golang
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-golang
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-golang
ghcr.io/openhands/agent-server:23f0d9f-golang_tag_1.21-bookworm
ghcr.io/openhands/agent-server:23f0d9f-java
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-java
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-java
ghcr.io/openhands/agent-server:23f0d9f-eclipse-temurin_tag_17-jdk
ghcr.io/openhands/agent-server:23f0d9f-python
ghcr.io/openhands/agent-server:23f0d9fc7a2b6433879a4b7cdd4f22a0abb1cec9-python
ghcr.io/openhands/agent-server:perf-api-route-skill-trim-python
ghcr.io/openhands/agent-server:23f0d9f-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-slim

About Multi-Architecture Support

Each variant tag (e.g., 23f0d9f-python) is a multi-arch manifest supporting both amd64 and arm64
Docker automatically pulls the correct architecture for your platform
Individual architecture tags (e.g., 23f0d9f-python-amd64) are also available if needed

… responses When ``AgentContext`` is constructed with ``load_user_skills=True`` or ``load_public_skills=True``, its model_validator resolves the entire skill catalog (~40 entries in stock setups) and persists them inline. Every ``GET /api/conversations`` / ``/search`` / ``/{id}`` therefore carried ~260 KB of skill content that no API consumer reads — skill bodies are only consumed server-side at prompt-render time, never client-side. Drop ``agent.agent_context.skills`` from the four read endpoints' response payloads (``GET /search``, ``GET /{id}``, ``GET ""``, ``POST ""`` — plus the deprecated ACP equivalents and ``fork``). The persisted ``ConversationState`` on disk and the in-memory copy held by the agent's runtime are untouched. Only the HTTP-response bytes shrink. This is the simpler alternative to PR #3302, which attempted the same goal via a ``@field_serializer`` on ``AgentContext`` itself. See the route-trim PR description for the comparison; the short version is that touching the model has to coordinate across persistence, ``model_copy``, ``round_trip``, equality checks, snapshot drift detection, ``ValidationInfo`` contexts, and a ``state.agent = agent`` overwrite — each of which the reviewer found edge cases in. Trimming at the route boundary avoids all of that: the model stays honest (the wire matches the in-memory shape minus exactly one leaf field), and the optimization lives in one ~10-line helper that's called from 7 route handlers. Measured impact (agent-canvas cold-open of a 40-skill conversation, same browser, same agent-server port, only the SDK swapped): - ``GET /api/conversations?ids=…``: 258 KB → 4.6 KB (-98%) - Cold-open "first event painted": 2226 ms → 1283 ms (-42%) Tests: - ``test_conversation_router_skill_trim.py``: 10 cases covering the helper (no-mutation, identity return on empty, preserve other agent_context fields, agent without agent_context) and FastAPI integration through all four read endpoints, plus a concrete byte-count comparison (trimmed vs ``model_dump_json`` of the same ConversationInfo with skills intact — ~1500+ bytes saved per 5-skill conversation in the test fixture). - 351 tests pass on the affected suites (router tests + sdk/context). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-20T08:06:47Z

Python API breakage checks — ✅ PASSED

Result: ✅ PASSED

Behavioral default changes detected

These public Field(default=...) changes were auto-marked with the release-note-required label:

openhands.sdk.llm.llm.LLM.model: 'claude-sonnet-4-20250514' → 'gpt-5.5'

Action log

github-actions · 2026-05-20T08:06:48Z

REST API breakage checks (OpenAPI) — ✅ PASSED

Result: ✅ PASSED

Action log

github-actions · 2026-05-20T08:08:23Z

Coverage Report •

File	Stmts	Miss	Cover	Missing
openhands-agent-server/openhands/agent_server
conversation_router.py	158	10	93%	314, 405–408, 420–423, 458
conversation_router_acp.py	47	2	95%	138, 162
TOTAL	27331	11828	56%

all-hands-bot

✅ QA Report: PASS

Verified the changed API behavior with real HTTP requests: heavy conversation skills are removed from route responses while the server-side conversation object stays intact.

Does this PR achieve its stated goal?

Yes. The PR set out to trim agent.agent_context.skills at the FastAPI route boundary to shrink ConversationInfo HTTP payloads without changing in-memory/persisted conversation state. Against main, the same heavy conversation returned 40 skills and ~178.5 KB per response; against this PR, standard and deprecated ACP endpoints returned skills: [] and ~2.5 KB responses, while the in-memory object still reported 40 skills before and after API calls.

Phase	Result
Environment Setup	✅ `make build` completed successfully; no tests/linters were run.
CI Status	⏳ 20 checks successful, 2 skipped, 8 still in progress; merge state is blocked by pending checks.
Functional Verification	✅ Real FastAPI server + `curl` requests verified standard and ACP read/start/fork route responses.

Functional Verification

Test 1: Standard conversation read endpoints shrink payloads

Step 1 — Establish baseline without the fix:
Checked out origin/main, started a temporary FastAPI app using the actual conversation router, and made real HTTP requests with curl against a stubbed conversation containing 40 skills × 4096-byte bodies.

Observed:

BASE untrimmed: {"skill_count":40,"model_dump_json_bytes":178553,"page_model_dump_json_bytes":178585}
BASE search: bytes=178585; skills_len=40; first_content_len=4105
BASE get: bytes=178553; skills_len=40; first_content_len=4105
BASE batch: bytes=178555; skills_len=40; first_content_len=4105

This confirms the pre-PR API response carried the full skill list/content over HTTP.

Step 2 — Apply the PR's changes:
Checked out commit 7b67420b66286e2173754c9573762ed58257a1b8.

Step 3 — Re-run with the fix in place:
Started the same FastAPI route app and called the same endpoints.

Observed:

PR untrimmed before: {"skill_count":40,"model_dump_json_bytes":178553,"page_model_dump_json_bytes":178585}
PR search: bytes=2506; skills_len=0; suffix='qa-suffix'
PR get: bytes=2474; skills_len=0; suffix='qa-suffix'
PR batch: bytes=2476; skills_len=0; suffix='qa-suffix'
PR untrimmed after: {"skill_count":40,"model_dump_json_bytes":178553,"page_model_dump_json_bytes":178585}

This shows the HTTP payload drops by ~176 KB and skills is empty in responses, while the server-side conversation still has all 40 skills before and after requests.

Test 2: Deprecated ACP and write-returning routes also trim responses

Using the PR checkout and same running FastAPI app, I exercised the deprecated ACP equivalents plus the standard/ACP start and standard fork endpoints with valid JSON request bodies.

Observed:

POST statuses: standard_start_status=201; standard_fork_status=201; acp_start_status=201
PR acp_search: bytes=2506; skills_len=0; suffix='qa-suffix'
PR acp_get: bytes=2474; skills_len=0; suffix='qa-suffix'
PR acp_batch: bytes=2476; skills_len=0; suffix='qa-suffix'
PR start: bytes=2474; skills_len=0; suffix='qa-suffix'
PR fork: bytes=2474; skills_len=0; suffix='qa-suffix'
PR acp_start: bytes=2474; skills_len=0; suffix='qa-suffix'

This confirms the route-boundary behavior works across the endpoints called out in the PR, including deprecated ACP routes and POST routes that return ConversationInfo.

Issues Found

None.

This QA review was created by an AI agent (OpenHands) on behalf of the requester.

all-hands-bot

I found one API compatibility concern; details inline. Risk: 🟡 Medium because this changes a public REST/SDK-observable response by default.

This review was generated by an AI agent (OpenHands) on behalf of the user.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/OpenHands/software-agent-sdk/actions/runs/26149800184

Default trim changes the public ``ConversationInfo`` shape that external SDK consumers like ``RemoteConversation.agent.agent_context.skills`` already round-trip through. Reviewer flagged the silent-regression risk; agreed. Make it opt-in: - ``include_skills: bool = True`` query param on every read endpoint that emits ``ConversationInfo`` (search, get, batch-get, start, fork, plus the deprecated ACP equivalents). - Default behaviour unchanged — existing callers see the full payload. - ``?include_skills=false`` opts into ``trim_conversation_response_skills``; the helper drops ``agent.agent_context.skills`` from the response while leaving the persisted state and the in-memory runtime copy untouched. Documented in the param title so it surfaces in OpenAPI. Tests cover both branches at every endpoint: - Default (no param) → full skills (5 in the fixture). - ``include_skills=true`` (explicit) → full skills (same). - ``include_skills=false`` → empty list. - Concrete byte-count comparison (default vs opt-in) confirms the ~2500 B saved per 5-skill conversation in the test fixture. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

all-hands-bot

✅ QA Report: PASS

Verified the actual agent-server HTTP API before and after the PR; include_skills=false now trims response skills while default behavior stays full.

Does this PR achieve its stated goal?

Yes. The PR set out to add a backward-compatible, opt-in include_skills=false query parameter for conversation responses. Running real agent-server instances showed main ignored the parameter and returned 2 skills/~9.1 KB responses, while the PR returned 0 skills/~2.5 KB for opt-in calls and still returned full skills for omitted/true calls. A trimmed POST /api/conversations?include_skills=false was also followed by GET default returning 2 skills, confirming the persisted/in-memory conversation was not trimmed.

Phase	Result
Environment Setup	✅ `make build` completed and actual `agent-server` processes started successfully
CI Status	⚠️ At check time: 23 pass, 4 pending, 1 failing `Review Thread Gate/unresolved-review-threads`, 2 skipped
Functional Verification	✅ Real HTTP requests verified default, explicit true, opt-in false, batch/search/fork, and ACP routes

Functional Verification

Test 1: Public conversation API before/after behavior

Step 1 — Establish baseline without the fix:
Checked out origin/main, started the real server with OPENHANDS_SUPPRESS_BANNER=1 uv run agent-server --host 127.0.0.1 --port 18081, then ran uv run python /tmp/qa_skill_trim_probe.py http://127.0.0.1:18081 main:

LABEL main
BASE http://127.0.0.1:18081
conversation_id 40556d80-c8ec-48ec-ac58-fe1149ff7ec5
post_false_id 30316fbc-6cfe-4cea-86ae-195b2f9c8fab
endpoint | status | bytes | skills
POST include_skills=false | 201 | 9111 | 2
POST default | 201 | 9111 | 2
GET default | 200 | 9111 | 2
GET include_skills=true | 200 | 9111 | 2
GET include_skills=false | 200 | 9111 | 2
BATCH default | 200 | 9113 | 2
BATCH include_skills=false | 200 | 9113 | 2
SEARCH default | 200 | 21542 | 2
SEARCH include_skills=false | 200 | 21542 | 2
FORK include_skills=false | 201 | 9111 | 2
GET default after trimmed start | 200 | 9111 | 2

This confirms the baseline had no opt-in trimming behavior: even when include_skills=false was present, responses still carried both inline skills and the same byte size as default responses.

Step 2 — Apply the PR's changes:
Checked out perf/api-route-skill-trim at ef1598c1c9532fc302d052f22d4a19de800de405 and started the real server with OPENHANDS_SUPPRESS_BANNER=1 uv run agent-server --host 127.0.0.1 --port 18082.

Step 3 — Re-run with the fix in place:
Ran uv run python /tmp/qa_skill_trim_probe.py http://127.0.0.1:18082 pr:

LABEL pr
BASE http://127.0.0.1:18082
conversation_id 578273c2-44a3-484c-b5ec-3803e828e06d
post_false_id 3fc4e1d6-71d6-4ca9-8b75-278fa2118878
endpoint | status | bytes | skills
POST include_skills=false | 201 | 2492 | 0
POST default | 201 | 9101 | 2
GET default | 200 | 9101 | 2
GET include_skills=true | 200 | 9101 | 2
GET include_skills=false | 200 | 2492 | 0
BATCH default | 200 | 9103 | 2
BATCH include_skills=false | 200 | 2494 | 0
SEARCH default | 200 | 48858 | 2
SEARCH include_skills=false | 200 | 14992 | 0
FORK include_skills=false | 201 | 2492 | 0
GET default after trimmed start | 200 | 9101 | 2

This shows the PR behavior works end-to-end: default and explicit true remain backward-compatible with full skills, opt-in false trims skills to [] and substantially reduces response bytes, and the default GET after a trimmed POST still returns full skills, so the stored conversation was not mutated by response trimming.

Test 2: Deprecated ACP conversation API opt-in behavior

With the PR server still running, ran uv run python /tmp/qa_acp_trim_probe.py http://127.0.0.1:18082 pr-acp:

LABEL pr-acp
conversation_id 18c8d812-80f9-453f-afa3-a2aeae48bc8f
endpoint | status | bytes | skills
ACP POST default | 201 | 7129 | 2
ACP GET include_skills=false | 200 | 2500 | 0
ACP BATCH include_skills=false | 200 | 2502 | 0
ACP SEARCH include_skills=false | 200 | 19986 | 0

This verifies the deprecated ACP equivalents expose the same opt-in trimming behavior over real HTTP requests.

Issues Found

None.

This review was created by an AI agent (OpenHands) on behalf of the user.

all-hands-bot

LGTM — no blocking issues found. Taste Rating: 🟢 Good taste. The route-level opt-in keeps the public default response backward-compatible while trimming only for callers that request it.

This review was generated by an AI agent (OpenHands) on behalf of the user.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/OpenHands/software-agent-sdk/actions/runs/26151427932

…lse) BREAKING CHANGE: ``GET /conversations*`` and the deprecated ACP equivalents now return ``agent.agent_context.skills == []`` by default. Callers that still need the legacy full-payload shape can opt back in with ``?include_skills=true``. The trim was previously opt-in (``?include_skills=false``) to keep the default response backward-compatible. Audit of known consumers (agent-canvas, OpenHands app-server, SDK examples) showed none read ``agent.agent_context.skills`` from HTTP responses — they either use in-process ``LocalConversation`` directly or only read fields like ``agent.llm.model``. Default-trim is therefore safe to ship as the steady-state shape, and reverses the cost/benefit: ~260 KB shaved off every conversation fetch unless a caller explicitly asks for it. The persisted ``ConversationState`` on disk and the in-memory copy held by the agent's runtime are unaffected — only the bytes leaving over HTTP shrink. Also makes both ``search`` route handlers non-destructive: they build a fresh ``ConversationPage`` via ``model_copy`` instead of mutating ``page.items`` in place, so an upstream service caching its return value (e.g. ``AsyncMock`` in tests) sees stable behaviour across calls. Drive-by: migrates ``ConversationSortOrder`` and ``EventSortOrder`` to ``StrEnum`` to clear pre-existing ``UP042`` lint errors in ``models.py`` that the pre-commit hook now blocks any further edit to this file on.

github-actions Bot added the release-note-required PR requires explicit release-note coverage for behavioral or default changes label May 20, 2026

all-hands-bot reviewed May 20, 2026

View reviewed changes

Comment thread openhands-agent-server/openhands/agent_server/models.py

simonrosenberg changed the title ~~perf(agent_server): trim agent.agent_context.skills at the route boundary (alternative to #3302)~~ perf(agent_server): add ?include_skills=false opt-in to trim conversation skills (alternative to #3302) May 20, 2026

simonrosenberg requested a review from all-hands-bot May 20, 2026 08:40

all-hands-bot reviewed May 20, 2026

View reviewed changes

all-hands-bot approved these changes May 20, 2026

View reviewed changes

simonrosenberg changed the title ~~perf(agent_server): add ?include_skills=false opt-in to trim conversation skills (alternative to #3302)~~ perf(agent_server): add ?include_skills=false opt-in to trim conversation skills May 20, 2026

simonrosenberg mentioned this pull request May 20, 2026

perf: trim agent.agent_context.skills from /api/conversations responses by default (-98% wire bytes) #3301

Closed

simonrosenberg changed the title ~~perf(agent_server): add ?include_skills=false opt-in to trim conversation skills~~ perf(agent_server)!: trim conversation skills by default (include_skills=false) May 20, 2026

simonrosenberg merged commit bd55859 into main May 20, 2026
33 checks passed

simonrosenberg deleted the perf/api-route-skill-trim branch May 20, 2026 17:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(agent_server)!: trim conversation skills by default (include_skills=false)#3316

perf(agent_server)!: trim conversation skills by default (include_skills=false)#3316
simonrosenberg merged 3 commits into
mainfrom
perf/api-route-skill-trim

simonrosenberg commented May 20, 2026 •

edited by github-actions Bot

Loading

Uh oh!

github-actions Bot commented May 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 20, 2026 •

edited

Loading

Uh oh!

all-hands-bot left a comment

Uh oh!

all-hands-bot left a comment

Uh oh!

Uh oh!

all-hands-bot left a comment

Uh oh!

all-hands-bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

simonrosenberg commented May 20, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TL;DR

Breaking change

Why default-on (revised from the opt-in revision)

Empirical measurements (honest)

Wire-level (reproducible, 5 GETs each)

Browser cold-cold-cold (3 trials; servers killed, Vite cache wiped between trials)

Where the trim actually wins

Where it doesn't win

Why this exists alongside #3302

What changes

Test plan

Uh oh!

github-actions Bot commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Python API breakage checks — ✅ PASSED

Behavioral default changes detected

Uh oh!

github-actions Bot commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

REST API breakage checks (OpenAPI) — ✅ PASSED

Uh oh!

github-actions Bot commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

✅ QA Report: PASS

Does this PR achieve its stated goal?

Test 1: Standard conversation read endpoints shrink payloads

Test 2: Deprecated ACP and write-returning routes also trim responses

Issues Found

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

✅ QA Report: PASS

Does this PR achieve its stated goal?

Test 1: Public conversation API before/after behavior

Test 2: Deprecated ACP conversation API opt-in behavior

Issues Found

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

simonrosenberg commented May 20, 2026 •

edited by github-actions Bot

Loading

github-actions Bot commented May 20, 2026 •

edited

Loading

github-actions Bot commented May 20, 2026 •

edited

Loading

github-actions Bot commented May 20, 2026 •

edited

Loading