Skip to content

bump default request timeout 30s -> 60s#9

Closed
goharanwar wants to merge 1 commit into
mainfrom
gohar/bump-default-timeout-agents
Closed

bump default request timeout 30s -> 60s#9
goharanwar wants to merge 1 commit into
mainfrom
gohar/bump-default-timeout-agents

Conversation

@goharanwar

Copy link
Copy Markdown
Contributor

Summary

  • Bumps VECTARA_TIMEOUT default from 30s to 60s in utils/config.py
  • Updates README to match

Why

Smoke tests on admin-ops staging were failing test_execute_agent_query with 408 - Request timeout after exactly 30,020ms. Agents-api logs (request_id=db4cb451336142a6b0a38b3607cc5c45, 2026-05-20 15:18:15 → 15:19:12) show the server completed the LLM call successfully — 57s end-to-end. The test client gave up at the 30s default before the server replied.

The suite already declares 60s as the acceptable budget for agent execution in test_execute_agent_response_time (assert response.elapsed_ms < 60000). The 30s default contradicted that. This change makes the default consistent with the suite's own SLA.

Sanity check on other tests

All response-time assertions in the suite remain well under 60s, so the bump is purely additive headroom for agent calls:

Test SLA
test_authentication_response_time < 5s
test_get_corpus_response_time < 3s
test_list_corpora_response_time < 5s
test_metadata (indexing) < 10s
test_rag_summary < 30s
test_query_edge_cases < 5s
test_execute_agent_response_time < 60s

Test plan

  • Trigger admin-ops staging smoke run for the agents profile; test_execute_agent_query should pass
  • Other profiles unaffected (no behavior change for any call that returned in < 30s before)

Agent execution involves LLM generation that routinely takes >30s in
staging; the prior default caused test_execute_agent_query to fail with
408 client timeouts while the server completed the request successfully
in ~57s. The new default matches the existing budget asserted by
test_execute_agent_response_time (< 60_000ms).

Other endpoints' SLA assertions all remain well under 60s (auth < 5s,
get/list corpus < 5s, indexing < 10s, rag summary < 30s, query < 5s),
so the bump only buys headroom for agent calls.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@goharanwar

Copy link
Copy Markdown
Contributor Author

Closing — fixing in vectara/platform PR #5450 instead since the test suite is being moved into the platform monorepo.

@goharanwar goharanwar closed this May 20, 2026
@goharanwar goharanwar deleted the gohar/bump-default-timeout-agents branch May 20, 2026 17:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant