Skip to content

#254 Increase ACI resources, tune RQ2, drop default strategy#339

Merged
jathavaan merged 4 commits into
mainfrom
feature/254-increase-resources-for-aci-containers-and-postgresql
May 24, 2026
Merged

#254 Increase ACI resources, tune RQ2, drop default strategy#339
jathavaan merged 4 commits into
mainfrom
feature/254-increase-resources-for-aci-containers-and-postgresql

Conversation

@jathavaan
Copy link
Copy Markdown
Collaborator

Summary

  • Bump all ACI containers from 3 vCPU / 8 GB to 4 vCPU / 16 GB (ACI max for most Azure regions)
  • Reduce RQ2 (national-scale spatial join) iterations from 5 → 3, raise per-iteration timeout to 90 minutes (applied to warmup too)
  • Remove Sedona default-strategy benchmarks (3 experiments) — consistently timed out or failed, infeasible to collect reliable data
  • Increase DATABRICKS_DRIVER_MAX_RESULT_SIZE to 16g
  • Update data release to 2026-05-23.1
  • Experiment count: 44 → 41, batch count: 18 → 17

Test plan

  • Verify benchmarks.yml parses correctly via python main.py dry-run
  • Confirm no import errors: python -c "from src.presentation.entrypoints import *"
  • Run a single RQ2 benchmark locally to verify 3-iteration ceiling and 90-min timeout apply
  • Check that duckdb-large and postgis-large batch together without the removed default-8

…tegy

Bump all ACI containers to 4 vCPU / 16 GB RAM (max for most Azure
regions). Reduce national-scale spatial join iterations from 5 to 3
and raise per-iteration timeout to 90 minutes (including warmup).
Remove Sedona default-strategy benchmarks — iterations consistently
timed out or failed, making reliable measurement infeasible.
Increase Databricks driver max result size to 16g. Update data release
to 2026-05-23.1.
Copilot AI review requested due to automatic review settings May 24, 2026 07:40
@jathavaan jathavaan linked an issue May 24, 2026 that may be closed by this pull request
10 tasks
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates benchmark orchestration/config to increase ACI container resources, reduce/tune the RQ2 (national-scale spatial join) run length, and remove the infeasible Sedona “default strategy” experiments from the benchmark suite.

Changes:

  • Increase ACI experiment resources in benchmarks.yml (3 vCPU / 8 GB → 4 vCPU / 16 GB) and update the data release string.
  • Tune RQ2 execution by lowering NATIONAL_SCALE_SPATIAL_JOIN iterations (5 → 3) and extending the timeout ceiling (3600s → 5400s), including warmup enforcement.
  • Remove Databricks “default” national-scale spatial join entrypoints and dispatch/import wiring, and update docs to reflect the dropped variant.

Reviewed changes

Copilot reviewed 15 out of 15 changed files in this pull request and generated 4 comments.

Show a summary per file
File Description
benchmarks.yml Bumps all experiment CPU/memory and removes default-strategy experiments from the matrix.
src/config.py Updates data release, increases max timed window, and increases Databricks driver max result size.
src/domain/enums/benchmark_iteration.py Lowers national-scale spatial join iteration ceiling from 5 to 3.
src/application/common/monitor.py Applies timeout enforcement to warmup iterations and avoids entering timed loop if warmup times out.
src/presentation/entrypoints/national_scale_spatial_join_databricks_default_2_nodes.py Deletes default-strategy 2-worker Databricks entrypoint.
src/presentation/entrypoints/national_scale_spatial_join_databricks_default_8_nodes.py Deletes default-strategy 8-worker Databricks entrypoint.
src/presentation/entrypoints/national_scale_spatial_join_databricks_default_16_nodes.py Deletes default-strategy 16-worker Databricks entrypoint.
src/presentation/entrypoints/__init__.py Removes exports for deleted default-strategy Databricks entrypoints.
benchmark_runner.py Removes imports and dispatch cases for deleted default-strategy Databricks script IDs.
README.md Updates narrative and batch listing to reflect removal of default strategy and new timeout/iteration values.
CLAUDE.md Updates stopping-rule rationale text to match the new 3-iteration RQ2 configuration.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread benchmark_runner.py
Comment thread README.md
Comment thread src/config.py
Comment thread src/config.py Outdated
jathavaan added 2 commits May 24, 2026 09:47
…config

Delete the default-strategy Databricks notebook and all remaining
references (Config paths, NotebookVariant literal, service dispatch
branches, interface docstring). Fix README notebook count (three → two).

Increase Databricks driver memory (9g → 14g) and overhead (512m → 1g)
to address OOMs, cap maxResultSize at 8g to stay within driver heap.
Copilot AI review requested due to automatic review settings May 24, 2026 07:59
@jathavaan jathavaan enabled auto-merge May 24, 2026 08:00
@jathavaan jathavaan disabled auto-merge May 24, 2026 08:00
@jathavaan jathavaan self-assigned this May 24, 2026
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 20 out of 20 changed files in this pull request and generated 2 comments.

Comment thread src/config.py
Comment thread README.md
@jathavaan jathavaan merged commit 301673e into main May 24, 2026
26 checks passed
@jathavaan jathavaan deleted the feature/254-increase-resources-for-aci-containers-and-postgresql branch May 24, 2026 08:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Increase resources for ACI containers and PostgreSQL

2 participants