
feat: add Swarm mode with atomic Caddyfile rollout via configs #778

Open
oneingan wants to merge 7 commits into lucaslorentz:master from oneingan:feat/swarm-mode-configs-upstream

Conversation

@oneingan (Contributor) commented Mar 3, 2026

Implements the Swarm mode discussed in #773 (and the earlier atomic-write motivation from #766).

  • Adds --mode=swarm: controller-only; renders a full Caddyfile from labels and rolls it out via immutable Swarm configs.
  • Updates an existing Swarm service (--swarm-service) by swapping the mounted config at /etc/caddy/Caddyfile (or --swarm-caddyfile-target).
  • Keeps Caddy workers socketless and the Admin API closed; rollout happens via docker service update, so disruption is governed by the service's update/rollback settings.
  • Includes docs + examples/swarm.yaml and a swarm-mode test script.

Notes:

  • Each config change creates a new Swarm config object named <prefix>-<sha256>; old configs are not garbage-collected yet (documented in README). A GC pass can be added as follow-up.
  • This has been running on a production Swarm cluster for a couple of weeks without issues.

AI assistance disclosure: this PR was prepared with help from OpenCode (AI coding assistant) using OpenAI model gpt-5.2-xhigh.

```go
if err != nil {
	// If another instance created it concurrently, inspect again.
	if errdefs.IsConflict(err) {
		return ensureSwarmConfig(ctx, dockerClient, name, data, fullHash)
```
Owner commented:

This recursive call has no depth limit. If ConfigCreate keeps returning a conflict (e.g. a genuine name collision with different data), it will overflow the stack.

Consider a bounded loop like the one in ensureServiceCaddyfileConfig just below (line 190).

```go
if err != nil {
	log.Error("Failed to convert caddyfile into json config", zap.Error(err))
if dockerLoader.options.SwarmMode {
	if err := dockerLoader.updateSwarmService(); err != nil {
```
Owner commented:

updateSwarmService() runs on every update() cycle, not just when caddyfileChanged is true. This means every polling interval / Docker event triggers ServiceInspectWithRaw + ConfigList even when nothing changed.

The existing flow only does expensive work (Adapt + server push) inside the if caddyfileChanged block. Could this be guarded the same way?
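One way to guard it, sketched with a hypothetical hash cache (`lastCaddyfileHash` and `shouldUpdateSwarmService` are not names from the PR; the real loader already tracks `caddyfileChanged`):

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
)

// lastCaddyfileHash caches the hash of the last rendered Caddyfile so the
// update cycle can skip the expensive Swarm calls when nothing changed.
var lastCaddyfileHash string

// shouldUpdateSwarmService reports whether the rendered Caddyfile differs
// from the one already rolled out, mirroring the existing caddyfileChanged guard.
func shouldUpdateSwarmService(caddyfile []byte) bool {
	sum := sha256.Sum256(caddyfile)
	h := hex.EncodeToString(sum[:])
	if h == lastCaddyfileHash {
		return false // no change: skip ServiceInspectWithRaw + ConfigList
	}
	lastCaddyfileHash = h
	return true
}
```

Only a changed hash would then reach `updateSwarmService()`, so idle polling intervals and unrelated Docker events become no-ops.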

```go
	defaultSwarmConfigPrefix  = "caddyfile"
	defaultSwarmConfigHashLen = 32
	maxSwarmConfigSizeBytes   = 1000 * 1024
)
```
Owner commented:

These defaults are also defined in the flag declarations in cmd.go (lines 75-86). In the existing code, defaults live only in the flag definitions. Having two sources means they can silently drift apart.

Owner commented:

Example:

```go
fs.Int("swarm-config-hash-len", 32,
	"Length of sha256 hex used in generated Swarm config name (swarm mode only)")
```

Having a shared const would be nice

```sh
docker config ls --format "{{.Name}}" | grep "^${CONFIG_PREFIX}-" | xargs -r docker config rm >/dev/null || true
}

cleanup_configs
```
Owner commented:

cleanup_configs runs here at the start but not on exit. docker stack rm (which the main test runner calls) won't remove standalone Swarm config objects created by this mode — unlike other tests, this one creates Docker objects outside the stack.

Consider adding a trap:

```sh
trap cleanup_configs EXIT
```

@lucaslorentz (Owner) commented:

Reviewed with Claude Opus

@lucaslorentz (Owner) commented:

A concern I have is the lack of config garbage collection. Each Caddyfile change creates a new Swarm config object (<prefix>-<sha256>) that is never removed. In a cluster with frequent deploys, this will silently accumulate and fill up Swarm's Raft store, which can degrade the entire cluster's performance and eventually prevent new configs/secrets from being created.

Would it be possible to tag the created configs with a label identifying them as managed by this controller, and include a cleanup routine that scans for old configs with that label and removes any that are no longer mounted on the target service?
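As a sketch of the cleanup half of that idea (all names hypothetical; the Docker API calls — listing label-filtered configs, inspecting the service's mounted configs, ConfigRemove — are abstracted away so only the keep/remove decision is shown):

```go
package main

// gcConfigs removes controller-managed configs that are no longer mounted on
// the target service. managed would come from ConfigList filtered by the
// controller's label; mounted from the service spec; remove wraps ConfigRemove.
func gcConfigs(managed []string, mounted map[string]bool, remove func(id string) error) (removed []string, err error) {
	for _, id := range managed {
		if mounted[id] {
			continue // still referenced by the target service: keep it
		}
		if err := remove(id); err != nil {
			// Stop on the first failure so a transient API error does not
			// silently skip configs; the next GC pass will retry.
			return removed, err
		}
		removed = append(removed, id)
	}
	return removed, nil
}
```

Running such a pass after each successful `docker service update` would keep the Raft store bounded to roughly one stale config per in-flight rollout.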
