
Improve Gatus reliability and grouping #605

Merged: mulatta merged 1 commit into main from gatus-monitoring-reliability on Jul 3, 2026
@mulatta (Collaborator) commented Jul 3, 2026

Summary

  • persist Gatus results in SQLite so deploys do not reset uptime history
  • add external endpoint heartbeat checks so missing push updates turn red
  • allow one pushed endpoint to require multiple HTTP probes
  • tighten app checks with browser-facing routes for Grafana, MULTI-evolve, Docling, Nextcloud, and Vaultwarden
  • collapse dashboard groups to apps/ci/monitoring/platform and default UI sorting to group

Validation

  • nix fmt
  • nix eval --impure --json --expr 'let f = builtins.getFlake (toString ./.); in { eta = f.nixosConfigurations.eta.config.services.gatus.settings; psiPush = f.nixosConfigurations.psi.config.gatusCheck.push; rhoPush = f.nixosConfigurations.rho.config.gatusCheck.push; tauPush = f.nixosConfigurations.tau.config.gatusCheck.push; }'
  • nix build --dry-run .#checks.x86_64-linux.nixos-eta .#checks.x86_64-linux.nixos-psi .#checks.x86_64-linux.nixos-rho .#checks.x86_64-linux.nixos-tau --no-link

Gatus kept all endpoint state in memory, so every deploy reset uptime
history, and external endpoints disappeared until their next push. External
checks also had no heartbeat, so a stopped push timer could leave an endpoint
stuck on its last green result indefinitely.

Persist results in SQLite, require external endpoint heartbeats, and let a
single pushed endpoint aggregate multiple HTTP probes. Use that for services
where both the app health endpoint and the browser-facing route need to pass.
Collapse dashboard groups and set the default UI sort to group to make the
status page easier to scan.
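
To illustrate the "one pushed endpoint, multiple HTTP probes" idea, here is a minimal Python sketch. It is not the implementation in this PR, and the names `ProbeResult`, `aggregate`, and `build_push_request` are hypothetical, as are the example URLs, key, and token. It assumes the pusher reports to Gatus's external endpoint API (`POST /api/v1/endpoints/{key}/external` with a bearer token), and that the endpoint counts as healthy only when every probe passes.

```python
from dataclasses import dataclass
from urllib.parse import quote, urlencode
from urllib.request import Request


@dataclass
class ProbeResult:
    url: str          # URL that was probed
    ok: bool          # True if the probe met its expectations
    detail: str = ""  # short failure reason, empty on success


def aggregate(results):
    """Return (success, error); success only if every probe passed."""
    if not results:
        # Treat "nothing was checked" as a failure rather than a silent green.
        return False, "no probes ran"
    failures = [r for r in results if not r.ok]
    error = "; ".join(f"{r.url}: {r.detail or 'failed'}" for r in failures)
    return not failures, error


def build_push_request(base_url, key, token, success, error=""):
    """Build (but do not send) the push request for one external endpoint.

    `key` is the Gatus endpoint key; the caller is assumed to know it already.
    """
    params = {"success": "true" if success else "false"}
    if error:
        params["error"] = error
    url = (
        f"{base_url.rstrip('/')}/api/v1/endpoints/"
        f"{quote(key, safe='')}/external?{urlencode(params)}"
    )
    return Request(url, method="POST",
                   headers={"Authorization": f"Bearer {token}"})
```

A caller would run its probes, pass the results to `aggregate`, and send the built request with `urllib.request.urlopen(req, timeout=10)`. If the pusher itself stops running, nothing in this sketch notices; that case is what the heartbeat on the Gatus side is meant to catch.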

@mulatta force-pushed the gatus-monitoring-reliability branch from 66a4812 to 85486d9 on July 3, 2026 06:39
@mulatta merged commit da16e85 into main on Jul 3, 2026
3 checks passed
@mulatta deleted the gatus-monitoring-reliability branch on July 3, 2026 06:41