Collect JVM memory metrics #2937
Conversation
core/src/main/java/google/registry/monitoring/whitebox/StackdriverModule.java
ptkach left a comment
Overall looks good, but did you say we collect it every 60s? It's for every pod in every workload, so I think that's way too many metrics for what we need - I'd suggest lowering it to every 5 mins.
@ptkach reviewed 3 files and all commit messages, and made 1 comment.
Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @jicelhay).
Yes, we're just registering new metrics in the default MetricRegistry and piggybacking on the logic for writing them. The existing writer is configured to write every 60 secs (see StackdriverModule.provideMetricReporter and writeIntervalSeconds in the default config yaml). From what I see in the metrics explorer, we're already pushing 30+ custom metrics, so I don't think pushing 3 more is a big issue. Creating a configuration for a new writer, providing it, and initializing it as its own process just to have these new metrics report at a different interval doesn't seem worth it to me; let me know what you think.
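To illustrate the piggybacking point, here is a minimal, self-contained sketch of how a single periodic reporter flushes everything on one shared interval. This is plain java.util.concurrent, not the actual Nomulus MetricReporter; in the real code the interval comes from writeIntervalSeconds in the config yaml.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Illustrative sketch only: one scheduled task flushes all registered
// metrics on a single shared interval, which is why adding a few more
// gauges to the existing registry needs no second writer or process.
public final class PeriodicReporterSketch {

  public static void main(String[] args) {
    // Stands in for writeIntervalSeconds from the default config yaml.
    long writeIntervalSeconds = 60;

    ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
    scheduler.scheduleAtFixedRate(
        // In the real reporter this would write every metric in the
        // registry (existing ones plus the new JVM gauges) to Stackdriver.
        () -> System.out.println("flushing all registered metrics..."),
        writeIntervalSeconds,
        writeIntervalSeconds,
        TimeUnit.SECONDS);
  }
}
```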
ptkach left a comment
Let's merge and see how the metrics volume changes after a week of running this in prod.
@ptkach made 1 comment.
Reviewable status: all files reviewed, 1 unresolved discussion (waiting on @jicelhay).
Adds JVM memory monitoring by registering new gauge metrics in the existing MetricRegistryImpl object. Metrics in MetricRegistryImpl are already being pushed to Cloud Monitoring every 60 secs by a MetricReporter.
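For reference, a minimal sketch of where such gauge values come from, using only the standard java.lang.management API. The metric name jvm/memory/max matches the one in the metrics explorer link below, but the class name here and the way the PR wires these values into MetricRegistryImpl are assumptions, not the exact code from this change.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;
import java.lang.management.MemoryUsage;

// Sketch: read the JVM heap values a gauge callback could report.
// The actual PR registers these as gauges in MetricRegistryImpl so the
// existing MetricReporter pushes them on its normal schedule.
public final class JvmMemoryMetricsSketch {

  public static void main(String[] args) {
    MemoryMXBean memoryBean = ManagementFactory.getMemoryMXBean();
    MemoryUsage heap = memoryBean.getHeapMemoryUsage();

    // Three values of the kind the new gauges expose, sampled on each write.
    long used = heap.getUsed();
    long committed = heap.getCommitted();
    long max = heap.getMax(); // may be -1 if the max is undefined

    System.out.printf("jvm/memory used=%d committed=%d max=%d%n", used, committed, max);
  }
}
```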
Example of metrics in crash: https://console.cloud.google.com/monitoring/metrics-explorer;duration=PT1H?referrer=search&project=domain-registry-crash&pageState=%7B%22xyChart%22:%7B%22constantLines%22:%5B%5D,%22dataSets%22:%5B%7B%22plotType%22:%22LINE%22,%22pointConnectionMethod%22:%22GAP_DETECTION%22,%22targetAxis%22:%22Y1%22,%22timeSeriesFilter%22:%7B%22aggregations%22:%5B%7B%22alignmentPeriod%22:%2260s%22,%22crossSeriesReducer%22:%22REDUCE_SUM%22,%22groupByFields%22:%5B%22metric.label.%5C%22type%5C%22%22,%22resource.label.%5C%22pod_id%5C%22%22%5D,%22perSeriesAligner%22:%22ALIGN_MEAN%22%7D%5D,%22apiSource%22:%22DEFAULT_CLOUD%22,%22crossSeriesReducer%22:%22REDUCE_SUM%22,%22filter%22:%22metric.type%3D%5C%22custom.googleapis.com%2Fjvm%2Fmemory%2Fmax%5C%22%20resource.type%3D%5C%22gke_container%5C%22%22,%22groupByFields%22:%5B%22metric.label.%5C%22type%5C%22%22,%22resource.label.%5C%22pod_id%5C%22%22%5D,%22minAlignmentPeriod%22:%2260s%22,%22perSeriesAligner%22:%22ALIGN_MEAN%22%7D%7D%5D,%22options%22:%7B%22mode%22:%22COLOR%22%7D,%22y1Axis%22:%7B%22label%22:%22%22,%22scale%22:%22LINEAR%22%7D%7D%7D
b/468031702