UCP/MM: Add MAX_HCA_PER_GPU policy for GPU dmabuf registrations by tomerg-nvidia · Pull Request #11422 · openucx/ucx

tomerg-nvidia · 2026-05-05T19:43:39Z

What?

Add UCX_MAX_HCA_PER_GPU to limit how many HCAs UCP registers GPU memory on through dmabuf-capable MDs.

Supported values:

inf: register on all reachable HCAs, default
0: register on the closest reachable HCA set
<N> : register on up to N closest reachable HCAs

Why?

UCX currently registers on all MDs, causing high ICM memory usage. This should reduce it.

How?

Precompute dmabuf-capable MD reachability and latency per memory system device during context setup. During user ucp_mem_map(), narrow only the dmabuf-capable part of the registration MD map according to UCX_MAX_HCA_PER_GPU. keep non-dmabuf MDs unchanged.

Selection uses topology latency first, then MD use count, then MD name for deterministic tie breaking.

Add a configurable policy (UCX_DMABUF_REG_DEVICES) to control how many dmabuf-capable MDs are registered during ucp_mem_map for GPU memory. Three modes are supported: "all" (default, register every reachable dmabuf MD), "closest" (only MDs on the same NUMA/bus), and a numeric limit N (pick the N closest reachable MDs, load-balanced by use count). The policy is applied in a place so that later registrations (e.g. zcopy, RNDV staging) bypass it.

tvegas1 · 2026-05-07T15:31:00Z

   ucs_offsetof(ucp_context_config_t, proto_use_single_net_device),
   UCS_CONFIG_TYPE_BOOL},

+  {"DMABUF_REG_DEVICES", "all",


Suggest to rename without DMABUF as this will remain an existing option even when it becomes not strictly dmabuf related?

It should eventually replace GDA_MAX_HCA_PER_GPU, maybe MAX_HCA_PER_GPU=0/N/inf, 0 for closest, inf for all? Or a string like currently done?

tvegas1 · 2026-05-07T15:31:15Z

   UCS_CONFIG_TYPE_BOOL},

+  {"DMABUF_REG_DEVICES", "all",
+   "Specifies which dmabuf-capable UCP memory domain(s) to use for broad "


no dmabuf reference here

tvegas1 · 2026-05-07T15:33:02Z

+    md_map &= context->dmabuf_reg_md_map;
+    ucs_for_each_bit(md_index, md_map) {
+        context->dmabuf_reg_md[md_index].last_used =
+                ucs_atomic_fadd32(&context->dmabuf_reg_timestamp, 1) + 1;


no need for atomic here?

I converted it to use count (though context-local only), it needs to be atomic because it can be accessed from multiple threads.

tvegas1 · 2026-05-07T15:34:38Z

+    ucp_dmabuf_reg_md_t           dmabuf_reg_md[UCP_MAX_MDS];
+
+    /* Monotonic counter for LRU-based dmabuf MD selection */
+    volatile uint32_t             dmabuf_reg_timestamp;


without timestamp as it is not really time, rather sequence number?

converted to use count (per context)

tvegas1 · 2026-05-07T15:38:51Z

+}
+
+static ucp_md_map_t
+ucp_dmabuf_reg_select_limit(const ucp_context_config_t *config,


we can just remove all dmabuf prefixes in struct's and functions

we can keep the actual function implemented around dmabuf functionality though

tvegas1 · 2026-05-07T16:59:59Z

+                                                     dmabuf_md_map);
+    selected      = ucp_context_select_dmabuf_reg_md_map(context, reachable,
+                                                         sys_dev);
+    ucs_trace("dmabuf_policy: mem_type=%d sys_dev=%d dmabuf=0x%" PRIx64


debug level, and such computation once at startup if it is possible?

tvegas1 · 2026-05-07T17:00:20Z

+    selected      = ucp_context_select_dmabuf_reg_md_map(context, reachable,
+                                                         sys_dev);
+    ucs_trace("dmabuf_policy: mem_type=%d sys_dev=%d dmabuf=0x%" PRIx64
+              " reachable=0x%" PRIx64 " selected=0x%" PRIx64,


TBH for usability we'd need the HCA name if possible

tvegas1 · 2026-05-07T17:29:20Z

   ucs_offsetof(ucp_context_config_t, proto_use_single_net_device),
   UCS_CONFIG_TYPE_BOOL},

+  {"DMABUF_REG_DEVICES", "all",


set to '1' by default at the potential expense of throughput.

tvegas1 · 2026-05-07T17:40:36Z

+    ucp_dmabuf_reg_select_md_t select_mds[UCP_MAX_MDS];
+    ucp_md_index_t md_index;
+
+    md_map &= context->dmabuf_reg_md_map;


probably not needed as done by caller?

tomerg-nvidia added the WIP-DNM Work in progress / Do not review label May 5, 2026

tomerg-nvidia force-pushed the limit_md_registrations branch from 23d6c18 to 648f3ec Compare May 6, 2026 15:39

tomerg-nvidia requested a review from brminich May 6, 2026 15:45

tomerg-nvidia removed the WIP-DNM Work in progress / Do not review label May 6, 2026

tomerg-nvidia marked this pull request as ready for review May 6, 2026 15:45

tomerg-nvidia requested a review from tvegas1 May 7, 2026 03:09

tvegas1 reviewed May 7, 2026

View reviewed changes

UCP/MM: PR fixes

50fc149

tomerg-nvidia changed the title ~~UCP/MM: Add UCX_DMABUF_REG_DEVICES policy to limit dmabuf registrations~~ UCP/MM: Add MAX_HCA_PER_GPU policy for GPU dmabuf registrations May 8, 2026

UCP/MM: fix format

e9daf44

tomerg-nvidia requested a review from tvegas1 May 8, 2026 09:25

TEST/GTEST/CONTEXT: fix bad test compilation

e2c141c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UCP/MM: Add MAX_HCA_PER_GPU policy for GPU dmabuf registrations#11422

UCP/MM: Add MAX_HCA_PER_GPU policy for GPU dmabuf registrations#11422
tomerg-nvidia wants to merge 4 commits intoopenucx:masterfrom
tomerg-nvidia:limit_md_registrations

tomerg-nvidia commented May 5, 2026 •

edited

Loading

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tomerg-nvidia May 8, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tomerg-nvidia May 8, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

tvegas1 May 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tomerg-nvidia commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What?

Why?

How?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tomerg-nvidia commented May 5, 2026 •

edited

Loading