
[Draft] [plugin][profiler] refine OOT profiler with record_function #348

Draft

zejunchen-zejun wants to merge 9 commits into main from zejun/refine_oot_profiler

Conversation

@zejunchen-zejun
Contributor

We use the label below to present step info in the profiler:
d[req128, tok128, dec128, tok128, pre0, tok0, ext0, tok0], which means a decode step with 128 requests and 128 tokens.
pre means prefill, and ext means the extend path for attention.
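For illustration only (this is not code from the PR), the shorthand label can be decoded back into its counters. The dict field names below are assumptions based on the description above:

```python
import re

def parse_step_label(label: str) -> dict:
    """Decode a step label like
    'd[req128, tok128, dec128, tok128, pre0, tok0, ext0, tok0]'
    into named counters. Pairs appear in the order: total, decode,
    prefill, extend -- each as (request count, token count)."""
    step_kind = "decode" if label[0] == "d" else "prefill/extend"
    # Every lowercase-name/number pair inside the brackets.
    pairs = re.findall(r"([a-z]+)(\d+)", label[label.index("[") + 1:-1])
    names = ["total_reqs", "total_toks", "dec_reqs", "dec_toks",
             "pre_reqs", "pre_toks", "ext_reqs", "ext_toks"]
    return {"step": step_kind, **{n: int(v) for n, (_, v) in zip(names, pairs)}}

print(parse_step_label("d[req128, tok128, dec128, tok128, pre0, tok0, ext0, tok0]"))
```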

record function

Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>
Copilot AI review requested due to automatic review settings March 17, 2026 14:29
@zejunchen-zejun zejunchen-zejun marked this pull request as draft March 17, 2026 14:29

Copilot AI left a comment


Pull request overview

Refines ATOM’s vLLM OOT plugin profiling by adding torch.profiler.record_function spans around the model forward pass, with labels derived from vLLM forward-context attention metadata.

Changes:

  • Adds helpers to extract step-level attention/plugin metadata from vLLM forward context.
  • Builds a compact per-step profiler label (decode vs prefill/extend) from plugin metadata counters.
  • Wraps self.model(...) in a conditional record_function(...) span when torch profiling is enabled via vLLM config.
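The conditional span in the last bullet follows a common pattern: use a real profiler span when profiling is on, and a no-op context otherwise. Below is a minimal pure-Python sketch of that pattern; the name forward_with_span is made up, and the record_function type is injected as a parameter (torch.profiler.record_function in the PR's setting) so the sketch carries no torch dependency:

```python
from contextlib import nullcontext

def forward_with_span(model, inputs, label, profiling_enabled, record_function):
    """Run model(inputs), wrapped in a named profiler span only when
    profiling is enabled; otherwise fall back to a no-op context.

    `record_function` is injected (e.g. torch.profiler.record_function,
    or a stub in tests), so this sketch does not import torch.
    """
    span = record_function(label) if profiling_enabled else nullcontext()
    with span:
        return model(inputs)
```

In the PR itself, `label` would come from the step-label helper and `profiling_enabled` from the vLLM config.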


Comment on lines +116 to +125

    # Shorthand label format:
    # d = decode-only step, p = step containing prefill/extend work.
    # req/tok = total requests/tokens in this step.
    # dec/pre/ext each carry request count followed by token count.
    step = "p" if (num_prefills > 0 or num_extends > 0) else "d"
    return (
        f"{step}[req{total_reqs}, tok{num_actual_tokens}, "
        f"dec{num_decodes}, tok{num_decode_tokens}, "
        f"pre{num_prefills}, tok{num_prefill_tokens}, "
        f"ext{num_extends}, tok{num_extend_tokens}]"
    )
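Read on its own, the fragment above can be completed into a self-contained function. The name build_step_label and the parameter order are inferred from the snippet, not taken from the PR:

```python
def build_step_label(total_reqs, num_actual_tokens,
                     num_decodes, num_decode_tokens,
                     num_prefills, num_prefill_tokens,
                     num_extends, num_extend_tokens):
    """Build the compact per-step profiler label described in the PR:
    d = decode-only step, p = step with prefill/extend work."""
    step = "p" if (num_prefills > 0 or num_extends > 0) else "d"
    return (
        f"{step}[req{total_reqs}, tok{num_actual_tokens}, "
        f"dec{num_decodes}, tok{num_decode_tokens}, "
        f"pre{num_prefills}, tok{num_prefill_tokens}, "
        f"ext{num_extends}, tok{num_extend_tokens}]"
    )

# A decode-only step with 128 requests and 128 tokens:
print(build_step_label(128, 128, 128, 128, 0, 0, 0, 0))
# -> d[req128, tok128, dec128, tok128, pre0, tok0, ext0, tok0]
```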
Comment on lines +72 to +80

    if isinstance(attn_metadata, list):
        # In ubatch mode, vLLM stores one metadata dict per microbatch. We need
        # the first actual per-layer metadata object, not the outer list itself.
        # Keep the empty-dict guard for robustness if a placeholder slips through.
        for ubatch_attn_metadata in attn_metadata:
            if not ubatch_attn_metadata:
                continue
            return next(iter(ubatch_attn_metadata.values()), None)
        return None

    if isinstance(attn_metadata, dict):
        return next(iter(attn_metadata.values()), None)
    return None
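The list/dict handling above can be exercised with plain dicts standing in for vLLM's per-layer attention metadata objects. first_layer_metadata is a made-up name for this sketch, and the string values are stand-ins for real metadata objects:

```python
def first_layer_metadata(attn_metadata):
    """Return the first per-layer metadata object from the forward
    context. attn_metadata may be a dict (normal mode) or a list of
    dicts, one per microbatch (ubatch mode); empty placeholders are
    skipped."""
    if isinstance(attn_metadata, list):
        for ubatch_attn_metadata in attn_metadata:
            if not ubatch_attn_metadata:
                continue
            return next(iter(ubatch_attn_metadata.values()), None)
        return None
    if isinstance(attn_metadata, dict):
        return next(iter(attn_metadata.values()), None)
    return None

# Stand-in metadata: layer name -> metadata object (strings here).
print(first_layer_metadata({"layer.0": "meta0"}))        # meta0
print(first_layer_metadata([{}, {"layer.0": "meta1"}]))  # meta1
print(first_layer_metadata([]))                          # None
```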
@valarLip
Collaborator

Please make sure the style is aligned with ATOM main.

@zejunchen-zejun
Contributor Author

> please make sure have aligned style with ATOM main

🆗 Sure. This PR still has a bug for now: no label info shows up in the profiler JSON. Will fix soon.

Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>
@zejunchen-zejun
Contributor Author

zejunchen-zejun commented Mar 18, 2026

Putting this PR on hold, because it is not easy to add a customized label and have all of the kernels from a step attributed to it. Here is the issue we found: we build the customized label info and record it before the model run, but the kernels are not attributed to the step for any step except the last one. As a result, the label is treated as a normal user annotation instead of a GPU user annotation, and the profiler JSON looks as shown below. Only the last step has the correct kernels and associated labels; for the other steps it does not work for now.
[screenshot: profiler JSON trace, with labels attached only to the last step's kernels]



3 participants