Skip to content

Add XPU device support for PD disaggregation with NIXL backend#3

Draft
Copilot wants to merge 5 commits into
devfrom
copilot/support-xpu-connector-pd-disaggregation
Draft

Add XPU device support for PD disaggregation with NIXL backend#3
Copilot wants to merge 5 commits into
devfrom
copilot/support-xpu-connector-pd-disaggregation

Conversation

Copilot AI commented Jan 28, 2026

Copy link
Copy Markdown

NIXL backend for PD disaggregation was hardcoded to CUDA. This adds Intel XPU support.

Changes

  • Device detection (transfer_utils.py): Extended get_correct_device() to handle xpu device strings
  • NIXL transfer channel (nixl_channel.py):
    • Added device parameter to NixlChannel and NixlAgentWrapper
    • Dynamic mem_type determination: cuda/xpu/cpu based on device string
    • Validates device types, raises on unsupported values
  • Device initialization: Updated all NIXL code paths to support XPU
    • pd_backend.py: Passes device to transfer channel, uses torch.xpu.set_device() when applicable
    • p2p_backend.py: Explicitly passes device="cpu" for CPU-based transfers
    • nixl_storage_backend.py: XPU device setting support
    • cache_engine.py: XPU support in NIXL allocator
  • Error handling: Checks hasattr(torch, "xpu") before XPU operations with clear error messages

Usage

pd_buffer_device: "xpu"  # Previously only "cuda" or "cpu"
transfer_channel: "nixl"
nixl_backends: ["UCX"]

Backward compatible: defaults to CUDA when device not specified.


✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

Copilot AI and others added 4 commits January 28, 2026 02:54
Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>
Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>
Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>
Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>
Copilot AI changed the title [WIP] Add XPU connector support for PD disaggregation using NIXL backend Add XPU device support for PD disaggregation with NIXL backend Jan 28, 2026
Copilot AI requested a review from zhenwei-intel January 28, 2026 03:05
@github-actions

Copy link
Copy Markdown

This pull request has been automatically marked as stale because it has not had activity within 60 days. It will be automatically closed if no further activity occurs within 30 days.

@github-actions github-actions Bot added the stale label Mar 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants