Enable GGUF support gating for XPU in quantization compatibility checks#7

Draft: Copilot wants to merge 2 commits into main from copilot/add-gguf-model-support-to-xpu

Copilot AI commented Mar 14, 2026

This update enables GGUF on XPU by removing a CUDA/ROCm-only assumption from the quantization support check. GGUF is now recognized as supported on XPU, so GGUF test coverage is no longer blocked by platform gating.

  • Problem addressed

    • Quantization support checks treated non-CUDA/ROCm platforms as unsupported by default.
    • On XPU, this incorrectly rejected gguf and skipped GGUF-specific validation paths.
  • What changed

    • Updated tests/quantization/utils.py::is_quant_method_supported to:
      • allow gguf on current_platform.is_xpu()
      • handle None device capability (XPU behavior) without failing, while still restricting non-GGUF methods
    • Preserved existing behavior for CUDA/ROCm and capability-based checks.
  • Focused coverage added

    • New tests in tests/quantization/test_utils.py verify:
      • XPU + gguf => supported
      • XPU + non-gguf (e.g. fp8) => unsupported
if capability is None:
    return current_platform.is_xpu() and quant_method == "gguf"
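
The gating described above can be sketched as a small self-contained function. This is only an illustration under stated assumptions, not the code in this PR: `FakePlatform` and the `MIN_CAPABILITY` table are hypothetical stand-ins for `current_platform` and the per-method minimum capability lookup, and the numeric thresholds are made up for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakePlatform:
    """Hypothetical stand-in for current_platform (illustration only)."""
    kind: str                   # "cuda", "rocm", "xpu", or anything else
    capability: Optional[int]   # e.g. 80 for capability 8.0; None on XPU

    def is_cuda(self) -> bool:
        return self.kind == "cuda"

    def is_rocm(self) -> bool:
        return self.kind == "rocm"

    def is_xpu(self) -> bool:
        return self.kind == "xpu"


# Assumed minimum capabilities, chosen only to make the example concrete.
MIN_CAPABILITY = {"gguf": 60, "fp8": 89}


def is_quant_method_supported(platform: FakePlatform, quant_method: str) -> bool:
    # Platforms other than CUDA, ROCm, and XPU remain unsupported.
    if not (platform.is_cuda() or platform.is_rocm() or platform.is_xpu()):
        return False

    capability = platform.capability
    if capability is None:
        # No device capability reported (the XPU case): allow only GGUF.
        return platform.is_xpu() and quant_method == "gguf"

    # Capability-based check, unchanged for CUDA/ROCm.
    return capability >= MIN_CAPABILITY[quant_method]


xpu = FakePlatform(kind="xpu", capability=None)
print(is_quant_method_supported(xpu, "gguf"))  # XPU + gguf
print(is_quant_method_supported(xpu, "fp8"))   # XPU + non-gguf
```

Putting the `None` check before the capability comparison lets XPU return early without touching the CUDA/ROCm path, which matches the "preserved existing behavior" point in the description.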

Copilot AI self-assigned this Mar 14, 2026
Co-authored-by: zhenwei-intel <109187816+zhenwei-intel@users.noreply.github.com>
Copilot AI changed the title [WIP] Add support for GGUF model on XPU Enable GGUF support gating for XPU in quantization compatibility checks Mar 14, 2026
Copilot AI requested a review from zhenwei-intel March 14, 2026 09:34