[LLM] Validate accelerator_type for CPU VLLM engine config by LeThienTrong · Pull Request #64235 · ray-project/ray

LeThienTrong · 2026-06-20T17:28:26Z

Why are these changes needed?

This PR addresses #62138.

LLMConfig already rejects accelerator_type when the resolved hardware configuration is CPU-only. However, VLLMEngineConfig can still be instantiated directly with both accelerator_type and CPUConfig, which can make the accelerator hint inconsistent with a CPU-only configuration.

What changes were proposed in this pull request?

Added validation in VLLMEngineConfig to reject accelerator_type when accelerator_config is CPUConfig.
Added regression coverage for the invalid CPU config case.
Added a positive test case to ensure accelerator_type remains valid with a GPU config.
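
For illustration, here is a minimal sketch of this kind of check, assuming Pydantic v2. `EngineConfigSketch` and the stand-in `CPUConfig` / `GPUConfig` classes below are hypothetical simplifications, not the actual Ray classes or the code in this PR.

```python
from typing import Literal, Optional, Union

from pydantic import BaseModel, model_validator


class CPUConfig(BaseModel):
    """Hypothetical stand-in for a CPU-only accelerator config."""

    kind: Literal["cpu"] = "cpu"


class GPUConfig(BaseModel):
    """Hypothetical stand-in for a GPU accelerator config."""

    kind: Literal["gpu"] = "gpu"


class EngineConfigSketch(BaseModel):
    """Hypothetical, heavily reduced stand-in for VLLMEngineConfig."""

    accelerator_type: Optional[str] = None
    accelerator_config: Optional[Union[CPUConfig, GPUConfig]] = None

    @model_validator(mode="after")
    def _reject_accelerator_type_on_cpu(self):
        # An accelerator hint contradicts a CPU-only configuration.
        if self.accelerator_type is not None and isinstance(
            self.accelerator_config, CPUConfig
        ):
            raise ValueError(
                "accelerator_type must not be set when accelerator_config "
                "is CPUConfig"
            )
        return self


# The GPU case stays valid; the accelerator hint is kept as given.
gpu_engine = EngineConfigSketch(
    accelerator_type="L4", accelerator_config=GPUConfig()
)
print(gpu_engine.accelerator_type)
```

Under these assumptions, passing `accelerator_config=CPUConfig()` together with any `accelerator_type` fails at construction time, while omitting `accelerator_type` keeps the CPU configuration valid.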

How was this patch tested?

Added unit tests in python/ray/llm/tests/serve/cpu/configs/test_models.py.

I could not run the full test suite locally because Ray's compiled extension _raylet is not built in my local Windows source checkout.

gemini-code-assist

Code Review

This pull request introduces a validation check in VLLMEngineConfig to prevent the use of an accelerator_type with CPU-only configurations, raising an error if both are provided. It also adds corresponding unit tests to verify this behavior. The feedback points out that because VLLMEngineConfig is a Pydantic model, instantiating it with invalid values will raise a pydantic.ValidationError rather than a raw ValueError, and suggests updating the test assertion accordingly.
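
The review point can be sketched as follows, assuming Pydantic v2. `TinyEngineConfig`, its `use_cpu` flag, and the `construction_error` helper are hypothetical names used only for illustration; they are not part of Ray. The validator raises a plain `ValueError`, but the caller sees the error Pydantic wraps it in.

```python
from typing import Optional

from pydantic import BaseModel, ValidationError, model_validator


class TinyEngineConfig(BaseModel):
    """Hypothetical model; `use_cpu` stands in for passing a CPUConfig."""

    accelerator_type: Optional[str] = None
    use_cpu: bool = False

    @model_validator(mode="after")
    def _check_cpu_has_no_accelerator_type(self):
        if self.use_cpu and self.accelerator_type is not None:
            # Raised as ValueError here, surfaced by Pydantic as ValidationError.
            raise ValueError(
                "accelerator_type is not supported with a CPU config"
            )
        return self


def construction_error(**kwargs):
    """Return the exception raised while building the model, or None."""
    try:
        TinyEngineConfig(**kwargs)
    except Exception as exc:  # broad on purpose: we want to inspect the type
        return exc
    return None


err = construction_error(accelerator_type="L4", use_cpu=True)
print(type(err).__name__)
print(isinstance(err, ValidationError))
```

In a pytest-based test this corresponds to asserting with `pytest.raises(ValidationError)` around the constructor call.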

Signed-off-by: LeThienTrong <lethientrong655@gmail.com>

kouroshHakha · 2026-06-20T20:24:52Z

@LeThienTrong could u run the lint hook?

LeThienTrong requested a review from a team as a code owner June 20, 2026 17:28

gemini-code-assist Bot reviewed Jun 20, 2026

Comment thread python/ray/llm/tests/serve/cpu/configs/test_models.py

LeThienTrong added 2 commits June 21, 2026 00:39

Validate accelerator_type for CPU VLLM engine config

0b25f51

Signed-off-by: LeThienTrong <lethientrong655@gmail.com>

Use Pydantic validation error in VLLM engine config test

eb3d361

Signed-off-by: LeThienTrong <lethientrong655@gmail.com>

LeThienTrong force-pushed the fix-llm-accelerator-cpu-validation branch from 8e2ee46 to eb3d361 on June 20, 2026 17:40

ray-gardener Bot added the serve (Ray Serve Related Issue), llm, and community-contribution (Contributed by the community) labels Jun 20, 2026

kouroshHakha approved these changes Jun 20, 2026

LeThienTrong wants to merge 2 commits into ray-project:master from LeThienTrong:fix-llm-accelerator-cpu-validation
