test: improve pytest infrastructure and vLLM backend testing #416
Open

planetf1 wants to merge 1 commit into generative-computing:main
Conversation
Contributor
The PR description has been updated. Please fill out the template for your PR to be reviewed.

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit: Wonderful, this rule succeeded. Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
Force-pushed from 5a0cdf5 to d25e4d7 (Compare)
- Add pytest skip mechanism with capability detection and CLI options
- Implement process isolation for GPU-intensive vLLM tests
- Enhance test configuration with safe option registration
- Fix vLLM structured output token limits and update documentation
Force-pushed from d25e4d7 to cd5a632 (Compare)
jakelorocco reviewed Feb 6, 2026
jakelorocco (Contributor) left a comment:
The changes make sense to me. Can you please provide example outputs of the example and regular tests? I'd like to see what the vllm / heavy_gpu tests running in isolation look like.
psschwei reviewed Feb 6, 2026
        )
-       return backend
+       yield backend
+       # Cleanup: shutdown vLLM engine and release GPU memory
psschwei (Member):
Would it make sense to pull this code into a function that could be used both here and in test_vllm_tools.py rather than repeating it in both places?
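One way to act on this suggestion would be a shared helper that both test modules' fixtures call from their teardown. Everything here is hypothetical: the `cleanup_vllm_backend` name, the assumed `shutdown()` method on the backend, and the torch cache flush are illustrative, not the PR's code.

```python
# Hypothetical shared teardown helper, usable from both the tools and
# non-tools vLLM test modules instead of duplicating cleanup inline.
import contextlib
import gc


def cleanup_vllm_backend(backend) -> None:
    """Shut down a vLLM backend and release GPU memory (best effort)."""
    with contextlib.suppress(Exception):
        # Assumed shutdown hook; the real backend API may differ.
        backend.shutdown()
    gc.collect()
    try:
        import torch

        if torch.cuda.is_available():
            # Return cached CUDA allocations to the driver.
            torch.cuda.empty_cache()
    except ImportError:
        # CPU-only environments have nothing to flush.
        pass
```

A module fixture would then end with `yield backend` followed by `cleanup_vllm_backend(backend)`.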
    @pytest.fixture(scope="module")
    def backend():
        """Shared vllm backend for all tests in this module."""
        # Import cleanup dependencies at top to avoid scoping issues
Improve pytest infrastructure and vLLM backend testing
Type of PR
Description
Fixes #415
Enhances pytest infrastructure with capability detection and process isolation for GPU tests. Fixes process isolation to only activate on CUDA systems, preventing unnecessary overhead on macOS and systems without a GPU.
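The CUDA-only activation could be a single guard that the isolation logic consults. A minimal sketch, assuming torch is the detection mechanism (the PR may probe differently):

```python
# Gate process isolation on actual CUDA availability so macOS and
# CPU-only hosts skip the subprocess overhead entirely.
def cuda_available() -> bool:
    try:
        import torch

        return torch.cuda.is_available()
    except ImportError:
        # No torch installed: certainly no CUDA to isolate for.
        return False
```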
Key changes:
- CLI override options (--ignore-gpu-check, --ignore-ram-check, etc.)

Testing
Tested
Note that if any large tests are run on CUDA, pytest will enforce additional isolation, effectively batching the tests into groups.
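The grouping behaviour can be pictured as running each batch of heavy tests in its own pytest subprocess, so every group starts with a fresh CUDA context. This is an illustrative sketch only; the function name and the grouping policy are assumptions, not the PR's implementation.

```python
# Run each group of test paths in a separate pytest subprocess so
# GPU state from one batch cannot leak into the next.
import subprocess
import sys


def run_in_isolation(test_groups):
    """Run each group (a list of test paths) in its own pytest process.

    Returns the list of subprocess return codes, one per group.
    """
    results = []
    for group in test_groups:
        proc = subprocess.run(
            [sys.executable, "-m", "pytest", *group],
            capture_output=True,
            text=True,
        )
        results.append(proc.returncode)
    return results
```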