Skip to content

ci: add GPU test job using self-hosted runners#77

Open
camille-004 wants to merge 3 commits intomainfrom
cd/add-gpu-test-ci
Open

ci: add GPU test job using self-hosted runners#77
camille-004 wants to merge 3 commits intomainfrom
cd/add-gpu-test-ci

Conversation

@camille-004
Copy link
Collaborator

Add a test matrix job that runs on self-hosted GPU runners. Tests run inside Docker containers with --gpus all using the pre-built test images from GHCR. Also update all image tags to 2026-03-18 builds which include tileiras 13.2 (adds sm_86 support).

Description

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Add a test matrix job that runs on self-hosted GPU runners (AWS EC2
Ampere instances). Tests run inside Docker containers with --gpus all
using the pre-built test images from GHCR. Also update all image tags
to 2026-03-18 builds which include tileiras 13.2 (adds sm_86 support).
Docker containers run as root, so files created during tests (e.g.
.pytest_cache) are root-owned. Subsequent jobs on the same runner
fail when actions/checkout tries to clean the workspace. Fix by
restoring ownership after each test run.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant