Draft: CI/CD pipeline to test reference implementations by anandhu-eng · Pull Request #2585 · mlcommons/inference

anandhu-eng · 2026-05-05T15:34:54Z

This PR introduces a CI/CD pipeline to test reference implementations. It triggers on PRs raised against the inference repository. The goal is to automatically detect which benchmark is relevant based on the files changed, run it on our self-hosted MLC2 GPU server, and post the results back as a PR comment with an initial approval step.

When a PR is opened:

Approval gate: A maintainer reviews and approves the benchmark run via GitHub's environment protection UI.
GitHub App (hosted on MLC2): On approval, our GitHub App receives a workflow_run event and determines which repo the PR came from.
Routing logic:

Inference repo: Automatically detects which folder was modified (e.g. vision/classification_and_detection/yolo/) and dispatches the corresponding benchmark workflow. No extra input from the maintainer needed.
Endpoints repo: Routes based on PR labels (benchmark:llama3, benchmark:all, etc.) added by the maintainer before approval.

Benchmark runs on MLC2 via a self-hosted GitHub Actions runner in a private repo, keeping all infrastructure and credentials completely isolated from the public repo.
Results posted back as a comment on the originating PR.

TBD in inference repo to change the draft state of the PR

GitHub App to be installed on mlcommons/inference
GitHub Environment self-hosted-runner-benchmark-approval to be configured with required reviewers
GitHub repo mapping updated for production repo names

Note:

Currently the worfklow supports yolo benchmark only. More to be added in time

github-actions · 2026-05-05T15:35:11Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Draft: CI/CD pipeline to test reference implementations

ab7e209

anandhu-eng requested a review from a team as a code owner May 5, 2026 15:34

anandhu-eng marked this pull request as draft May 5, 2026 15:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draft: CI/CD pipeline to test reference implementations#2585

Draft: CI/CD pipeline to test reference implementations#2585
anandhu-eng wants to merge 1 commit into
masterfrom
anandhu-eng-patch-21

anandhu-eng commented May 5, 2026

Uh oh!

github-actions Bot commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

anandhu-eng commented May 5, 2026

Uh oh!

github-actions Bot commented May 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant