test(e2e): migrate test-skill-agent-e2e.sh to vitest by jyaunches · Pull Request #5222 · NVIDIA/NemoClaw

jyaunches · 2026-06-11T13:06:30Z

Summary

Migrate test/e2e/test-skill-agent-e2e.sh with the simplest equivalent Vitest coverage.

Related Issues

Refs #5098
Refs #2644

Contract mapping

Legacy assertion: Docker and NVIDIA API key prerequisites are required before live skill-agent validation.
- Replacement: test/e2e-scenario/live/skill-agent.test.ts checks docker info and NVIDIA_API_KEY shape.
- Boundary preserved: real host Docker probe and secret-gated live test.
Legacy assertion: the target OpenClaw sandbox can be created/recreated before skill-agent validation.
- Replacement: skill-agent.test.ts runs node bin/nemoclaw.js onboard --fresh --non-interactive --yes --yes-i-accept-third-party-software with NEMOCLAW_SANDBOX_NAME=e2e-skill-agent.
- Boundary preserved: real repo CLI/onboard/OpenShell/Docker sandbox creation.
- Scope note: this PR does not replace the legacy script's installer/PATH-discovery coverage (install.sh, shell profile/NVM/local-bin sourcing, installed nemoclaw lookup); this conversion preserves the skill-agent/onboard boundary and leaves installer fidelity to existing installer coverage / separate migrations.
Legacy assertion: skill-smoke-fixture is injected and queryable in sandbox skill roots.
- Replacement: skill-agent.test.ts invokes existing add-sandbox-skill.sh and verifies SKILL_SMOKE_VERIFY_K9X2 via openshell sandbox exec.
- Boundary preserved: real helper shell, sandbox SSH/exec, and filesystem checks.
Legacy assertion: the real OpenClaw agent reads SKILL.md and returns SKILL_SMOKE_VERIFY_K9X2, with retry/fuzzy token matching and external provider/transport timeout handling.
- Replacement: skill-agent.test.ts invokes existing verify-sandbox-skill-via-agent.sh, retries, fuzzy-checks the delimited agent output section, preserves helper fail-closed markers before fuzzy token success, and only skips timeout/rate-limit external flakes after fixture presence is proven; OpenClaw tool/runtime errors fail closed.
- Boundary preserved: real OpenClaw agent turn, provider call, sandbox session lock cleanup, and redacted artifacts.
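The ordering in the last mapping (fail-closed markers first, then the fuzzy token check, then flake skipping only once the fixture is proven present) can be sketched as below. This is an illustration, not the code in skill-agent.test.ts: the function names, the delimiter strings, the fail-closed patterns, and the flake patterns are all assumptions. Only the token SKILL_SMOKE_VERIFY_K9X2 comes from this PR.

```typescript
// Hypothetical sketch; names, delimiters, and patterns are assumptions.
type Verdict = "pass" | "fail" | "skip-external-flake" | "retry";

const VERIFY_TOKEN = "SKILL_SMOKE_VERIFY_K9X2"; // token named in the PR description
const BEGIN = "--- agent output begin ---"; // assumed delimiter
const END = "--- agent output end ---"; // assumed delimiter
const FAIL_CLOSED = [/HELPER_FAIL_CLOSED/, /tool error/i]; // assumed markers
const EXTERNAL_FLAKES = [/rate.?limit/i, /\b429\b/, /timed? ?out/i]; // assumed patterns

// Return only the text between the delimiters, or "" when either is missing.
function agentSection(raw: string): string {
  const start = raw.indexOf(BEGIN);
  if (start < 0) return "";
  const end = raw.indexOf(END, start + BEGIN.length);
  if (end < 0) return "";
  return raw.slice(start + BEGIN.length, end);
}

// Fuzzy match: ignore case and any separators the model inserts or drops.
function hasFuzzyToken(text: string): boolean {
  const squash = (s: string): string => s.toLowerCase().replace(/[^a-z0-9]/g, "");
  return squash(text).includes(squash(VERIFY_TOKEN));
}

function classifyAgentOutput(raw: string, fixtureProven: boolean): Verdict {
  // 1. Fail-closed markers win, even if the token also appears.
  if (FAIL_CLOSED.some((re) => re.test(raw))) return "fail";
  // 2. Token found inside the delimited agent section.
  if (hasFuzzyToken(agentSection(raw))) return "pass";
  // 3. External flakes may skip, but only once the fixture is proven present.
  if (EXTERNAL_FLAKES.some((re) => re.test(raw))) {
    return fixtureProven ? "skip-external-flake" : "fail";
  }
  // 4. Anything else is inconclusive; the caller bounds the retries.
  return "retry";
}
```

Checking the markers before the token means a helper failure cannot be masked by a token that happens to be echoed in the same output.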

Simplicity check

Test shape: simple live Vitest test.
Original runner/lane: .github/workflows/nightly-e2e.yaml job skill-agent-e2e, ubuntu-latest, Docker/OpenShell, NVIDIA_API_KEY, 30 minute timeout.
Replacement runner: .github/workflows/e2e-vitest-scenarios.yaml job skill-agent-vitest, same ubuntu-latest + Docker/OpenShell + NVIDIA_API_KEY runner class, 30 minute timeout.
New shared helpers: none; one-off parsing, cleanup, retry, fixture-presence, and local classifier checks stay local to the test.
New framework/registry/ledger: none.
Workflow changes: adds a dedicated skill-agent-vitest job, validates it in the workflow boundary test, and adds allowlisted free-standing jobs= dispatch support in e2e-vitest-scenarios.yaml.
Selective dispatch: gh workflow run e2e-vitest-scenarios.yaml --repo NVIDIA/NemoClaw --ref e2e-migrate/test-skill-agent-e2e -f jobs=skill-agent-vitest -f pr_number=5222.
Legacy shell deletion/workflow retirement: deferred to Epic: Migrate legacy bash E2E into the Vitest E2E system #5098 Phase 11.
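The allowlisted jobs= dispatch input mentioned above could be validated along these lines. This is a sketch only: `parseJobsSelector`, the comma-separated input format, and the contents of `FREE_STANDING_JOBS` are assumptions rather than the implementation in tools/e2e-scenarios/workflow-boundary.mts. The job names are the free-standing jobs listed in the results comments on this PR.

```typescript
// Hypothetical sketch; the real allowlist and parsing rules may differ.
const FREE_STANDING_JOBS = new Set<string>([
  "gateway-guard-recovery",
  "onboard-negative-paths-vitest",
  "openclaw-tui-chat-correlation-vitest",
  "openshell-version-pin-vitest",
  "skill-agent-vitest",
]);

// Parse a comma-separated `jobs` input (assumed format). An empty input means
// "no selection", which leaves the workflow's default behavior in place.
function parseJobsSelector(input: string): string[] {
  const requested = input
    .split(",")
    .map((name) => name.trim())
    .filter((name) => name.length > 0);
  const unknown = requested.filter((name) => !FREE_STANDING_JOBS.has(name));
  if (unknown.length > 0) {
    // Fail closed: an unrecognized name never reaches a job-level condition.
    throw new Error(`Unknown jobs requested: ${unknown.join(", ")}`);
  }
  // De-duplicate while keeping the requested order.
  return Array.from(new Set(requested));
}
```

Rejecting unknown names up front, instead of silently ignoring them, keeps a typo in a dispatch command from producing a run in which every job is skipped.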

Verification

npm run build:cli
npx tsc --noEmit --allowImportingTsExtensions --module NodeNext --moduleResolution NodeNext --target ES2022 --types vitest,node test/e2e-scenario/live/skill-agent.test.ts test/e2e-scenario/support-tests/e2e-scenarios-workflow.test.ts tools/e2e-scenarios/workflow-boundary.mts
NEMOCLAW_RUN_E2E_SCENARIOS=1 npx vitest run --project e2e-scenarios-live test/e2e-scenario/live/skill-agent.test.ts --silent=false --reporter=default (local: classifier tests pass; live test skips without NVIDIA_API_KEY)
npx vitest run --project e2e-vitest-support test/e2e-scenario/support-tests/e2e-scenarios-workflow.test.ts --silent=false --reporter=default
npx @biomejs/biome check test/e2e-scenario/live/skill-agent.test.ts test/e2e-scenario/support-tests/e2e-scenarios-workflow.test.ts
git diff --check
PR: test(e2e): migrate test-skill-agent-e2e.sh to vitest #5222
Same-runner selective run: https://github.com/NVIDIA/NemoClaw/actions/runs/27354513166 — skill-agent-vitest passed.
Same-runner selective run for fail-closed classifier update: https://github.com/NVIDIA/NemoClaw/actions/runs/27355475444 — skill-agent-vitest passed.
Latest same-runner selective run for workflow guard/cleanup hardening: https://github.com/NVIDIA/NemoClaw/actions/runs/27356507429 — pending.

Summary by CodeRabbit

New Features
- Added a gated live end-to-end skill-agent verification scenario.
- Added a live gateway guard-chain recovery regression scenario.
Tests
- Expanded e2e fixtures and helpers for sandbox/gateway disruption and recovery.
- Added broad unit and e2e tests for guard-chain, PID stability, log checks, and verification-token parsing.
Chores
- GitHub Actions: new standalone jobs, per-job dispatch/reporting, and workflow validation updates.
- Clarified test config to exclude live scenarios from local CLI runs.

coderabbitai · 2026-06-11T13:06:44Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Walkthrough

Adds gated live Vitest scenarios (skill-agent, gateway-guard-recovery), extends Sandbox/Gateway fixture clients with in-sandbox probe/disruption helpers, adds unit tests and ScriptedRunner, updates workflow to accept a jobs selector with new free-standing jobs, and tightens workflow-boundary validation and artifact rules.

Changes

E2E Tests & Fixtures

Layer / File(s)	Summary
Gateway client & probe env `test/e2e-scenario/fixtures/clients/gateway.ts`	Adds sandbox probe env, guard-chain markers, log path, option interfaces, and extends GatewayClient to accept a SandboxClient and probe in-sandbox state.
Sandbox disruption helpers `test/e2e-scenario/fixtures/clients/sandbox.ts`	Adds `openshellProbeEnv()`, changes `sandbox status` to `--name`, and adds `wipeGuardChain` and `killGatewayTree`.
Fixture wiring `test/e2e-scenario/fixtures/e2e-test.ts`	Adds `sandbox` fixture and wires `gateway` to `new GatewayClient(host, sandbox)`.
Recovery helper unit tests `test/e2e-scenario/support-tests/e2e-recovery-helpers.test.ts`	Adds `ScriptedRunner` test double and comprehensive tests for GatewayClient guard/log/PID helpers and SandboxClient disruption helpers.
Gateway recovery live test `test/e2e-scenario/live/gateway-guard-recovery.test.ts`, `test/e2e/test-issue-2478-crash-loop-recovery.sh`	New live test that simulates `/tmp` wipe and gateway kill, triggers probe-only recovery, reasserts guard chain and PID stability, and adds post-recovery contract assertion in script.
Skill-agent live scenario `test/e2e-scenario/live/skill-agent.test.ts`	New live scenario with helpers for output classification, SKILL.md probe, gating/prereqs, onboarding, skill injection, retried verification loop with token parsing and flake-skip logic, cleanup, and scenario result reporting.
Support-test adjustments & workflow template tests `test/e2e-scenario/support-tests/*`	Updates tests to reflect new client wiring and CLI arg changes; updates workflow-template expectations for the new job and selector usage.

Workflows, Validators & CI

Layer / File(s)	Summary
Dispatch input, concurrency, matrix passthrough `.github/workflows/e2e-vitest-scenarios.yaml`, `tools/e2e-scenarios/workflow-boundary.mts`	Adds `workflow_dispatch.inputs.jobs`, includes it in concurrency key, passes `JOBS` to matrix generation, validates requested jobs, and gates `live-scenarios` to run only when `inputs.jobs` is empty.
skill-agent Vitest workflow job `.github/workflows/e2e-vitest-scenarios.yaml`	Adds `skill-agent-vitest` free-standing job with Docker login retry logic, Node setup, CLI build, live Vitest execution of the skill-agent test, and artifact uploads.
gateway-guard-recovery workflow job `.github/workflows/e2e-vitest-scenarios.yaml`	Adds `gateway-guard-recovery` free-standing job with matrix gating, environment setup, OpenShell install, test run, and artifact upload.
Workflow boundary validator updates `tools/e2e-scenarios/workflow-boundary.mts`, `test/e2e-scenario/support-tests/e2e-scenarios-workflow.test.ts`	Adds `expectedFreeStandingJobSelector` and `validateSkillAgentVitestJob`, enforces job.if selector, required env/steps, artifact naming/retention, and matrix inclusion for standalone jobs; updates tests accordingly.
PR limit and vitest config `.github/workflows/pr-limit.yaml`, `vitest.config.ts`	Expands PR-limit EXEMPT list by two usernames and documents live-test exclusion in Vitest CLI project comments.

Sequence Diagram(s)

sequenceDiagram
  participant GitHubActions
  participant Job as skill-agent-vitest
  participant Docker
  participant Nemoclaw as NemoclawOnboard
  participant Sandbox
  participant SkillInjector
  participant Agent
  GitHubActions->>Job: triggered via inputs.jobs selector
  Job->>Docker: attempt docker login (up to 3 tries) / anonymous fallback
  Job->>Nemoclaw: run onboarding with injected env
  Nemoclaw->>Sandbox: create/recreate sandbox
  Job->>SkillInjector: run add-skill helper to inject fixture
  SkillInjector->>Sandbox: place SKILL.md fixture
  Job->>Agent: run verification script (retry loop)
  Agent->>Sandbox: read SKILL.md and verify
  Agent-->>Job: exit code + delimited agent output
  Job->>Job: evaluate output, detect flakes, write scenario-result.json
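The retry loop around the verification step in the diagram can be sketched as a small bounded-retry helper. `retryUntil`, its parameters, and the fixed delay are assumptions for illustration; the actual test may use different limits or a different backoff.

```typescript
// Hypothetical sketch; attempt count and delay are assumed values.
type Attempt<T> = { done: boolean; value: T };

async function retryUntil<T>(
  run: (attempt: number) => Promise<Attempt<T>>,
  maxAttempts = 3,
  delayMs = 1000,
): Promise<T> {
  if (maxAttempts < 1) throw new Error("maxAttempts must be at least 1");
  let last: Attempt<T> | undefined;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    last = await run(attempt);
    if (last.done) return last.value; // conclusive result: stop retrying
    if (attempt < maxAttempts) {
      // Wait before the next attempt, but not after the final one.
      await new Promise<void>((resolve) => setTimeout(resolve, delayMs));
    }
  }
  // Out of attempts: return the last inconclusive value for the caller to judge.
  return (last as Attempt<T>).value;
}
```

A caller would mark an attempt as `done` for a conclusive pass or fail and leave it `false` for an inconclusive turn, so only inconclusive outcomes consume further attempts.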

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related issues

Epic: Migrate legacy bash E2E into the Vitest E2E system #5098: Migration and consolidation of shell E2E into TypeScript/Vitest and fixture/workflow wiring overlaps with this PR's changes.

Possibly related PRs

NVIDIA/NemoClaw#5243: Overlaps on adding workflow_dispatch.inputs.jobs selector and related workflow-boundary updates.
test(e2e): failing-test-first guard for #2701 + recovery framework helpers #5049: Related additions touching gateway recovery helpers and free-standing workflow jobs.
test(e2e): migrate test-openshell-version-pin.sh to free-standing Vitest live test #5107: Related gating/selector adjustments for free-standing live test jobs.

Suggested labels

area: e2e, chore

Suggested reviewers

cv
prekshivyas

Poem

🐰 In sandbox fields the SKILL.md sings,
The agent hops and fetches golden things.
Retries and probes in jittered, patient song,
The token found — the test reports "passed" along.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 8.33% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly and accurately describes the main change: migrating a shell-based e2e test (test-skill-agent-e2e.sh) to a Vitest test suite.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

github-actions · 2026-06-11T13:08:12Z

E2E Advisor Recommendation

Required E2E: skill-agent-vitest
Optional E2E: network-policy-vitest, onboard-negative-paths-vitest

Dispatch hint: skill-agent-vitest

Workflow run

Full advisor summary

E2E Recommendation Advisor

Base: origin/main
Head: HEAD
Confidence: high

Required E2E

skill-agent-vitest (high; live sandbox plus external NVIDIA provider/agent turn): Directly validates the newly added workflow job and live test path for skill-agent: repo CLI onboarding, OpenShell installation, sandbox lifecycle, NVIDIA_API_KEY handling, skill injection, real OpenClaw agent verification, and artifact upload behavior.

Optional E2E

network-policy-vitest (high; live sandbox/policy validation): Optional adjacent confidence for secret-bearing live workflow safety and sandbox network/security boundaries, but this PR does not modify network policy runtime behavior directly.
onboard-negative-paths-vitest (medium): Optional quick-ish confidence for CLI onboarding failure-path behavior because the new skill-agent lane relies on onboarding, but the PR does not change onboarding implementation.

New E2E recommendations

None.

Dispatch hint

Workflow: .github/workflows/e2e-vitest-scenarios.yaml
jobs input: skill-agent-vitest

github-actions · 2026-06-11T13:08:13Z

Vitest E2E Scenario Recommendation

Required Vitest E2E scenarios: skill-agent-vitest
Optional Vitest E2E scenarios: None

Dispatch required Vitest E2E scenarios:

gh workflow run e2e-vitest-scenarios.yaml --ref <pr-head-ref> --field jobs=skill-agent-vitest

Workflow run

Full Vitest E2E advisor summary

Vitest E2E Scenario Advisor

Base: origin/main
Head: HEAD
Confidence: high

Required Vitest E2E scenarios

skill-agent-vitest: Focused free-standing Vitest job wired for changed live test test/e2e-scenario/live/skill-agent.test.ts.
- Dispatch: gh workflow run e2e-vitest-scenarios.yaml --ref <pr-head-ref> --field jobs=skill-agent-vitest

Optional Vitest E2E scenarios

None.

Relevant changed files

.github/workflows/e2e-vitest-scenarios.yaml
test/e2e-scenario/live/skill-agent.test.ts
test/e2e-scenario/support-tests/e2e-scenarios-workflow.test.ts
tools/e2e-scenarios/workflow-boundary.mts

coderabbitai

🧹 Nitpick comments (1)

test/e2e-scenario/live/skill-agent.test.ts (1)

74-85: ⚡ Quick win

Consider refactoring the inline shell script for readability.

Line 78 contains a 500+ character inline shell script that is difficult to read and maintain. Consider extracting it to a multi-line template literal or a separate helper function.

♻️ Suggested refactor

 async function verifySkillFixturePresent(
   sandbox: SandboxClient,
   sandboxName: string,
 ): Promise<boolean> {
-  const script = `token=${shellQuote(VERIFY_PHRASE)}; skill=${shellQuote(SKILL_ID)}; found=0; for path in "/sandbox/.openclaw/skills/${SKILL_ID}/SKILL.md" "\${HOME:-/home/sandbox}/.openclaw/skills/${SKILL_ID}/SKILL.md" "/home/sandbox/.openclaw/skills/${SKILL_ID}/SKILL.md" "/home/openclaw/.openclaw/skills/${SKILL_ID}/SKILL.md"; do if [ -f "$path" ] && grep -Fq "$token" "$path"; then echo "SKILL_TOKEN_PATH=$path"; found=1; fi; done; test "$found" = 1`;
+  const script = `
+    token=${shellQuote(VERIFY_PHRASE)}
+    skill=${shellQuote(SKILL_ID)}
+    found=0
+    for path in \\
+      "/sandbox/.openclaw/skills/${SKILL_ID}/SKILL.md" \\
+      "\${HOME:-/home/sandbox}/.openclaw/skills/${SKILL_ID}/SKILL.md" \\
+      "/home/sandbox/.openclaw/skills/${SKILL_ID}/SKILL.md" \\
+      "/home/openclaw/.openclaw/skills/${SKILL_ID}/SKILL.md"
+    do
+      if [ -f "$path" ] && grep -Fq "$token" "$path"; then
+        echo "SKILL_TOKEN_PATH=$path"
+        found=1
+      fi
+    done
+    test "$found" = 1
+  `.trim();
   const result = await sandbox.execShell(sandboxName, trustedSandboxShellScript(script), {

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/e2e-scenario/live/skill-agent.test.ts` around lines 74 - 85, The inline
one-liner shell script assigned to the variable script in
verifySkillFixturePresent is hard to read and maintain; refactor by extracting
the shell logic into a clearly formatted multi-line template literal or a small
helper function (e.g., buildVerifySkillScript or verifySkillShell) and then pass
the returned script into trustedSandboxShellScript when calling
sandbox.execShell; keep the same token/tokenization (VERIFY_PHRASE, SKILL_ID)
and preserve env/timeout/artifactName behavior.
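One way to act on this suggestion while still sending the sandbox a single-line command (the commit list for this PR includes "keep skill probe command single-line") is to assemble the script from named parts. The sketch below is illustrative only: `buildVerifySkillScript` is the helper name floated in the comment above, this `shellQuote` is a stand-in and not the repository's helper of the same name, and using the shell variable `$skill` inside the paths is a choice made for the sketch.

```typescript
// Hypothetical sketch; shellQuote here is a stand-in, not the repo's helper.
function shellQuote(value: string): string {
  // POSIX single-quote escaping: close the quote, emit \', reopen the quote.
  return "'" + value.replace(/'/g, "'\\''") + "'";
}

function buildVerifySkillScript(token: string, skillId: string): string {
  // Candidate skill roots, taken from the review diff. They stay inside double
  // quotes so the sandbox shell expands $skill and ${HOME:-...} itself.
  const candidates = [
    '"/sandbox/.openclaw/skills/$skill/SKILL.md"',
    '"${HOME:-/home/sandbox}/.openclaw/skills/$skill/SKILL.md"',
    '"/home/sandbox/.openclaw/skills/$skill/SKILL.md"',
    '"/home/openclaw/.openclaw/skills/$skill/SKILL.md"',
  ];
  // Body of the loop; "then" and "do" take no trailing semicolon of their own.
  const loopBody = [
    'if [ -f "$path" ] && grep -Fq "$token" "$path"',
    'then echo "SKILL_TOKEN_PATH=$path"',
    "found=1",
    "fi",
  ].join("; ");
  return [
    `token=${shellQuote(token)}`,
    `skill=${shellQuote(skillId)}`,
    "found=0",
    `for path in ${candidates.join(" ")}`,
    `do ${loopBody}`,
    "done",
    'test "$found" = 1',
  ].join("; ");
}
```

The result would then be passed through `trustedSandboxShellScript` exactly as the one-liner is today. Keeping `do` and `then` attached to the statement that follows them avoids emitting `do;` or `then;`, which a POSIX shell rejects.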

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 1426949d-9f5d-4057-a085-174ac87fe5ca

📥 Commits

Reviewing files that changed from the base of the PR and between 6622476 and ba4bb37.

📒 Files selected for processing (1)

test/e2e-scenario/live/skill-agent.test.ts

github-actions · 2026-06-11T13:11:24Z

PR Review Advisor

Findings: 0 needs attention, 1 worth checking, 0 nice ideas
Top item: PR review advisor unavailable

Review findings

🛠️ Needs attention

None.

🔎 Worth checking

PR review advisor unavailable: The automated advisor could not complete: Could not parse JSON from PR review advisor output; see /home/runner/work/NemoClaw/NemoClaw/artifacts/pr-review-advisor/pr-review-advisor-raw-output.txt
- Recommendation: Re-run the PR Review Advisor or perform a manual review.
- Evidence: Could not parse JSON from PR review advisor output; see /home/runner/work/NemoClaw/NemoClaw/artifacts/pr-review-advisor/pr-review-advisor-raw-output.txt

🌱 Nice ideas

None.

Consider writing more tests for

**Runtime validation**: Add or identify targeted runtime/integration validation for the changed behavior; do not report external E2E job pass/fail here. Runtime/sandbox/infrastructure paths need behavioral runtime validation: .github/workflows/e2e-vitest-scenarios.yaml, tools/e2e-scenarios/workflow-boundary.mts.

Workflow run details

This is an automated advisory review. A human maintainer must make the final merge decision.

github-actions · 2026-06-11T14:40:56Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27354513166
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 2 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success

github-actions · 2026-06-11T14:49:17Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27355031920
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 2 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success

github-actions · 2026-06-11T17:25:43Z

Vitest E2E Scenario Results — ❌ Some jobs failed

Run: 27364843413
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: (default — all free-standing when no scenarios are requested)
Summary: 5 passed, 3 failed, 0 skipped

Job	Result
gateway-guard-recovery	❌ failure
generate-matrix	✅ success
live-scenarios	❌ failure
onboard-negative-paths-vitest	✅ success
openclaw-tui-chat-correlation-vitest	❌ failure
openshell-version-pin-vitest	✅ success
skill-agent-vitest	✅ success
validate-jobs	✅ success

Failed jobs: gateway-guard-recovery, live-scenarios, openclaw-tui-chat-correlation-vitest. Check run artifacts for logs.

github-actions · 2026-06-11T17:36:57Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27365416785
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 3 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success
validate-jobs	✅ success

github-actions · 2026-06-11T17:42:29Z

Vitest E2E Scenario Results — ❌ Some jobs failed

Run: 27365712997
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: (default — all free-standing when no scenarios are requested)
Summary: 5 passed, 3 failed, 0 skipped

Job	Result
gateway-guard-recovery	❌ failure
generate-matrix	✅ success
live-scenarios	❌ failure
onboard-negative-paths-vitest	✅ success
openclaw-tui-chat-correlation-vitest	❌ failure
openshell-version-pin-vitest	✅ success
skill-agent-vitest	✅ success
validate-jobs	✅ success

Failed jobs: gateway-guard-recovery, live-scenarios, openclaw-tui-chat-correlation-vitest. Check run artifacts for logs.

github-actions · 2026-06-11T18:20:44Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27368166084
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 3 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success
validate-jobs	✅ success

github-actions · 2026-06-11T18:28:05Z

Vitest E2E Scenario Results — ❌ Some jobs failed

Run: 27368570241
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: (default — all free-standing when no scenarios are requested)
Summary: 5 passed, 3 failed, 0 skipped

Job	Result
gateway-guard-recovery	❌ failure
generate-matrix	✅ success
live-scenarios	❌ failure
onboard-negative-paths-vitest	✅ success
openclaw-tui-chat-correlation-vitest	❌ failure
openshell-version-pin-vitest	✅ success
skill-agent-vitest	✅ success
validate-jobs	✅ success

Failed jobs: gateway-guard-recovery, live-scenarios, openclaw-tui-chat-correlation-vitest. Check run artifacts for logs.

github-actions · 2026-06-11T18:48:38Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27369508373
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 3 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success
validate-jobs	✅ success

github-actions · 2026-06-11T19:02:34Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27370270382
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 3 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success
validate-jobs	✅ success

github-actions · 2026-06-11T19:20:56Z

Vitest E2E Scenario Results — ✅ All jobs passed

Run: 27371575011
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: skill-agent-vitest
Summary: 3 passed, 0 failed, 5 skipped

Job	Result
gateway-guard-recovery	⏭️ skipped
generate-matrix	✅ success
live-scenarios	⏭️ skipped
onboard-negative-paths-vitest	⏭️ skipped
openclaw-tui-chat-correlation-vitest	⏭️ skipped
openshell-version-pin-vitest	⏭️ skipped
skill-agent-vitest	✅ success
validate-jobs	✅ success

github-actions · 2026-06-11T19:31:30Z

Vitest E2E Scenario Results — ❌ Some jobs failed

Run: 27371918493
Workflow ref: e2e-migrate/test-skill-agent-e2e
Requested scenarios: (default — all supported)
Requested jobs: (default — all free-standing when no scenarios are requested)
Summary: 5 passed, 3 failed, 0 skipped

Job	Result
gateway-guard-recovery	❌ failure
generate-matrix	✅ success
live-scenarios	❌ failure
onboard-negative-paths-vitest	✅ success
openclaw-tui-chat-correlation-vitest	❌ failure
openshell-version-pin-vitest	✅ success
skill-agent-vitest	✅ success
validate-jobs	✅ success

Failed jobs: gateway-guard-recovery, live-scenarios, openclaw-tui-chat-correlation-vitest. Check run artifacts for logs.

wscurran · 2026-06-12T14:24:53Z

✨
Related open issues:

test(e2e): add skill agent Vitest coverage

ba4bb37

coderabbitai Bot reviewed Jun 11, 2026

View reviewed changes

jyaunches changed the title ~~test(e2e): add skill-agent Vitest coverage~~ Migrate test-skill-agent-e2e.sh to vitest Jun 11, 2026

jyaunches changed the title ~~Migrate test-skill-agent-e2e.sh to vitest~~ test(e2e): migrate test-skill-agent-e2e.sh to vitest Jun 11, 2026

jyaunches added 2 commits June 11, 2026 09:21

test(e2e): address skill agent review feedback

3d90c7b

ci(e2e): wire skill agent Vitest job

5a3f792

jyaunches changed the title ~~test(e2e): migrate test-skill-agent-e2e.sh to vitest~~ test(e2e): migrate test-skill-agent-e2e.sh to Vitest Jun 11, 2026

jyaunches added 12 commits June 11, 2026 09:43

test(e2e): keep skill probe command single-line

b282f16

test(e2e): fix skill probe shell syntax

7d27cd7

test(e2e): skip skill agent on provider outage

5750bba

ci(e2e): allow selective Vitest jobs

50ceb08

test(e2e): add skill agent Vitest coverage

dc492b8

test(e2e): address skill agent review feedback

422abb0

ci(e2e): wire skill agent Vitest job

7488799

test(e2e): keep skill probe command single-line

ef80221

test(e2e): fix skill probe shell syntax

449c8d6

test(e2e): skip skill agent on provider outage

8ebd2c5

ci(e2e): allow selective Vitest jobs

d4550b7

Merge branch 'e2e-migrate/test-skill-agent-e2e'

379ee2b

ci(e2e): allow selective Vitest job dispatch

2095029

jyaunches changed the title ~~test(e2e): migrate test-skill-agent-e2e.sh to Vitest~~ test(e2e): migrate test-skill-agent-e2e.sh to vitest Jun 11, 2026

jyaunches changed the title ~~test(e2e): migrate test-skill-agent-e2e.sh to vitest~~ test(e2e): P2 independent 9 migrate test-skill-agent-e2e.sh to vitest Jun 11, 2026

test(e2e): preserve skill agent fail-closed checks

e7152aa

jyaunches changed the title ~~test(e2e): P2 independent 9 migrate test-skill-agent-e2e.sh to vitest~~ test(e2e): migrate test-skill-agent-e2e.sh to Vitest Jun 11, 2026

Merge remote-tracking branch 'origin/main' into e2e-migrate/test-skil…

c48831d

…l-agent-e2e

Merge remote-tracking branch 'origin/main' into e2e-migrate/test-skil…

e8f1691

…l-agent-e2e

Merge remote-tracking branch 'origin/main' into e2e-migrate/test-skil…

a8065bf

…l-agent-e2e

jyaunches added 2 commits June 11, 2026 14:50

Merge remote-tracking branch 'origin/main' into e2e-migrate/test-skil…

13c1c2b

…l-agent-e2e

Merge remote-tracking branch 'origin/main' into e2e-migrate/test-skil…

464bdcf

…l-agent-e2e

test(e2e): install openshell for skill-agent vitest

bdb6887

cv approved these changes Jun 11, 2026

View reviewed changes

cv added v0.0.65 Release target and removed v0.0.64 Release target labels Jun 12, 2026

Merge remote-tracking branch 'origin/main' into e2e-migrate/test-skil…

e51c162

…l-agent-e2e # Conflicts: # .github/workflows/e2e-vitest-scenarios.yaml # test/e2e-scenario/support-tests/e2e-scenarios-workflow.test.ts # tools/e2e-scenarios/workflow-boundary.mts

wscurran added the area: e2e and chore labels Jun 12, 2026

jyaunches merged commit d95fcd0 into main Jun 12, 2026
38 checks passed

jyaunches deleted the e2e-migrate/test-skill-agent-e2e branch June 12, 2026 14:34

This was referenced Jun 12, 2026

test(e2e): migrate test-shields-config.sh to vitest #5337

Merged

test(e2e): migrate test-sandbox-survival.sh to vitest #5332

Merged

test(e2e): migrate OpenClaw skill CLI scenario #5354

Merged

cv mentioned this pull request Jun 13, 2026

Epic: Migrate legacy bash E2E into the Vitest E2E system #5098

Open

79 tasks

This was referenced Jun 13, 2026

fix(e2e): route nightly hosted inference as custom #5385

Merged

test(e2e): migrate test-snapshot-commands.sh to vitest #5346

Open
