feat(e2e): Playwright E2E for all 6 Mothership question types + CI pipeline (#423) by OgnjenGligoric · Pull Request #444 · andresharpe/dotbot

OgnjenGligoric · 2026-05-22T12:50:15Z

Summary

Closes #423.

This PR adds end-to-end Playwright coverage for all six Mothership question delivery types and wires the test suite into CI so it runs automatically on every PR.

What was done

New question type coverage (Layer 5 E2E)

Previously only singleChoice, approval, and documentReview were covered. Added the remaining three:

multiChoice — radio selection (same UI as singleChoice; type distinction handled server-side)
freeText — textarea fill + submit
priorityRanking — drag-and-drop list; JS serialises rankedItemsJson on submit; tests use default order (no drag simulation needed in headless)

Server: TestModeEndpoints.cs

Extended TestResponseRequest with an optional List<RankedItem>? RankedItems field so the /api/test/responses inject endpoint can persist priority ranking payloads directly, matching what the real respond form produces. Validation updated to accept rankedItems as a valid payload alongside selectedKey, freeText, and attachments.

Test seeder: Test-E2E-Mothership-QA.ps1

Added fixture definitions for multiChoice, freeText, and priorityRanking
Added -Headed switch for local debugging (opens real Chromium window)
Corrected layer label from Layer 4 → Layer 5 throughout

Playwright spec: mothership-question-flow.spec.ts

Extended Scenario interface with freeText? and rankedItems? submit fields
Render test checks textarea[name="freeText"] and .rank-item for new types
Submit test fills textarea / no-ops for ranking
Inject test routes payload by type: freeText → freeText field, priorityRanking → rankedItems, others → selectedKey
Verify assertions check type-appropriate response fields

CI: test.yml

Added test-mothership job (Layer 5, Ubuntu) that runs on every PR without manual trigger:

Builds the .NET server (Release)
Installs Playwright + Chromium
Starts Azurite, seeds answers and conversation-references containers
Starts DotbotServer with DOTBOT_TEST_MODE=true and BlobStorage__Backend=Local
Health-polls until server is up (fails the job if it never starts)
Runs Test-E2E-Mothership-QA.ps1 — seeds all 6 question types, mints magic-link JWTs, and runs 3 Playwright assertions per type (18 total)
Uploads Playwright HTML report as an artifact on failure

Docs: server/docs/MOTHERSHIP-E2E-SETUP.md

Updated to document all 6 question types and the rankedItems inject support. Running locally:

# Terminal 1
azurite --skipApiVersionCheck --location C:\azurite

# Terminal 2
az storage container create --name answers --connection-string "UseDevelopmentStorage=true"
az storage container create --name conversation-references --connection-string "UseDevelopmentStorage=true"
cd server/src/Dotbot.Server && dotnet run --launch-profile http-test

# Terminal 3 (from repo root)
$env:DOTBOT_SERVER_URL = "http://localhost:5048"
$env:DOTBOT_API_KEY    = "******"
pwsh tests/Test-E2E-Mothership-QA.ps1 -Headed   # omit -Headed for CI/headless

Add a new GitHub Actions job (test-mothership) to run Layer 5 Mothership E2E tests with Playwright: sets up .NET/Node, caches Playwright browsers, installs Azurite, seeds blob containers, starts DotbotServer in test mode and runs the Playwright suite, uploading reports on failure. Extend server test endpoints to accept and persist RankedItems (validation, response model, and injection support). Update the local E2E PS runner (tests/Test-E2E-Mothership-QA.ps1) to mark Layer 5, add a --Headed flag, and seed additional scenarios (multiChoice, freeText, priorityRanking). Update the Playwright spec to handle multiChoice, freeText and priorityRanking flows, inject rankedItems, and verify persisted responses. Update docs to list all six question types covered by the tests.

…ARIOS not set Prevents the spec from throwing at load time in the standard Layer 5 UI regression run, which does not seed the scenario manifest. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Set PLAYWRIGHT_HEADED in the PowerShell test runner instead of passing --headed, and clean it up after the run. The Playwright config now reads process.env.PLAYWRIGHT_HEADED to determine headless mode (headless = !PLAYWRIGHT_HEADED). This centralizes headed control via an environment variable and removes the previous --headed CLI argument handling.

Modify the GitHub Actions test workflow to redirect the Dotbot server stdout/stderr to /tmp/dotbot-server.log and run it in the background, saving its PID to /tmp/dotbot-server.pid. On the health-check timeout path, print the server startup log to aid debugging of CI failures. This adds visibility into server startup issues when E2E tests fail to connect.

OgnjenGligoric · 2026-05-22T13:38:07Z

1. CI secrets in workflow

The test-mothership job needs two values to start the server in CI:

BlobStorage__ConnectionString: 'UseDevelopmentStorage=true' (Azurite local emulator — not a real secret)
Auth__JwtSigningKey: 'ci-test-signing-key-32-chars-min!!' (throwaway JWT key used only for test magic-link tokens)

These are committed in plain text in .github/workflows/test.yml. Since this is a test-only context with no access to real systems, it may be acceptable — but worth confirming:

Question: Should these be moved to GitHub Actions repository secrets (${{ secrets.DOTBOT_CI_JWT_KEY }} etc.), or is hardcoding throwaway CI-only values acceptable for this repo?

2. Why this was discovered — server startup error log

The server was silently crashing during CI. Added log capture to the health-check step to surface it. The actual error was:

[FTL] Application terminated unexpectedly
System.InvalidOperationException: Either BlobStorage:AccountUri or BlobStorage:ConnectionString must be configured

Root cause: --no-launch-profile wasn't set, so launchSettings.json was loaded but appsettings.Development.json was NOT read (env vars from the env: block weren't passed through correctly). Fixed by adding --no-launch-profile and supplying the required config explicitly via env vars.

Question: Is it acceptable to keep the server startup log capture (cat /tmp/dotbot-server.log) in the health-check step permanently, or should it be a separate debug step gated on failure only?

IBondarenko-iwg

@OgnjenGligoric Two issues worth fixing.

1. CI failure — missing Azurite connection string

Server crashes immediately with:

System.InvalidOperationException: Either BlobStorage:AccountUri or BlobStorage:ConnectionString must be configured

BlobStorage__Backend=Local is set but Program.cs:100 still validates that AccountUri or ConnectionString exists regardless of backend.

Fix: Add to the Start DotbotServer env block in test.yml:

BlobStorage__ConnectionString: 'UseDevelopmentStorage=true'
This is the standard Azurite connection string — validation passes naturally, no server code changes needed.

2. Layer inconsistency

Test-E2E-Mothership-QA.ps1 header says Layer 5. test.yml job name says Layer 5. But Run-Tests.ps1 puts it in the Layer 4 block alongside the real Claude E2E tests. This means Run-Tests.ps1 -Layer 5 does not run the Mothership tests, and -Layer 4 runs them only if ANTHROPIC_API_KEY is present (scheduled/manual CI only).

Since this test needs Playwright + Azurite but not Claude API credentials, it belongs in Layer 5. Move $mothershipExit from the if (4 -in $layersToRun) block to a if (5 -in $layersToRun) block in Run-Tests.ps1.

Copilot

Pull request overview

Adds a new Layer 5 end-to-end test suite for the Mothership “respond via magic link” web flow using Playwright, expands server test-mode support to inject priorityRanking responses, and wires the E2E suite into CI so it runs automatically on PRs.

Changes:

Added a PowerShell seeder/runner that creates templates + instances for all 6 Mothership question types and runs Playwright assertions.
Extended /api/test/responses (test mode) to accept and persist rankedItems payloads.
Added a dedicated GitHub Actions job to spin up Azurite + DotbotServer and run the Playwright suite; added local setup docs.

Reviewed changes

Copilot reviewed 7 out of 8 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`tests/Test-E2E-Mothership-QA.ps1`	New seeding + Playwright runner script for the Mothership respond flow across all question types.
`tests/Run-Tests.ps1`	Hooks the new Mothership E2E script into the layered test runner.
`tests/e2e/specs/mothership-question-flow.spec.ts`	New Playwright spec validating render/submit/storage for each seeded scenario.
`tests/e2e/playwright.config.ts`	Adds `PLAYWRIGHT_HEADED`-controlled headless toggle for Chromium runs.
`server/src/Dotbot.Server/TestModeEndpoints.cs`	Extends response injection validation/model to support `rankedItems`.
`server/docs/MOTHERSHIP-E2E-SETUP.md`	New/updated local setup documentation for running the Playwright + Azurite E2E suite.
`.gitignore`	Ignores Azurite local artifacts.
`.github/workflows/test.yml`	Adds `test-mothership` CI job to run the new Mothership Playwright E2E suite on PRs.

Comments suppressed due to low confidence (2)

tests/Test-E2E-Mothership-QA.ps1:244

For question types without Options (e.g., freeText), $Qt.Options is $null, so $options becomes $null and options = @($options) serializes as [null]. The server validator rejects this (options[0] must not be null) because QuestionTemplate.Options is required. Build options as an empty array when there are no options (or conditionally omit the ForEach and set options = @()).

    $options = $Qt.Options | ForEach-Object {
        @{
            optionId    = [guid]::NewGuid().ToString()
            key         = $_.key
            title       = $_.label
        }
    }

    $body = @{
        questionId         = [guid]::NewGuid().ToString()
        version            = 1
        type               = $Qt.Type
        title              = $Qt.Title
        context            = "Playwright E2E test fixture"
        deliverableSummary = if ($Qt.ContainsKey('DeliverableSummary')) { $Qt.DeliverableSummary } else { $null }
        options            = @($options)
        project            = @{

tests/Test-E2E-Mothership-QA.ps1:272

New-Instance hard-codes channel = "email", but DeliveryChannels:Email:Enabled is false by default (and the CI job doesn’t enable it), so /api/instances will return 400 “Delivery channel 'email' is not enabled…”. Use the default channel (omit channel) or switch to teams, or explicitly enable email in the CI server env if email is intended.

    # Use 'email' channel — delivery will fail (no SMTP) but the instance record
    # is persisted before delivery attempts, so /respond can still render it.
    $body = @{
        projectId       = $projectId
        questionId      = $QuestionId
        questionVersion = $Version
        channel         = "email"
        recipients      = @{ emails = @($testRecipient) }
    }

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Move the Layer 5 Mothership Playwright job out of .github/workflows/test.yml into .github/workflows/server.yml and remove the duplicate job. The new job provisions .NET/Node/PowerShell, installs Playwright (with browser cache and deps), seeds Azurite, starts the Dotbot server, waits for health, runs the Mothership E2E PowerShell runner, and uploads Playwright artifacts on failure. Also update docs and test scripts: clarify how to set BlobStorage connection string and how to run headed Playwright locally in server/docs/MOTHERSHIP-E2E-SETUP.md; adjust tests/Run-Tests.ps1 to move the Mothership test from layer 4 into layer 5 and update layer result logic; and improve tests/Test-E2E-Mothership-QA.ps1 to generate optionIds for templates, return OptionIds with the template, build rankedItems from those OptionIds for priorityRanking submits, and change the default test channel to 'teams' (so instances persist reliably).

Include tests/e2e/** and tests/Test-E2E-Mothership-QA.ps1 in the server workflow triggers for push and pull_request so CI runs when E2E test files change. Update .gitignore to ignore Azurite local storage DB files created during local E2E testing and add explanatory comments.

Add ASPNETCORE_ENVIRONMENT='Development' to the Start DotbotServer job in .github/workflows/server.yml so the app runs with the Development configuration during CI. This ensures development-specific settings are used when running the server in tests.

Copilot

Pull request overview

Copilot reviewed 7 out of 8 changed files in this pull request and generated 5 comments.

Rename and relocate the Playwright E2E suite from tests/e2e to tests/e2e-server. Add package.json, package-lock.json, playwright.config.ts, tsconfig.json, and .gitignore for the new suite, and move the mothership question flow spec. Update the spec to use DOTBOT_SERVER_URL (with a localhost fallback). Adjust GitHub Actions workflow to watch the new path, update npm cache dependency path, Playwright cache key, and artifact paths. Also update the PowerShell test runner to point at the new e2e-server directory.

Update .github/workflows/server.yml to use the tests/e2e-server directory for Playwright-related steps. Changed working-directory for: Install Playwright npm dependencies, Install Playwright Chromium + system deps, and Install Playwright system deps only (cached browser). No behavioral changes beyond using the new test folder path.

OgnjenGligoric added 3 commits May 21, 2026 14:09

initial testing setup

5283daa

multiple choice questions tested.

2390535

github-project-automation Bot added this to Dotbot v4 Roadmap May 22, 2026

github-project-automation Bot moved this to Inbox in Dotbot v4 Roadmap May 22, 2026

OgnjenGligoric and others added 3 commits May 22, 2026 15:11

fix(e2e): skip mothership spec gracefully when DOTBOT_MOTHERSHIP_SCEN…

aab71b3

…ARIOS not set Prevents the spec from throwing at load time in the standard Layer 5 UI regression run, which does not seed the scenario manifest. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

IBondarenko-iwg reviewed May 22, 2026

View reviewed changes

DKuleshov reviewed May 25, 2026

View reviewed changes

Comment thread tests/Run-Tests.ps1 Outdated

Comment thread .gitignore

Comment thread .github/workflows/test.yml Outdated

DKuleshov requested a review from Copilot May 25, 2026 08:57

Copilot started reviewing on behalf of DKuleshov May 25, 2026 08:57 View session

Copilot AI reviewed May 25, 2026

View reviewed changes

Comment thread tests/Test-E2E-Mothership-QA.ps1 Outdated

Comment thread tests/Run-Tests.ps1 Outdated

Comment thread .github/workflows/test.yml Outdated

Comment thread server/docs/MOTHERSHIP-E2E-SETUP.md

Comment thread server/docs/MOTHERSHIP-E2E-SETUP.md Outdated

OgnjenGligoric added 3 commits May 25, 2026 12:42

OgnjenGligoric marked this pull request as ready for review May 25, 2026 11:07

OgnjenGligoric requested a review from andresharpe as a code owner May 25, 2026 11:07

DKuleshov requested a review from Copilot May 26, 2026 08:01

Copilot started reviewing on behalf of DKuleshov May 26, 2026 08:01 View session

Copilot AI reviewed May 26, 2026

View reviewed changes

Comment thread tests/e2e-server/specs/mothership-question-flow.spec.ts

Comment thread server/src/Dotbot.Server/TestModeEndpoints.cs

Comment thread tests/Test-E2E-Mothership-QA.ps1 Outdated

Comment thread server/docs/MOTHERSHIP-E2E-SETUP.md

Comment thread tests/Test-E2E-Mothership-QA.ps1 Outdated

DKuleshov reviewed May 26, 2026

View reviewed changes

Comment thread .github/workflows/server.yml Outdated

OgnjenGligoric added 2 commits May 27, 2026 12:50

andresharpe approved these changes May 28, 2026

View reviewed changes

andresharpe merged commit 551cce3 into andresharpe:main May 28, 2026
10 checks passed

github-project-automation Bot moved this from Inbox to Done in Dotbot v4 Roadmap May 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(e2e): Playwright E2E for all 6 Mothership question types + CI pipeline (#423)#444

feat(e2e): Playwright E2E for all 6 Mothership question types + CI pipeline (#423)#444
andresharpe merged 11 commits into
andresharpe:mainfrom
OgnjenGligoric:feature/423-e2e-playwright-mothership

OgnjenGligoric commented May 22, 2026 •

edited by DKuleshov

Loading

Uh oh!

OgnjenGligoric commented May 22, 2026 •

edited

Loading

Uh oh!

IBondarenko-iwg left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

OgnjenGligoric commented May 22, 2026 • edited by DKuleshov Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What was done

Uh oh!

OgnjenGligoric commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1. CI secrets in workflow

2. Why this was discovered — server startup error log

Uh oh!

IBondarenko-iwg left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

1. CI failure — missing Azurite connection string

2. Layer inconsistency

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

OgnjenGligoric commented May 22, 2026 •

edited by DKuleshov

Loading

OgnjenGligoric commented May 22, 2026 •

edited

Loading

IBondarenko-iwg left a comment •

edited

Loading