Testing

Current: 724 tests | 32 suites | ~2s execution time

Running tests

npm test                              # All tests
npm test -- parser.test.ts            # Single file
npm test -- --testNamePattern="name"  # Pattern match
npm test -- --watch                   # Watch mode
npm test -- --coverage                # Coverage report
npm test -- --verbose                 # Detailed output

Before committing, always run:

npm test && npm run build

Test structure

Core Action tests (12 suites)

src/__tests__/
├── parser.test.ts               # MyST parsing, frontmatter extraction
├── parser-components.test.ts    # Document component parsing
├── diff-detector.test.ts        # Change detection, section comparison
├── file-processor.test.ts       # Section matching, reconstruction, subsection duplication regression
├── heading-map.test.ts          # Heading-map CRUD, subsection regression
├── language-config.test.ts      # Language-specific rules, validation
├── integration.test.ts          # Full parse → detect → translate → reconstruct
├── e2e-fixtures.test.ts         # End-to-end fixture tests
├── component-reconstruction.test.ts  # Component assembly
├── reviewer.test.ts             # Review mode prompts, validation
├── translator.test.ts           # Translation prompts, token estimation
└── inputs.test.ts               # Mode validation, PR events, input parsing

CLI tests — Backward/Status (12 suites)

src/cli/__tests__/
├── document-comparator.test.ts  # Stage 1 triage
├── backward-evaluator.test.ts   # Stage 2 evaluation
├── section-matcher.test.ts      # Cross-language section matching
├── git-metadata.test.ts         # Git metadata extraction
├── report-generator.test.ts     # Markdown/JSON report formatting
├── schema.test.ts               # Zod schema validation
├── backward.test.ts             # Backward command integration
├── bulk-backward.test.ts        # Bulk backward processing
├── status.test.ts               # Status command
├── sync-orchestrator.test.ts    # Sync pipeline
├── pr-creator.test.ts           # PR creation
└── translator-retry.test.ts     # Retry logic

CLI tests — Review/Forward (8 suites)

src/cli/__tests__/
├── review.test.ts               # Command loading, filtering, pipeline
├── review-formatter.test.ts     # Chalk card rendering, categories
├── review-session.test.ts       # Accept/skip/reject state machine
├── issue-generator.test.ts      # Issue title, body, labels
├── issue-creator.test.ts        # gh issue create calls
├── forward.test.ts              # Forward command
├── forward-triage.test.ts       # Content-vs-i18n filter
└── forward-pr-creator.test.ts   # Git ops + PR creation

Test philosophy

Tests catch bugs in seconds, not minutes. Without tests: code → build → push → PR → Actions → check results (10–15 min). With tests: code → npm test → results (2 sec).

What we test:

Unit tests — individual functions and modules
Integration tests — full pipelines (parse → detect → translate → reconstruct)
Regression tests — previously fixed bugs stay fixed (tagged by version)
Edge cases — unusual inputs, empty documents, malformed frontmatter

What we mock:

GitHub API calls
Claude API calls
File system operations (where needed)
Terminal output formatting
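
As an illustration of mocking an external call, the sketch below injects a hand-rolled fake in place of a real API client. The `TranslationClient` interface, `translateSections` function, and `FakeTranslationClient` class are hypothetical names chosen for this example; the project's actual Claude wrapper and mocking setup may look different.

```typescript
// Hypothetical interface: the project's real Claude client may differ.
interface TranslationClient {
  translate(text: string, targetLanguage: string): Promise<string>;
}

// Hypothetical function under test: sends each section through the client.
async function translateSections(
  client: TranslationClient,
  sections: string[],
  targetLanguage: string,
): Promise<string[]> {
  const translated: string[] = [];
  for (const section of sections) {
    translated.push(await client.translate(section, targetLanguage));
  }
  return translated;
}

// Hand-rolled fake: records every call and returns a deterministic string,
// so the test needs no network access and no API key.
class FakeTranslationClient implements TranslationClient {
  calls: Array<{ text: string; targetLanguage: string }> = [];

  async translate(text: string, targetLanguage: string): Promise<string> {
    this.calls.push({ text, targetLanguage });
    return `[${targetLanguage}] ${text}`;
  }
}
```

In a test, passing a `FakeTranslationClient` to the function under test lets assertions check both the returned text and the recorded calls. The same idea can be applied to GitHub API calls and file system access.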

Writing tests

Follow the Arrange–Act–Assert pattern:

describe('ComponentName', () => {
  it('should do something specific', () => {
    // Arrange
    const input = createTestFixture();

    // Act
    const result = processInput(input);

    // Assert
    expect(result).toBe('expected');
  });
});

Guidelines:

One behavior per test
Descriptive test names — the name should explain what's being tested
No inter-test dependencies
Test behavior, not implementation details

Adding regression tests

When fixing a bug:

Write a test that reproduces the bug (should fail)
Fix the bug
Verify the test passes
Tag the test with the relevant version, in the test name or in a comment

it('should not duplicate subsections in output (v0.4.3 regression)', () => {
  const sections = [{
    heading: '## Overview',
    content: '## Overview\n\n### Subsection\n\nText.\n\n',
    subsections: [{ heading: '### Subsection', /* ... */ }]
  }];

  const result = reconstructFromSections(sections);
  expect(result.match(/### Subsection/g)).toHaveLength(1);
});

Key regression tests

v0.4.3: Subsection duplication

Bug: Subsections appeared twice in the output: once from content and once from the subsections array.

Tests: 5 in file-processor.test.ts — verify reconstructFromSections produces exactly one copy of each subsection.

Root cause: Code read subsections from section.content and also appended from section.subsections. Fix: always use contentWithoutSubsections, then append from section.subsections.
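
The root cause and fix can be sketched as follows. This is a simplified illustration, not the project's code: the `Section` shape is assumed, and `reconstructBuggy` and `reconstructFixed` are hypothetical stand-ins for the behavior before and after the fix.

```typescript
// Assumed shape; the project's real section type may carry more fields.
interface Section {
  heading: string;
  content: string;                   // full text, subsections included
  contentWithoutSubsections: string; // text before the first subsection
  subsections: Section[];
}

// Buggy variant: content already contains the subsections, so appending
// section.subsections emits each of them a second time.
function reconstructBuggy(sections: Section[]): string {
  return sections
    .map((s) => s.content + s.subsections.map((sub) => sub.content).join(''))
    .join('');
}

// Fixed variant: start from contentWithoutSubsections, then append the
// subsections recursively so each one appears exactly once.
function reconstructFixed(sections: Section[]): string {
  return sections
    .map((s) => s.contentWithoutSubsections + reconstructFixed(s.subsections))
    .join('');
}

const subsection: Section = {
  heading: '### Subsection',
  content: '### Subsection\n\nText.\n\n',
  contentWithoutSubsections: '### Subsection\n\nText.\n\n',
  subsections: [],
};

const overview: Section = {
  heading: '## Overview',
  content: '## Overview\n\n### Subsection\n\nText.\n\n',
  contentWithoutSubsections: '## Overview\n\n',
  subsections: [subsection],
};

// Counts how many times the subsection heading occurs in the output.
const countSubsections = (text: string): number =>
  (text.match(/### Subsection/g) || []).length;
```

With this fixture, `countSubsections(reconstructBuggy([overview]))` is 2 and `countSubsections(reconstructFixed([overview]))` is 1, which is the property the regression tests check.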

v0.4.3: Incomplete heading-maps

Bug: Heading-maps didn't include subsection headings.

Tests: 5 in heading-map.test.ts — verify updateHeadingMap includes subsections at all nesting levels.
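
To show what "all nesting levels" can mean in practice, here is a minimal sketch that walks a source and a target heading tree in parallel. It assumes a heading-map is a plain record from source heading to translated heading; `HeadingNode` and `buildHeadingMap` are hypothetical and do not reflect the real `updateHeadingMap` signature.

```typescript
// Hypothetical shapes for illustration; not the project's actual types.
interface HeadingNode {
  heading: string;
  subsections: HeadingNode[];
}

// Assumed format: source heading -> translated heading.
type HeadingMap = Record<string, string>;

// Pair source and target headings by position and recurse into subsections,
// so headings at every nesting level end up in the map.
function buildHeadingMap(
  source: HeadingNode[],
  target: HeadingNode[],
  map: HeadingMap = {},
): HeadingMap {
  source.forEach((sourceNode, index) => {
    const targetNode = target[index];
    if (targetNode === undefined) return; // no counterpart in the target
    map[sourceNode.heading] = targetNode.heading;
    buildHeadingMap(sourceNode.subsections, targetNode.subsections, map);
  });
  return map;
}

const englishTree: HeadingNode[] = [
  {
    heading: 'Overview',
    subsections: [
      { heading: 'Setup', subsections: [{ heading: 'Requirements', subsections: [] }] },
    ],
  },
];

const chineseTree: HeadingNode[] = [
  {
    heading: '概述',
    subsections: [
      { heading: '设置', subsections: [{ heading: '要求', subsections: [] }] },
    ],
  },
];

const headingMap = buildHeadingMap(englishTree, chineseTree);
```

A map built only from the top level would contain a single entry here; the recursive walk yields three, one per nesting level.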

v0.4.6: Section comparison

Tests: In diff-detector.test.ts — verify recursive subsection comparison works correctly.
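
One way to read "recursive subsection comparison" is sketched below. `SectionNode`, `sectionsEqual`, and `makeDocument` are hypothetical names for this example; the actual comparison logic in the diff detector may use different fields and rules.

```typescript
// Hypothetical shape for illustration; not the project's actual type.
interface SectionNode {
  heading: string;
  body: string;
  subsections: SectionNode[];
}

// Two sections are equal only if their own text matches and every
// subsection, at any depth, matches its counterpart in the same position.
function sectionsEqual(a: SectionNode, b: SectionNode): boolean {
  if (a.heading !== b.heading || a.body !== b.body) return false;
  if (a.subsections.length !== b.subsections.length) return false;
  return a.subsections.every((sub, index) => {
    const other = b.subsections[index];
    return other !== undefined && sectionsEqual(sub, other);
  });
}

// Builds a three-level document whose deepest body text is configurable.
const makeDocument = (deepBody: string): SectionNode => ({
  heading: '## Overview',
  body: 'Intro.',
  subsections: [
    {
      heading: '### Details',
      body: 'Details.',
      subsections: [{ heading: '#### Notes', body: deepBody, subsections: [] }],
    },
  ],
});
```

Comparing `makeDocument('old')` with `makeDocument('new')` returns `false` even though the top two levels are identical, which a comparison limited to the top level would miss.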

GitHub integration testing

Beyond unit tests, we validate the action end-to-end on real GitHub repositories.

Test repositories

| Repo | Role |
|------|------|
| `QuantEcon/test-translation-sync` | SOURCE (English) |
| `QuantEcon/test-translation-sync.zh-cn` | TARGET (Chinese) |

Running integration tests

./tool-test-action-on-github/test-action-on-github.sh

The script automatically:

Resets test repositories to a clean state
Closes all old PRs
Creates 24 test PRs covering different scenarios
Triggers GitHub Actions on each PR
Validates translations in the target repo

Scenarios covered: new files, section updates, deletions, subsections, root-level files, MyST directives, and heading-map updates.

When to use each

| Aspect | `npm test` | GitHub testing |
|--------|------------|----------------|
| Speed | ~2 seconds | ~2–3 minutes per scenario |
| Scope | Unit + integration | Full end-to-end workflow |
| Cost | Free | ~$0.50 per run (Claude API) |
| Use | Every commit, TDD | Pre-release validation |

Monitoring test runs

gh pr list --repo QuantEcon/test-translation-sync --label test-translation
gh pr list --repo QuantEcon/test-translation-sync.zh-cn
gh run list --repo QuantEcon/test-translation-sync
gh run view <run-id> --log

For full details on the test harness, see the test script README.

Coverage

Not covered (intentionally):

GitHub API calls (mocked)
Claude API calls (mocked)
Terminal output formatting
Development-only logging
