
Extract conflict harness implementation and policy #7

Merged: EdgeCaser merged 1 commit into main from codex/cross-model-debate-harness-impl on Apr 15, 2026


@EdgeCaser
Owner

Summary

This PR extracts the implementation/policy slice from the much larger cross-model debate harness research branch.

Included here:

  • conflict harness scripts and replay/rejudge tooling
  • conflict verdict and run schemas
  • project-local Gemini alias defaults used by the harness
  • orchestrator routing and judge-escalation guidance
  • scenario taxonomy updates on existing scenarios
  • new real-world strategy scenarios
  • focused harness tests
  • core conflict-harness spec doc

Deliberately not included here:

  • generated benchmarks/results/** artifacts
  • large replay/run corpora
  • most long-form research memos and analysis writeups
  • outreach drafts

Validation

Passed locally:

  • node tests/build-case-packet.test.mjs
  • node tests/rejudge-conflict-batch.test.mjs
  • node tests/rejudge-conflict-run.test.mjs
  • node tests/run-conflict-batch.test.mjs
  • node tests/run-conflict-harness.test.mjs
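To repeat the local validation in one step, the five commands above could be wrapped in a small script. This is a minimal sketch, not something included in this PR: the names `TEST_FILES`, `runWithNode`, and `runAll` are hypothetical, and it assumes the script is run from the repository root so the relative `tests/` paths resolve.

```javascript
// Hypothetical helper, not part of this PR: runs the harness tests listed above.
const { spawnSync } = require("node:child_process");

// Test files exactly as listed in the Validation section.
const TEST_FILES = [
  "tests/build-case-packet.test.mjs",
  "tests/rejudge-conflict-batch.test.mjs",
  "tests/rejudge-conflict-run.test.mjs",
  "tests/run-conflict-batch.test.mjs",
  "tests/run-conflict-harness.test.mjs",
];

// Default runner: spawn the current Node binary on one file and return its
// exit status (null if the process was killed by a signal).
function runWithNode(file) {
  return spawnSync(process.execPath, [file], { stdio: "inherit" }).status;
}

// Run every file in order and return the ones whose exit status was not 0.
// The `run` parameter is injectable so the loop can be checked without
// spawning real processes.
function runAll(files, run = runWithNode) {
  return files.filter((file) => run(file) !== 0);
}
```

Calling `runAll(TEST_FILES)` returns the list of failing files, so a wrapper could set `process.exitCode = runAll(TEST_FILES).length === 0 ? 0 : 1;` to fail when any test fails. Every file is run even after an earlier failure, which keeps the report complete at the cost of a slightly longer run.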

Why This Exists

PR #6 captures the full research branch, but it is too large to review safely as a single merge unit. This PR is the reviewable implementation/policy subset intended to land first.

Owner Author

Recommended landing PR.

This is the clean implementation/policy extraction from PR #6 and is the one that should merge first.

Why this is the right merge unit:

  • 38 changed files instead of 6,707
  • no generated benchmarks/results/** corpus
  • focused harness/test/schema/policy review surface
  • validated locally with the harness test suite listed in the PR body

After this lands, the remaining question is whether any research memos or benchmark artifacts from PR #6 should come over in smaller follow-up PRs.

@EdgeCaser merged commit 52cc5d3 into main on Apr 15, 2026. 2 checks passed.
