refactor: unify next api backend proxies by jerry609 · Pull Request #395 · jerry609/PaperBot

jerry609 · 2026-03-15T04:41:22Z

What changed

introduce a shared Next API backend proxy helper for JSON, text/HTML, binary, and SSE passthrough cases
migrate hand-written backend proxy routes under web/src/app/api onto the shared helper path
remove dead runbook proxy routes that do not exist on the backend
add contract tests to keep route handlers from regressing back to direct fetch wrappers

Why

reduce duplicated proxy logic and drift across auth, runbook, studio, paperscool, newsletter, and export routes
keep frontend/backend route contracts easier to review and maintain
make route-level failures and auth handling consistent

Validation

pytest -q tests/unit/test_next_backend_proxy_contracts.py
cd web && npm exec vitest run src/app/api/_utils/backend-proxy.test.ts

Notes

cd web && npm exec tsc --noEmit still reports an existing unrelated failure in src/lib/agent-events/parsers.test.ts due to a missing type field in a test fixture. This PR does not touch that area.

Summary by CodeRabbit

Release Notes

New Features
- Real-time agent activity dashboard with live event streaming.
- Three-panel interface: tasks panel, activity feed, and file change tracker.
- Kanban board for task visualization with agent status badges and error tracking.
- Agent status indicators with connection state monitoring.
Documentation
- Added comprehensive architecture, stack, and codebase planning documentation.

vercel · 2026-03-15T04:41:26Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
paper-bot	Ready	Preview, Comment	Mar 15, 2026 4:54am

coderabbitai · 2026-03-15T04:41:47Z

Warning

Rate limit exceeded

@jerry609 has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 13 minutes and 20 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: b9b40db4-116c-4727-83cb-bc3253872747

📥 Commits

Reviewing files that changed from the base of the PR and between f899732 and 0450c4d.

📒 Files selected for processing (36)

tests/unit/test_next_backend_proxy_contracts.py
web/src/app/api/_utils/auth-headers.ts
web/src/app/api/_utils/backend-proxy.test.ts
web/src/app/api/_utils/backend-proxy.ts
web/src/app/api/auth/forgot-password/route.ts
web/src/app/api/auth/login-check/route.ts
web/src/app/api/auth/me/change-password/route.ts
web/src/app/api/auth/me/route.ts
web/src/app/api/auth/register/route.ts
web/src/app/api/auth/reset-password/route.ts
web/src/app/api/gen-code/route.ts
web/src/app/api/newsletter/unsubscribe/[token]/route.ts
web/src/app/api/papers/export/route.ts
web/src/app/api/research/_base.ts
web/src/app/api/research/paperscool/analyze/route.ts
web/src/app/api/research/paperscool/daily/route.ts
web/src/app/api/research/paperscool/sessions/[sessionId]/route.ts
web/src/app/api/research/repro/context/route.ts
web/src/app/api/runbook/allowed-dirs/route.ts
web/src/app/api/runbook/changes/route.ts
web/src/app/api/runbook/delete/route.ts
web/src/app/api/runbook/diff/route.ts
web/src/app/api/runbook/file/route.ts
web/src/app/api/runbook/files/route.ts
web/src/app/api/runbook/project-dir/prepare/route.ts
web/src/app/api/runbook/revert-hunks/route.ts
web/src/app/api/runbook/revert-project/route.ts
web/src/app/api/runbook/revert/route.ts
web/src/app/api/runbook/runs/[runId]/route.ts
web/src/app/api/runbook/smoke/route.ts
web/src/app/api/runbook/snapshots/[snapshotId]/route.ts
web/src/app/api/runbook/snapshots/route.ts
web/src/app/api/sandbox/status/route.ts
web/src/app/api/studio/chat/route.ts
web/src/app/api/studio/cwd/route.ts
web/src/app/api/studio/status/route.ts

📝 Walkthrough

Walkthrough

The PR fundamentally restructures PaperBot from a Claude Code-specific orchestration system to an agent-agnostic dashboard. It introduces a unified EventType vocabulary and event envelope system, implements real-time SSE-based event streaming with a bounded Zustand store, develops a comprehensive three-panel UI dashboard with activity feed and file diffs, adds a Kanban board for task visualization with Codex delegation tracking, and consolidates Next.js API routes via a centralized backend proxy utility. Extensive planning documentation details v1.2 milestones and the shift toward agent-agnostic architecture.

Changes

Cohort / File(s)	Summary
Planning & Architecture Documentation `.planning/PROJECT.md`, `.planning/REQUIREMENTS.md`, `.planning/ROADMAP.md`, `.planning/STATE.md`, `.planning/codebase/ARCHITECTURE.md`, `.planning/codebase/CONCERNS.md`, `.planning/codebase/CONVENTIONS.md`, `.planning/codebase/INTEGRATIONS.md`, `.planning/codebase/STACK.md`, `.planning/codebase/STRUCTURE.md`, `.planning/codebase/TESTING.md`	Comprehensive documentation updates reframing project from Claude Code-specific to agent-agnostic dashboard. Introduces architecture layers, tech stack details, conventions, integrations, concerns/tech-debt catalog, and testing patterns. New v1.2 DeepCode Agent Dashboard milestones replace v2.0 PostgreSQL migration narrative.
Phase 08 Planning: Agent Event Vocabulary `.planning/phases/08-agent-event-vocabulary/*`	Phase planning, research, validation, and verification docs detailing EventType constants, lifecycle/tool-call event helpers, frontend SSE hook, Zustand store, and three UI components (ActivityFeed, AgentStatusPanel, ToolCallTimeline). Test-driven implementation roadmap.
Phase 09 Planning: Three-Panel Dashboard `.planning/phases/09-three-panel-dashboard/*`	Phase planning for file-change tracking, three-panel UI layout (TasksPanel, FileListPanel, InlineDiffPanel), Kanban integration, and sidebar navigation updates. Documents data contracts for file touch tracking and per-panel filtering.
Phase 10 Planning: Agent-Board Codex Bridge `.planning/phases/10-agent-board-codex-bridge/*`	Phase planning for Codex delegation events (CODEX_DISPATCHED, CODEX_ACCEPTED, CODEX_COMPLETED, CODEX_FAILED), Kanban board implementation with agent identity badges, Codex error surfacing, and Claude Code sub-agent definition.
Research & Strategic Documentation `.planning/research/ARCHITECTURE.md`, `.planning/research/FEATURES.md`, `.planning/research/PITFALLS.md`, `.planning/research/STACK.md`, `.planning/research/SUMMARY.md`	Updated research docs shifting focus from PostgreSQL migration to agent-agnostic proxy dashboard. Details adapter layer patterns, multi-agent translation, real-time event normalization, and scaling considerations. Reframes technology stack and identifies key architectural pitfalls.
Python Event System `src/paperbot/application/collaboration/message_schema.py`, `src/paperbot/application/collaboration/agent_events.py`, `src/paperbot/mcp/tools/_audit.py`	Adds EventType class with lifecycle, tool-call, file-change, and Codex delegation event constants. Introduces `make_lifecycle_event()` and `make_tool_call_event()` helpers for standardized envelope creation. Updates audit logging to use EventType constants.
Python Codex Integration `src/paperbot/api/routes/agent_board.py`, `src/paperbot/repro/orchestrator.py`	Adds Codex delegation event emission (`_emit_codex_event()`) in agent board routes at dispatch, acceptance, completion, and failure points. Introduces `_should_overflow_to_codex()` helper for environment-controlled Codex overflow routing.
Frontend Agent Events Data Layer `web/src/lib/agent-events/types.ts`, `web/src/lib/agent-events/parsers.ts`, `web/src/lib/agent-events/store.ts`, `web/src/lib/agent-events/useAgentEvents.ts`	TypeScript types mirroring Python EventType vocabulary (AgentLifecycleEvent, ToolCallEvent, FileTouchedEntry, CodexDelegationEntry). Parsers transform raw envelopes into typed display objects. Zustand store maintains bounded feeds (200 items), tool calls (100), file touches (20 runs), codex delegations, and Kanban tasks with deduplication logic.
Frontend Agent Events UI Components `web/src/components/agent-events/ActivityFeed.tsx`, `web/src/components/agent-events/AgentStatusPanel.tsx`, `web/src/components/agent-events/ToolCallTimeline.tsx`	Real-time activity feed with color-coded event types; per-agent status grid with working-state animation; chronological tool-call timeline with duration, arguments, and error display. Components consume central store via Zustand.
Frontend Dashboard UI Components `web/src/app/agent-dashboard/page.tsx`, `web/src/components/agent-dashboard/TasksPanel.tsx`, `web/src/components/agent-dashboard/FileListPanel.tsx`, `web/src/components/agent-dashboard/InlineDiffPanel.tsx`, `web/src/components/agent-dashboard/KanbanBoard.tsx`, `web/src/components/layout/Sidebar.tsx`	Three-panel agent dashboard page with TasksPanel (left rail, run selector), ActivityFeed (center), FileListPanel (right, with InlineDiffPanel detail). Separate Kanban board view with five columns (Planned, In Progress, Review, Done, Blocked), agent identity badges, and Codex failure indicators. Sidebar navigation entry for Agent Dashboard.
API Proxy Refactoring `web/src/app/api/_utils/backend-proxy.ts`, `web/src/app/api/_utils/backend-proxy.test.ts`, `web/src/app/api/_utils/auth-headers.ts`, `web/src/app/api/auth/`, `web/src/app/api/gen-code/`, `web/src/app/api/newsletter/`, `web/src/app/api/papers/`, `web/src/app/api/research/`, `web/src/app/api/runbook/`, `web/src/app/api/sandbox/`, `web/src/app/api/studio/`	New centralized backend proxy utility (`proxyJson`, `proxyText`, `proxyBinary`, `proxyStream`) replacing inline fetch logic across 20+ Next.js API routes. Consolidates header auth, timeout handling, error responses, and content negotiation. Removes dead routes (`/api/runbook/runs/[runId]`, `/api/runbook/smoke`).
Testing & Validation `tests/unit/test_agent_events_vocab.py`, `tests/unit/test_agent_board_codex_events.py`, `tests/unit/test_codex_overflow.py`, `tests/unit/test_next_backend_proxy_contracts.py`, `web/src/lib/agent-events/parsers.test.ts`, `web/src/lib/agent-events/store.test.ts`, `web/src/components/agent-dashboard/KanbanBoard.test.tsx`, `web/vitest.config.ts`	Python unit tests for EventType constants, lifecycle/tool-call helpers, Codex event emission, and overflow routing. TypeScript tests for parsers (null-safety, mappings), store (feed/tool caps, file deduplication, 20-run eviction), and KanbanBoard (columns, badges, error states). Proxy contract tests enforce shared backend proxy usage and validate removal of dead routes. Vitest jsdom environment configured for component tests.
Configuration & Package Updates `web/package.json`	Added dependencies: `@radix-ui/react-popover`, `next-auth`. Added dev dependencies: `@testing-library/jest-dom`, `@testing-library/react`, `jsdom`.

Sequence Diagram

sequenceDiagram
    participant Agent as Agent/MCP
    participant EventBus as EventBus<br/>(Python)
    participant SSE as SSE Endpoint<br/>(/api/events/stream)
    participant Hook as useAgentEvents<br/>(React Hook)
    participant Store as Zustand Store<br/>(agent-events)
    participant UI as UI Components<br/>(Activity, Dashboard)

    Agent->>EventBus: emit_lifecycle_event<br/>(AGENT_STARTED)
    Agent->>EventBus: emit_tool_call_event<br/>(TOOL_RESULT)
    EventBus->>SSE: append event to stream
    
    Hook->>SSE: readSSE() with AbortController
    SSE-->>Hook: text/event-stream message
    
    Hook->>Hook: parseActivityItem(raw)
    Hook->>Hook: parseToolCall(raw)
    Hook->>Hook: parseFileTouched(raw)
    Hook->>Hook: parseCodexDelegation(raw)
    
    Hook->>Store: addFeedItem(parsed)
    Hook->>Store: addToolCall(parsed)
    Hook->>Store: addFileTouched(parsed)
    Hook->>Store: addCodexDelegation(parsed)
    
    Store-->>UI: feed (capped 200)
    Store-->>UI: toolCalls (capped 100)
    Store-->>UI: filesTouched (20 runs)
    Store-->>UI: codexDelegations (capped 100)
    
    UI->>UI: ActivityFeed renders feed items
    UI->>UI: ToolCallTimeline displays calls
    UI->>UI: FileListPanel lists changes
    UI->>UI: KanbanBoard shows task status

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

feat(phases 2-7): MCP tools, resources, transport, agent skills, EventBus SSE #392: Directly overlaps on EventType constants, event-vocabulary implementation in message_schema.py and agent_events.py, SSE/EventBus plumbing, and /api/events/stream route infrastructure for real-time event streaming.
feat(Claude coplit): 增加Claude可视化，删除local docker，E2B，codex broker #136: Shares agent/Studio surface modifications, EventType and Codex event handling, studio_chat integration, and removal of deprecated executor exports (E2B/Docker/Codex).
Fix/studio output dir validation p2c design #148: Related through overlapping backend repro_context API routes and frontend proxy/route refactoring patterns for P2C (Paper-to-Context) workflows.

Poem

🐰 A dashboard so fine, now agent-agnostic,
Events stream real-time, the flow is majestic,
Kanban boards shimmer with tasks in their place,
While files dance freely through diffs we embrace,
From Codex to Claude, no vendor in sight—
Just pure observation, luminous and bright! ✨

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch refactor/pr1-next-api-proxy

github-actions · 2026-03-15T04:42:53Z

Vercel Preview

Preview: https://paper-qlt775u82-jerrys-projects-bfbe8245.vercel.app
Resolved from Vercel Git Deployments and reused for CI smoke checks.
Automation bypass is expected via VERCEL_AUTOMATION_BYPASS_SECRET for CI and Playwright.

gemini-code-assist · 2026-03-15T04:43:45Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly refactors the Next.js API layer by introducing a unified backend proxy, streamlining frontend-to-backend communication and reducing code duplication. It also lays substantial groundwork for an agent-agnostic dashboard, enhancing real-time observability of agent activities, tool calls, and file changes through new frontend components and an extended event vocabulary. The changes include new UI elements for task management and detailed codebase documentation, preparing the project for advanced multi-agent orchestration features.

Highlights

Unified Next.js API Backend Proxy: Introduced a shared utility (backend-proxy.ts) to centralize backend API calls from Next.js API routes, supporting JSON, text/HTML, binary, and Server-Sent Events (SSE) passthrough. This significantly reduces duplicated logic across various frontend-to-backend communication paths.
Migration of Next.js API Routes: Migrated numerous existing Next.js API routes (including authentication, code generation, newsletter, papers export, research, runbook, and studio endpoints) to utilize the new shared backend proxy helper, ensuring consistent handling of requests, authentication, and error responses.
Enhanced Agent Event Vocabulary and Frontend Display: Expanded the Python EventType constants with new lifecycle, tool call, file change, and Codex delegation events. Developed a comprehensive frontend event consumption layer including TypeScript types, parsers, a Zustand store, an SSE hook, and real-time display components for activity feed, agent status, and tool call timelines.
New Agent Dashboard with Three-Panel Layout: Implemented a new /agent-dashboard page featuring a three-panel IDE-style layout. This includes a TasksPanel (agent status, run selector), an ActivityFeed (real-time events), and a FileListPanel (per-task file changes with inline diff viewer). The layout is resizable and persists across sessions.
Codex Delegation and Kanban Board Integration: Added Codex delegation event constants and emission logic in the backend, along with a stub for Paper2Code overflow routing to Codex. A new KanbanBoard component was introduced to visualize agent tasks by status, including agent identity badges and prominent display of Codex-specific error states. A Claude Code sub-agent definition (codex-worker.md) was created to facilitate delegation.
Comprehensive Codebase Documentation: New detailed documentation files were added under .planning/codebase/ covering the project's architecture, technical concerns, coding conventions, external integrations, technology stack, and testing patterns. This provides a foundational understanding of the codebase for future development and maintenance.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Changelog

.planning/PROJECT.md
- Updated project description to reflect a pivot towards an agent-agnostic dashboard named DeepCode.
- Expanded 'Active' features list to include agent-agnostic proxy layer, multi-agent adapter layer, and dashboard control surface.
- Modified 'Out of Scope' items to align with the agent-agnostic vision, emphasizing visualization over orchestration.
- Updated 'Context' and 'Constraints' sections to reflect the new architectural direction and agent-agnostic principles.
- Renamed 'Current Milestone' to 'v1.2 DeepCode Agent Dashboard' and detailed its goal and target features.
.planning/REQUIREMENTS.md
- Updated core value to 'agent-agnostic dashboard'.
- Marked several v1.1 Event System, Dashboard, and File Visualization requirements as complete.
- Added a new 'v1.2 Requirements' section detailing needs for Agent Adapter Layer, Activity Monitoring, Chat & Control, Session Management, Visualization, and Domain Integration.
- Revised 'Out of Scope' features to align with the agent-agnostic dashboard vision, clarifying what PaperBot will and will not do.
.planning/ROADMAP.md
- Added 'v1.2 DeepCode Agent Dashboard' milestone with phases 18-23.
- Updated status of Phase 9 ('Three-Panel Dashboard') to 'Complete'.
- Updated status of Phase 8 ('Agent Event Vocabulary') to 'In Progress'.
- Updated status of Phase 10 ('Agent Board + Codex Bridge') to 'In Progress'.
- Detailed plans for Phase 8, Phase 9, and Phase 10, including goals, dependencies, requirements, success criteria, and specific tasks.
.planning/STATE.md
- Updated milestone to 'v1.1 Agent Orchestration Dashboard' and status to 'verifying'.
- Updated 'stopped_at' to 'Completed 10-02-PLAN.md' and 'last_updated' to '2026-03-15T04:12:57.059Z'.
- Revised progress metrics to reflect 7 completed phases out of 21 total, and 15 completed plans out of 16 total.
- Updated 'Core value' and 'Current focus' to align with the v1.2 DeepCode Agent Dashboard.
- Updated 'Current Position' to Phase 9, 'Active' status, awaiting human verification.
- Added new performance metrics for recent phases (08, 09, 10).
- Added new pending todos and updated session continuity information.
.planning/codebase/ARCHITECTURE.md
- Added a new document detailing the overall architecture, pattern overview, layered structure (Domain, Application, Core, Agent, Infrastructure, Paper2Code, API, Context Engine, Memory, MCP Server), data flow, key abstractions, entry points, error handling, and cross-cutting concerns.
.planning/codebase/CONCERNS.md
- Added a new document outlining codebase concerns including tech debt (N+1 queries, rate limiting, incomplete TODOs), known bugs (exception handling, SSE client loss), security considerations (GDPR, access control, path traversal), performance bottlenecks (large files, memory growth, vector search), fragile areas (broad exceptions, circular imports, untracked async tasks), scaling limits (single-user DB, in-memory caches, ARQ blocking), dependencies at risk, and test coverage gaps.
.planning/codebase/CONVENTIONS.md
- Added a new document detailing coding conventions for naming patterns (files, functions, variables, types), code style (formatting, linting, isort), import organization, error handling, logging, comments (when to, docstrings), function design, module design, and file structure/length.
.planning/codebase/INTEGRATIONS.md
- Added a new document detailing external integrations including APIs & External Services (LLM providers, academic sources, social, code repos, PDF processing, university libraries), Data Storage (databases, file storage, caching, vector search), Authentication & Identity, Monitoring & Observability, CI/CD & Deployment, Environment Configuration, Webhooks & Callbacks, and MCP Server Integration.
.planning/codebase/STACK.md
- Added a new document detailing the technology stack including languages, runtime, package managers, frameworks (backend, LLM/AI, web frontend, CLI, testing, build/dev), key dependencies (critical, infrastructure, PDF, security, data/AI, sandbox, notifications, MCP, utilities), configuration, and platform requirements.
.planning/codebase/STRUCTURE.md
- Added a new document detailing the codebase structure including directory layout, directory purposes, key file locations, naming conventions, where to add new code, and special directories.
.planning/codebase/TESTING.md
- Added a new document detailing testing patterns including test framework, file organization, test structure, mocking, fixtures and factories, coverage, test types (unit, integration, E2E, eval smoke/memory), common patterns, and test examples by module.
.planning/phases/08-agent-event-vocabulary/08-01-PLAN.md
- Added a new plan for defining Python event vocabulary constants and helper functions, including TDD tasks for EventType, lifecycle, and tool call helpers, and migration of _audit.py.
.planning/phases/08-agent-event-vocabulary/08-01-SUMMARY.md
- Added a summary for Phase 8 Plan 01, detailing the addition of EventType constants, make_lifecycle_event/make_tool_call_event helpers, and migration of _audit.py.
.planning/phases/08-agent-event-vocabulary/08-02-PLAN.md
- Added a new plan for building the frontend event consumption layer, including TypeScript types, parser functions, Zustand store, SSE hook, display components (ActivityFeed, AgentStatusPanel, ToolCallTimeline), and a test harness page.
.planning/phases/08-agent-event-vocabulary/08-02-SUMMARY.md
- Added a summary for Phase 8 Plan 02, detailing the implementation of the frontend agent event consumer layer.
.planning/phases/08-agent-event-vocabulary/08-RESEARCH.md
- Added a research document for Phase 8, covering standard stack, architecture patterns (Python vocabulary, lifecycle helper, TypeScript types, Zustand store, useAgentEvents hook, parser helpers), anti-patterns, common pitfalls, code examples, state of the art, open questions, and validation architecture.
.planning/phases/08-agent-event-vocabulary/08-VALIDATION.md
- Added a validation strategy document for Phase 8, detailing test infrastructure, sampling rate, per-task verification map, wave 0 requirements, and manual-only verifications.
.planning/phases/08-agent-event-vocabulary/08-VERIFICATION.md
- Added a verification report for Phase 8, confirming the achievement of goals, required artifacts, key link verification, requirements coverage, anti-patterns found, test suite results, and human verification requirements.
.planning/phases/09-three-panel-dashboard/09-01-PLAN.md
- Added a new plan for extending the agent event data layer with file-change tracking, including Python EventType.FILE_CHANGE, TypeScript FileTouchedEntry, parseFileTouched() parser, Zustand store extension, and SSE hook wiring.
.planning/phases/09-three-panel-dashboard/09-01-SUMMARY.md
- Added a summary for Phase 9 Plan 01, detailing the implementation of the file tracking data layer for the three-panel dashboard.
.planning/phases/09-three-panel-dashboard/09-02-PLAN.md
- Added a new plan for building the three-panel agent dashboard UI, including TasksPanel, FileListPanel, InlineDiffPanel components, AgentStatusPanel compact mode, and sidebar navigation link.
.planning/phases/09-three-panel-dashboard/09-02-SUMMARY.md
- Added a summary for Phase 9 Plan 02, detailing the implementation of the three-panel agent dashboard.
.planning/phases/09-three-panel-dashboard/09-RESEARCH.md
- Added a research document for Phase 9, covering standard stack, architecture patterns (three-panel layout, store extension for file tracking, FileListPanel, InlineDiffPanel, TasksPanel, SplitPanels API), anti-patterns, common pitfalls, code examples, state of the art, open questions, and validation architecture.
.planning/phases/09-three-panel-dashboard/09-VALIDATION.md
- Added a validation strategy document for Phase 9, detailing test infrastructure, sampling rate, per-task verification map, wave 0 requirements, and manual-only verifications.
.planning/phases/09-three-panel-dashboard/09-VERIFICATION.md
- Added a verification report for Phase 9, confirming the achievement of goals, required artifacts, key link verification, requirements coverage, anti-patterns found, test suite results, and human verification requirements.
.planning/phases/10-agent-board-codex-bridge/10-01-PLAN.md
- Added a new plan for implementing EventType constants, a delegation event emission helper, and an overflow stub for Codex delegation.
.planning/phases/10-agent-board-codex-bridge/10-01-SUMMARY.md
- Added a summary for Phase 10 Plan 01, detailing the implementation of Codex delegation EventType constants and emission helper.
.planning/phases/10-agent-board-codex-bridge/10-02-PLAN.md
- Added a new plan for building the KanbanBoard component with agent badges and Codex error surfacing, and extending the Zustand store and SSE hook for delegation events.
.planning/phases/10-agent-board-codex-bridge/10-02-SUMMARY.md
- Added a summary for Phase 10 Plan 02, detailing the implementation of the KanbanBoard and Codex delegation events.
.planning/phases/10-agent-board-codex-bridge/10-03-PLAN.md
- Added a new plan for creating the codex-worker.md sub-agent definition and wiring the KanbanBoard into the agent dashboard page with a view-mode toggle.
.planning/phases/10-agent-board-codex-bridge/10-RESEARCH.md
- Added a research document for Phase 10, covering standard stack, architecture patterns (Kanban board, wiring Kanban, Codex error display, codex-worker.md, delegation event types, frontend parser, Paper2Code overflow), anti-patterns, common pitfalls, code examples, state of the art, open questions, and validation architecture.
.planning/phases/10-agent-board-codex-bridge/10-VALIDATION.md
- Added a validation strategy document for Phase 10, detailing test infrastructure, sampling rate, per-task verification map, wave 0 requirements, and manual-only verifications.
.planning/research/ARCHITECTURE.md
- Updated the architecture research document to focus on 'v1.2 Agent-Agnostic Dashboard', outlining system overview, component responsibilities, recommended project structure, architectural patterns (adapter interface, subprocess adapter, event routing, control surface, lifecycle management, hybrid discovery), data flow, integration points, suggested build order, and scaling considerations.
.planning/research/FEATURES.md
- Updated the feature research document to focus on 'v1.2 DeepCode Agent Dashboard', outlining scope, table stakes features (real-time stream, tool call log, file diff, session list/detail, status, chat, token cost, error surfacing, connection status), differentiators (agent-agnostic proxy, hybrid discovery, team visualization, control surface, HITL, Paper2Code integration, MCP tool visibility, session replay), anti-features, feature dependencies, MVP definition, and competitor analysis.
.planning/research/PITFALLS.md
- Updated the pitfalls research document to focus on 'v1.2 Agent Dashboard', outlining critical pitfalls (subprocess PTY absence, context reloading, abstraction collapse, SSE reconnection, race conditions, hardcoding agent detection, multi-session chaos), technical debt patterns, integration gotchas, performance traps, security mistakes, UX pitfalls, 'looks done but isn't' checklist, recovery strategies, and pitfall-to-phase mapping.
.planning/research/STACK.md
- Updated the technology stack research document to focus on 'v1.2 Agent Dashboard Additions', outlining new additions for Python backend (none), Node.js/Next.js (agent SDKs, node-pty, ws), and web frontend (XTerm WebSocket addon), installation changes, alternatives considered, what NOT to install, agent protocol details, and version compatibility.
src/paperbot/api/routes/agent_board.py
- Added _get_event_log_from_container helper for lazy event log retrieval.
- Added _emit_codex_event async helper to emit Codex delegation lifecycle events.
- Integrated _emit_codex_event calls at various points in dispatch_task and _execute_task_stream for codex_dispatched, codex_accepted, codex_failed, and codex_completed events.
src/paperbot/application/collaboration/agent_events.py
- Added a new file with make_lifecycle_event and make_tool_call_event helper functions to produce AgentEventEnvelope instances for common event families.
src/paperbot/application/collaboration/message_schema.py
- Added EventType class with string constants for agent lifecycle (started, working, completed, error), tool calls (call, result, error), file changes (file_change), and Codex delegation (dispatched, accepted, completed, failed).
- Documented existing event types for completeness.
src/paperbot/mcp/tools/_audit.py
- Updated log_tool_call to use EventType.TOOL_ERROR and EventType.TOOL_RESULT constants instead of raw string literals.
src/paperbot/repro/orchestrator.py
- Added _should_overflow_to_codex function to check the PAPERBOT_CODEX_OVERFLOW_THRESHOLD environment variable for Codex delegation.
tests/unit/test_agent_board_codex_events.py
- Added a new unit test file for _emit_codex_event helper, including tests for event log appending, event type setting, and silent returns on unavailable event log.
tests/unit/test_agent_events_vocab.py
- Added a new unit test file for EventType constants and agent event helpers, covering lifecycle events, tool call events, _audit.py constant usage, file change event type, and Codex delegation event types.
tests/unit/test_codex_overflow.py
- Added a new unit test file for _should_overflow_to_codex function, testing its behavior based on the PAPERBOT_CODEX_OVERFLOW_THRESHOLD environment variable.
tests/unit/test_next_backend_proxy_contracts.py
- Added a new unit test file to ensure Next.js API routes use shared backend proxy helpers and that dead runbook proxy routes are removed.
web/package-lock.json
- Updated package-lock.json to reflect changes in dependencies, including new @radix-ui/react-popover, next-auth, @testing-library/jest-dom, @testing-library/react, jsdom, and various transitive dependencies.
web/package.json
- Added @radix-ui/react-popover for UI components.
- Added next-auth for authentication.
- Added @testing-library/jest-dom and @testing-library/react for frontend testing.
- Added jsdom as a dev dependency for Vitest component tests.
web/src/app/agent-dashboard/page.tsx
- Added a new client-side page for the Agent Dashboard, integrating useAgentEvents, SplitPanels, TasksPanel, ActivityFeed, and FileListPanel to create a three-panel layout.
web/src/app/agent-events/page.tsx
- Added a new client-side debug page for Agent Events, mounting useAgentEvents and displaying ActivityFeed, AgentStatusPanel, and ToolCallTimeline components.
web/src/app/api/_utils/auth-headers.ts
- Modified backendBaseUrl to also check process.env.PAPERBOT_API_BASE_URL as a fallback.
web/src/app/api/_utils/backend-proxy.test.ts
- Added a new test file for the backend-proxy utility functions, covering JSON, text, binary, and stream proxying with authentication, error handling, and header preservation.
web/src/app/api/_utils/backend-proxy.ts
- Added a new file containing shared backend proxy helper functions (proxyJson, proxyText, proxyBinary, proxyStream) to centralize API calls, handle authentication, timeouts, and content types.
web/src/app/api/auth/forgot-password/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts.
web/src/app/api/auth/login-check/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts with a custom error handler.
web/src/app/api/auth/me/change-password/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts with authentication.
web/src/app/api/auth/me/route.ts
- Refactored GET, PATCH, and DELETE handlers to use proxyJson from backend-proxy.ts with authentication.
web/src/app/api/auth/register/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts.
web/src/app/api/auth/reset-password/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts.
web/src/app/api/gen-code/route.ts
- Refactored POST handler to use proxyStream from backend-proxy.ts for SSE streaming with authentication.
web/src/app/api/newsletter/unsubscribe/[token]/route.ts
- Refactored GET handler to use proxyText from backend-proxy.ts with custom error handling for HTML responses.
web/src/app/api/papers/export/route.ts
- Refactored GET handler to use proxyBinary from backend-proxy.ts for binary file downloads.
web/src/app/api/research/_base.ts
- Refactored proxyJson to use the new proxyBackendJson from backend-proxy.ts with authentication.
web/src/app/api/research/paperscool/analyze/route.ts
- Refactored POST handler to use proxyStream from backend-proxy.ts for SSE streaming.
web/src/app/api/research/paperscool/daily/route.ts
- Refactored POST handler to use proxyStream from backend-proxy.ts for SSE streaming with JSON fallback and authentication.
web/src/app/api/research/paperscool/sessions/[sessionId]/route.ts
- Refactored GET handler to use proxyText from backend-proxy.ts for session data.
web/src/app/api/research/repro/context/route.ts
- Refactored POST handler to use proxyStream from backend-proxy.ts for SSE streaming with authentication.
web/src/app/api/runbook/allowed-dirs/route.ts
- Refactored GET and POST handlers to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/changes/route.ts
- Refactored GET handler to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/delete/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts with authentication.
web/src/app/api/runbook/diff/route.ts
- Refactored GET handler to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/file/route.ts
- Refactored GET and POST handlers to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/files/route.ts
- Refactored GET handler to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/project-dir/prepare/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/revert-hunks/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/revert-project/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts with authentication.
web/src/app/api/runbook/revert/route.ts
- Refactored POST handler to use proxyJson from backend-proxy.ts.
web/src/app/api/runbook/runs/[runId]/route.ts
- Removed dead runbook proxy route.
web/src/app/api/runbook/smoke/route.ts
- Removed dead runbook proxy route.
web/src/app/api/sandbox/status/route.ts
- Refactored GET handler to use proxyJson from backend-proxy.ts.
web/src/app/api/studio/chat/route.ts
- Refactored POST handler to use proxyStream from backend-proxy.ts with authentication.
web/src/app/api/studio/cwd/route.ts
- Refactored GET handler to use proxyJson from backend-proxy.ts with custom error handling.
web/src/app/api/studio/status/route.ts
- Refactored GET handler to use proxyJson from backend-proxy.ts with custom error handling.
web/src/components/agent-dashboard/FileListPanel.tsx
- Added a new client-side component for displaying a list of files touched by agents, with status indicators and inline diff navigation.
web/src/components/agent-dashboard/InlineDiffPanel.tsx
- Added a new client-side component for displaying inline file differences, wrapping DiffViewer with back navigation and fallback content.
web/src/components/agent-dashboard/KanbanBoard.test.tsx
- Added a new test file for the KanbanBoard component, covering column rendering, agent identity badges, error badges, and helper functions.
web/src/components/agent-dashboard/KanbanBoard.tsx
- Added a new client-side component for a Kanban-style task board, displaying tasks across five columns with agent identity badges and Codex-specific error surfacing.
web/src/components/agent-dashboard/TasksPanel.tsx
- Added a new client-side component for the left rail of the agent dashboard, displaying compact agent status and a scrollable list of recent runs.
web/src/components/agent-events/ActivityFeed.tsx
- Added a new client-side component for displaying a real-time, scrolling activity feed of agent events, with color-coded event types and formatted timestamps.
web/src/components/agent-events/AgentStatusPanel.tsx
- Added a new client-side component for displaying agent lifecycle statuses, with status icons, color-coding, and a compact mode for embedded use.
web/src/components/agent-events/ToolCallTimeline.tsx
- Added a new client-side component for displaying a timeline of agent tool calls, including tool name, arguments, result summary, duration, and error status.
web/src/components/layout/Sidebar.tsx
- Added Monitor icon import from lucide-react.
- Added 'Agent Dashboard' entry to the sidebar navigation with the Monitor icon and a link to /agent-dashboard.
web/src/lib/agent-events/parsers.test.ts
- Added a new test file for agent event parsers, covering parseActivityItem, parseAgentStatus, parseToolCall, parseFileTouched, and parseCodexDelegation.
web/src/lib/agent-events/parsers.ts
- Added a new file with client-side parsing logic for agent events, including parseActivityItem, deriveHumanSummary, parseAgentStatus, parseToolCall, parseFileTouched, and parseCodexDelegation.
web/src/lib/agent-events/store.test.ts
- Added a new test file for the Zustand agent event store, covering setConnected, addFeedItem, addToolCall, updateAgentStatus, addFileTouched, addCodexDelegation, and upsertKanbanTask.
web/src/lib/agent-events/store.ts
- Added a new file with a Zustand store (useAgentEventStore) for managing agent event state, including connection status, activity feed, agent statuses, tool calls, file changes, Codex delegations, and Kanban tasks, with capping and eviction logic.
web/src/lib/agent-events/types.ts
- Added a new file defining TypeScript types mirroring the Python EventType vocabulary, including AgentStatus, AgentLifecycleEvent, ToolCallEvent, AgentEventEnvelopeRaw, ActivityFeedItem, AgentStatusEntry, ToolCallEntry, FileChangeStatus, FileTouchedEntry, and CodexDelegationEntry.
web/src/lib/agent-events/useAgentEvents.ts
- Added a new client-side React hook (useAgentEvents) for consuming Server-Sent Events (SSE) from the backend, parsing them, and dispatching updates to the Zustand agent event store.
web/vitest.config.ts
- Modified Vitest configuration to use jsdom environment for src/components/**/*.test.tsx and src/components/**/*.test.ts files, while retaining node environment for others.

Activity

The pull request introduces a significant refactoring of Next.js API routes to use a new shared backend proxy, reducing code duplication and improving consistency.
Extensive planning and research documents have been added, detailing the architecture, concerns, conventions, integrations, stack, and testing patterns for the codebase.
New features for an agent-agnostic dashboard have been implemented, including real-time activity feeds, agent status panels, tool call timelines, file change tracking, and a Kanban board for Codex delegation.
Unit tests have been added for the new backend proxy utilities, agent event vocabulary, Codex overflow logic, and frontend Kanban board components, ensuring code quality and preventing regressions.
The project roadmap and state have been updated to reflect the progress on v1.1 and the introduction of the v1.2 DeepCode Agent Dashboard milestone, with detailed plans for upcoming phases.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request is a significant and well-executed refactoring that unifies the Next.js API backend proxies into a shared helper, greatly reducing code duplication and improving maintainability. The introduction of contract tests to enforce this new pattern is an excellent addition. The changes are clean and the new proxy abstraction is well-designed. I have one suggestion to improve debuggability in the new event emission helper.

gemini-code-assist · 2026-03-15T04:45:20Z

src/paperbot/api/routes/agent_board.py

+    except Exception as exc:  # pragma: no cover
+        log.debug("_emit_codex_event failed silently: %s", exc)


To improve debuggability, it's a good practice to include the exception traceback when logging caught exceptions. Using exc_info=True will automatically add the traceback to the log record, which can be invaluable for diagnosing why event emission might be failing silently.

Suggested change

except Exception as exc: # pragma: no cover

log.debug("_emit_codex_event failed silently: %s", exc)

log.debug("_emit_codex_event failed silently: %s", exc, exc_info=True)

Copilot

Pull request overview

This PR centralizes Next.js API route “backend proxy” behavior into a shared helper (JSON/text/binary/SSE), migrates many /web/src/app/api/**/route.ts handlers to use it, and adds contract tests to prevent regressions. It also introduces/extends agent dashboard + agent-event vocabulary plumbing across the web UI and backend.

Changes:

Add web/src/app/api/_utils/backend-proxy.ts and migrate many Next route handlers to use proxyJson/proxyText/proxyBinary/proxyStream.
Remove dead runbook proxy routes and add contract tests (tests/unit/test_next_backend_proxy_contracts.py, web/src/app/api/_utils/backend-proxy.test.ts).
Add agent-events store/parsers/UI components and backend codex delegation event helpers/constants.

Reviewed changes

Copilot reviewed 100 out of 101 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
web/vitest.config.ts	Configure jsdom env for component tests via `environmentMatchGlobs`.
web/src/lib/agent-events/useAgentEvents.ts	Client SSE hook for consuming backend `/api/events/stream` into Zustand.
web/src/lib/agent-events/types.ts	Agent event + Codex delegation display types.
web/src/lib/agent-events/store.ts	Zustand store for agent events, file touches, codex delegations, kanban tasks.
web/src/lib/agent-events/parsers.ts	Parse raw SSE envelopes into display/store entries.
web/src/components/layout/Sidebar.tsx	Add sidebar entry for Agent Dashboard.
web/src/components/agent-events/ToolCallTimeline.tsx	UI for tool call timeline panel.
web/src/components/agent-events/AgentStatusPanel.tsx	UI for agent status grid + connection indicator.
web/src/components/agent-events/ActivityFeed.tsx	UI for activity feed panel.
web/src/components/agent-dashboard/TasksPanel.tsx	Left rail panel (agents + runs selector).
web/src/components/agent-dashboard/KanbanBoard.tsx	Kanban board UI for agent tasks by status.
web/src/components/agent-dashboard/KanbanBoard.test.tsx	Component tests for KanbanBoard rendering/helpers.
web/src/components/agent-dashboard/InlineDiffPanel.tsx	Diff viewer wrapper panel for a selected file change.
web/src/components/agent-dashboard/FileListPanel.tsx	File list panel + navigation into InlineDiffPanel.
web/src/app/api/studio/status/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/studio/cwd/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/studio/chat/route.ts	Migrate SSE route to shared `proxyStream` with auth.
web/src/app/api/sandbox/status/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/runbook/snapshots/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/runbook/snapshots/[snapshotId]/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/runbook/smoke/route.ts	Remove dead proxy route.
web/src/app/api/runbook/runs/[runId]/route.ts	Remove dead proxy route.
web/src/app/api/runbook/revert/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/runbook/revert-project/route.ts	Migrate route to shared `proxyJson` (auth).
web/src/app/api/runbook/revert-hunks/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/runbook/project-dir/prepare/route.ts	Migrate route to shared `proxyJson`.
web/src/app/api/runbook/files/route.ts	Migrate route to shared `proxyJson` (query passthrough).
web/src/app/api/runbook/file/route.ts	Migrate GET/POST to shared `proxyJson`.
web/src/app/api/runbook/diff/route.ts	Migrate route to shared `proxyJson` (query passthrough).
web/src/app/api/runbook/delete/route.ts	Migrate route to shared `proxyJson` (auth).
web/src/app/api/runbook/changes/route.ts	Migrate route to shared `proxyJson` (query passthrough).
web/src/app/api/runbook/allowed-dirs/route.ts	Migrate GET/POST to shared `proxyJson`.
web/src/app/api/research/repro/context/route.ts	Use shared proxy stream helper for SSE generation.
web/src/app/api/research/paperscool/sessions/[sessionId]/route.ts	Migrate route to shared `proxyText`.
web/src/app/api/research/paperscool/daily/route.ts	Migrate to `proxyStream` with JSON fallback passthrough.
web/src/app/api/research/paperscool/analyze/route.ts	Migrate to `proxyStream`.
web/src/app/api/research/_base.ts	Replace local proxy helper with wrapper around shared proxy (auth=true).
web/src/app/api/papers/export/route.ts	Migrate binary download to shared `proxyBinary`.
web/src/app/api/newsletter/unsubscribe/[token]/route.ts	Migrate HTML passthrough to shared `proxyText` with custom error handler.
web/src/app/api/gen-code/route.ts	Migrate to shared `proxyStream` (auth).
web/src/app/api/auth/reset-password/route.ts	Migrate to shared `proxyJson`.
web/src/app/api/auth/register/route.ts	Migrate to shared `proxyJson`.
web/src/app/api/auth/me/route.ts	Migrate to shared `proxyJson` (auth) for GET/PATCH/DELETE.
web/src/app/api/auth/me/change-password/route.ts	Migrate to shared `proxyJson` (auth).
web/src/app/api/auth/login-check/route.ts	Migrate to shared `proxyJson` with onError fallback.
web/src/app/api/auth/forgot-password/route.ts	Migrate to shared `proxyJson`.
web/src/app/api/_utils/backend-proxy.ts	New shared proxy implementation (JSON/text/binary/SSE).
web/src/app/api/_utils/backend-proxy.test.ts	Unit tests for proxy helper behavior.
web/src/app/api/_utils/auth-headers.ts	Expand base URL env fallback chain.
web/src/app/agent-events/page.tsx	Agent events debug page mounting SSE hook + panels.
web/src/app/agent-dashboard/page.tsx	Agent dashboard page mounting SSE hook + SplitPanels layout.
web/package.json	Add testing deps + reorder deps (testing-library, jsdom, next-auth, radix popover).
tests/unit/test_next_backend_proxy_contracts.py	Contract tests ensuring Next route handlers don’t regress to direct fetch/env base URLs.
tests/unit/test_codex_overflow.py	Unit tests for codex overflow env-var stub.
tests/unit/test_agent_events_vocab.py	Extend EventType vocabulary tests incl. codex constants.
tests/unit/test_agent_board_codex_events.py	Unit tests for codex delegation event emission helper.
src/paperbot/repro/orchestrator.py	Add `_should_overflow_to_codex()` env-var stub.
src/paperbot/mcp/tools/_audit.py	Use EventType constants for tool result/error envelope types.
src/paperbot/application/collaboration/message_schema.py	Introduce `EventType` constants incl. codex delegation types.
src/paperbot/application/collaboration/agent_events.py	New helper module for lifecycle + tool-call event envelope creation.
src/paperbot/api/routes/agent_board.py	Add codex event emission helper + emit at dispatch/accept/complete/fail points.
.planning/phases/10-agent-board-codex-bridge/10-VALIDATION.md	Phase validation docs update.
.planning/phases/10-agent-board-codex-bridge/10-03-PLAN.md	Plan doc for dashboard integration/codex worker.
.planning/phases/10-agent-board-codex-bridge/10-02-SUMMARY.md	Summary doc for phase outputs.
.planning/phases/10-agent-board-codex-bridge/10-01-SUMMARY.md	Summary doc for backend codex event work.
.planning/phases/10-agent-board-codex-bridge/10-01-PLAN.md	Plan doc for codex event emission/constants.
.planning/phases/09-three-panel-dashboard/09-VERIFICATION.md	Phase verification docs update.
.planning/phases/09-three-panel-dashboard/09-VALIDATION.md	Phase validation docs update.
.planning/phases/09-three-panel-dashboard/09-02-SUMMARY.md	Summary doc update.
.planning/phases/09-three-panel-dashboard/09-01-SUMMARY.md	Summary doc update.
.planning/phases/08-agent-event-vocabulary/08-VERIFICATION.md	Phase verification docs update.
.planning/phases/08-agent-event-vocabulary/08-VALIDATION.md	Phase validation docs update.
.planning/phases/08-agent-event-vocabulary/08-02-SUMMARY.md	Summary doc update.
.planning/phases/08-agent-event-vocabulary/08-01-SUMMARY.md	Summary doc update.
.planning/phases/08-agent-event-vocabulary/08-01-PLAN.md	Plan doc update.
.planning/codebase/STACK.md	New/updated stack documentation.
.planning/codebase/CONVENTIONS.md	New/updated conventions documentation.
.planning/STATE.md	Update project state tracking.
.planning/REQUIREMENTS.md	Update requirements tracking/completion state.
.planning/PROJECT.md	Update project scope/roadmap narrative.

Files not reviewed (1)

web/package-lock.json: Language not supported

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

web/src/lib/agent-events/useAgentEvents.ts

+  const abortRef = useRef<AbortController | null>(null)
+
+  useEffect(() => {
+    const controller = new AbortController()
+    abortRef.current = controller


web/src/app/api/_utils/backend-proxy.ts

+  try {
+    return await fetch(upstreamUrl, {
+      ...init,
+      method,
+      headers: await resolveHeaders(req, method, options),
+      body: await resolveBody(req, method),
+      cache: options.cache,
+      signal: controller?.signal,
+    } as RequestInit & { dispatcher?: Agent })


web/src/components/layout/Sidebar.tsx

@@ -43,6 +44,7 @@ const routes = [
  { label: "Papers", icon: FileText, href: "/papers" },
  { label: "Workflows", icon: Workflow, href: "/workflows" },
  { label: "DeepCode Studio", icon: Code2, href: "/studio" },
+  { label: "Agent Dashboard", icon: Monitor, href: "/agent-dashboard" },


src/paperbot/api/routes/agent_board.py

+def _get_event_log_from_container():
+    """Lazily retrieve the event_log from the DI Container.
+
+    Lazy import inside function body prevents circular imports at module load
+    time. Returns None if the container is not configured or has no event_log.
+    """
+    try:
+        from paperbot.core.di import Container  # noqa: PLC0415
+
+        return Container.instance().event_log
+    except Exception:
+        return None


src/paperbot/api/routes/agent_board.py

+    await _emit_codex_event(
+        "codex_dispatched",
+        task,
+        session,
+        {"assignee": task.assignee},
+    )


web/src/lib/agent-events/useAgentEvents.ts

+    async function connect() {
+      try {
+        const res = await fetch(`${BACKEND_URL}/api/events/stream`, {
+          signal: controller.signal,
+          headers: { Accept: "text/event-stream" },
+        })
+        if (!res.ok || !res.body) return


web/src/lib/agent-events/useAgentEvents.ts

+      } catch (err) {
+        if ((err as Error)?.name !== "AbortError") {
+          console.warn("[useAgentEvents] disconnected, will retry in 3s", err)
+          setTimeout(connect, 3000)
+        }
+      } finally {
+        setConnected(false)
+      }


sonarqubecloud · 2026-03-15T04:53:52Z

Quality Gate passed

Issues
4 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

coderabbitai

Actionable comments posted: 16

Note

Due to the large number of review comments, Critical, Major severity comments were prioritized as inline comments.

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

.planning/REQUIREMENTS.md (1)

259-297: ⚠️ Potential issue | 🟡 Minor

Traceability table is split, so v1.2 rows are detached from the table structure.

ADAPT-01 through DOMAIN-02 appear after the Coverage bullet list, which breaks table continuity. Move those rows into the main traceability table (or start a new table with headers) so the mapping stays readable and machine-parseable.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/REQUIREMENTS.md around lines 259 - 297, The traceability rows for
ADAPT-01 through DOMAIN-02 are outside the main table and break parsing; move
all those rows back into the traceability table structure (or create a new
markdown table with headers) so they are contiguous with the rest of the table;
specifically relocate the block containing ADAPT-01 ... DOMAIN-02 into the main
table body (or replace it with a properly formatted table header + rows)
ensuring each requirement ID (e.g., ADAPT-01, MONIT-01, CTRL-01, SESS-01,
VIS-01, DOMAIN-01) is a table row with the same columns as the existing
traceability table for machine-parseability.

.planning/STATE.md (1)

43-72: ⚠️ Potential issue | 🟡 Minor

Performance metrics section is internally inconsistent and partially malformed.

Line 44 conflicts with Line 13 (6 vs 15 completed plans), and Lines 63-72 are table-like rows without a valid table header context.

Proposed cleanup

 **Velocity:**
-- Total plans completed: 6
+- Total plans completed: 15
 - Average duration: 6 min
 - Total execution time: 0.6 hours

 **Recent Trend:**
 - Last 3 plans: 3min, 2min, 3min
 - Trend: Stable
-| Phase 06-agent-skills P01 | 3 | 2 tasks | 5 files |
-| Phase 07-eventbus-sse-foundation P01 | 3 | 2 tasks | 3 files |
+ 
+| Recent Phase | Duration | Tasks | Files |
+|--------------|----------|-------|-------|
+| Phase 06-agent-skills P01 | 3min | 2 tasks | 5 files |
+| Phase 07-eventbus-sse-foundation P01 | 3min | 2 tasks | 3 files |
 ...

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.planning/STATE.md around lines 43 - 72, The "Velocity:" and "By Phase:"
sections are inconsistent and the trailing rows under "Recent Trend:" are
malformed table entries; update the "Velocity:" totals to match the sum of plans
in "By Phase:" (or vice versa) and normalize the phase rows into a single valid
Markdown table under the "By Phase:" header, remove or move the misplaced phase
rows from the "Recent Trend:" section, and ensure "Recent Trend:" contains only
simple bullet metrics (e.g., last N plan durations and trend). Specifically edit
the "Velocity:" header, the "By Phase:" table rows (the entries like "02",
"03-remaining-mcp-tools P01", etc.), and the stray rows currently at lines 63-72
so all counts (Total plans, Avg duration, per-phase plan counts) sum
consistently.

🟡 Minor comments (13)

tests/unit/test_agent_board_codex_events.py-67-68 (1)
67-68: ⚠️ Potential issue | 🟡 Minor

Remove unused function _fake_container_instance.

This function is defined but never called. The test uses monkeypatch.setattr directly with a lambda instead.
🧹 Proposed fix
-    def _fake_container_instance():
-        return fake_container
-
     monkeypatch.setattr(
         "paperbot.api.routes.agent_board._get_event_log_from_container",
         lambda: fake_log,
     )
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/unit/test_agent_board_codex_events.py` around lines 67 - 68, Remove the
unused helper function _fake_container_instance from the test file; it is
defined but never called because the test uses monkeypatch.setattr with a lambda
instead. Delete the _fake_container_instance function definition and any unused
imports/refs tied only to it so the test remains clean and unchanged, and ensure
monkeypatch.setattr usage (the lambda) remains as-is.
.planning/ROADMAP.md-174-179 (1)
174-179: ⚠️ Potential issue | 🟡 Minor

Roadmap status data is internally inconsistent.

Phase 9 is shown as complete (2/2) in Progress, but its plan items are still unchecked, and Phase 8/10 progress rows appear column-shifted (plan counts ending up under Milestone). Please normalize these entries so phase tracking is unambiguous.

Also applies to: 414-416
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/ROADMAP.md around lines 174 - 179, The roadmap's phase rows have
misaligned/mismatched status and checkbox items causing Phase 9 to show complete
while plan checkboxes remain unchecked and Phase 8/10 counts are column-shifted;
fix the table entries under the "Plans:" block so counts, progress (e.g.,
"2/2"), milestone and checkbox list are consistent: update the Phase 9 progress
cell to reflect the unchecked items (e.g., "0/2" or mark the checkboxes checked
to match "2/2"), and realign the numeric plan counts for Phase 8 and Phase 10 so
they sit under the correct headers (Milestone vs Progress), ensuring the two
plan lines ("09-01-PLAN.md" and "09-02-PLAN.md") accurately reflect checked
state and that similar fixes are applied to lines 414-416 to normalize phase
tracking.
.planning/codebase/INTEGRATIONS.md-146-147 (1)
146-147: ⚠️ Potential issue | 🟡 Minor

Markdownlint MD049 warning should be resolved in this section.

This emphasis style is flagged by markdownlint (Expected underscore; Actual asterisk). Please normalize to the repo’s configured emphasis style to keep docs lint-clean.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/codebase/INTEGRATIONS.md around lines 146 - 147, Replace the
asterisk-based emphasis in the indicated INTEGRATIONS.md line (the section
mentioning "Pydantic EmailStr with `email-validator>=2.0.0`") with
underscore-based emphasis to satisfy markdownlint MD049; locate the paragraph or
list item containing "Pydantic EmailStr" and change any *italic* or **bold**
asterisks to _italic_ or __bold__ respectively so the emphasis style matches the
repo's configured underscore convention.
web/src/components/agent-events/ToolCallTimeline.tsx-23-29 (1)
23-29: ⚠️ Potential issue | 🟡 Minor

Fix invalid date handling and consolidate duplicate formatTimestamp function.

The try/catch around new Date(ts) won't trigger for invalid timestamps—JavaScript returns an Invalid Date object (non-fatal) rather than throwing. This causes d.toTimeString() to return the string "Invalid Date", which .slice(0, 8) reduces to "Invalid " in the UI, bypassing the fallback entirely.

Additionally, this function is duplicated identically in ActivityFeed.tsx. Extract to a shared utility module.
Suggested fix
 function formatTimestamp(ts: string): string {
-  try {
-    const d = new Date(ts)
-    return d.toTimeString().slice(0, 8)
-  } catch {
-    return ts.slice(0, 8)
-  }
 }
+  const d = new Date(ts)
+  if (Number.isNaN(d.getTime())) return ts.slice(0, 8)
+  return d.toTimeString().slice(0, 8)
+}
Move this function to a shared utils module (e.g., web/src/lib/agent-events/format.ts) and import in both components. Add tests for both valid and invalid timestamp inputs.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@web/src/components/agent-events/ToolCallTimeline.tsx` around lines 23 - 29,
The formatTimestamp function in ToolCallTimeline.tsx is incorrect because new
Date(ts) does not throw for invalid inputs, so toTimeString() can yield "Invalid
Date"; replace the implementation of formatTimestamp to explicitly check the
Date validity (e.g., const d = new Date(ts); if (isNaN(d.getTime())) return
fallback like ts.slice(0,8); else return d.toTimeString().slice(0,8)), then
remove the duplicate implementation from ActivityFeed.tsx and extract this
function into a shared utility (suggested module name:
web/src/lib/agent-events/format.ts), import formatTimestamp into both
ToolCallTimeline and ActivityFeed, and add unit tests for valid and invalid
timestamp inputs.
tests/unit/test_next_backend_proxy_contracts.py-10-15 (1)
10-15: ⚠️ Potential issue | 🟡 Minor

Guard against vacuous passes when no route files are discovered.

If _route_files() returns empty, this contract test passes without validating anything.
✅ Suggested hardening
 def test_next_api_routes_use_shared_backend_proxy_helpers() -> None:
     offenders: list[str] = []
+    route_files = _route_files()
+    assert route_files, f"No route.ts files found under {API_ROOT}"
 
-    for file_path in _route_files():
+    for file_path in route_files:
         text = file_path.read_text(encoding="utf-8")
         rel = file_path.relative_to(ROOT)
Also applies to: 18-31
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/unit/test_next_backend_proxy_contracts.py` around lines 10 - 15, The
test currently allows a vacuous pass when _route_files() returns an empty list;
modify the test to guard against that by asserting that _route_files() is
non-empty (or raising a clear AssertionError) before proceeding with contract
validations so the test fails if no route files are discovered; locate the
helper _route_files() and the test block(s) that iterate its result (in
tests/unit/test_next_backend_proxy_contracts.py) and add a precondition check
like assert routes, "no route files found: ensure API_ROOT and route.ts files
are present" so missing discovery causes the test to fail rather than pass
vacuously.
web/src/components/agent-events/ActivityFeed.tsx-23-29 (1)
23-29: ⚠️ Potential issue | 🟡 Minor

formatTimestamp try/catch doesn't catch invalid dates — UI shows "Invalid " text.

JavaScript's new Date(ts) doesn't throw on invalid input; it returns an Invalid Date object. The catch block never executes, and Invalid Date.slice(0, 8)renders as"Invalid "` in the UI.

The same bug exists in web/src/components/agent-events/ToolCallTimeline.tsx — both should be fixed consistently. Fix by checking Number.isNaN(d.getTime()) instead of relying on try/catch. Add test coverage per the guideline that behavior changes should have tests.
🛠️ Proposed fix
 function formatTimestamp(ts: string): string {
-  try {
-    const d = new Date(ts)
-    return d.toTimeString().slice(0, 8) // HH:MM:SS
-  } catch {
-    return ts.slice(0, 8)
-  }
+  const d = new Date(ts)
+  if (Number.isNaN(d.getTime())) return ts.slice(0, 8)
+  return d.toISOString().slice(11, 19) // HH:MM:SS (UTC)
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@web/src/components/agent-events/ActivityFeed.tsx` around lines 23 - 29,
formatTimestamp currently assumes new Date(ts) throws on invalid input which it
doesn't, so replace the try/catch with an explicit validity check: in
formatTimestamp (web/src/components/agent-events/ActivityFeed.tsx) create the
Date d from ts, then if Number.isNaN(d.getTime()) return a safe fallback like
ts.slice(0,8) or an empty string, otherwise return d.toTimeString().slice(0,8);
apply the same change in the corresponding formatter in
web/src/components/agent-events/ToolCallTimeline.tsx; add unit tests covering
valid ISO timestamps, invalid strings, and edge cases to ensure the fallback
behavior is exercised.
.planning/phases/08-agent-event-vocabulary/08-RESEARCH.md-12-17 (1)
12-17: ⚠️ Potential issue | 🟡 Minor

Fix malformed Phase Requirements table rendering.

Line 17 is being parsed as part of the table, which causes the table-pipe/column-count lint failures and broken rendering.
Proposed markdown fix
 | EVNT-01 | User can view a real-time scrolling activity feed showing agent events as they happen | useAgentEvents hook consuming /api/events/stream; feed component bound to Zustand activity list |
 | EVNT-02 | User can see each agent's lifecycle status (idle, working, completed, errored) at a glance | agent_started/agent_completed/agent_error typed events update per-agent status map in Zustand store |
 | EVNT-03 | User can view a structured tool call timeline showing tool name, arguments, result summary, and duration | tool_call/tool_result typed events with standardized payload shape; existing _audit.py already emits compatible data |
+
 </phase_requirements>
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/08-agent-event-vocabulary/08-RESEARCH.md around lines 12 -
17, The Phase Requirements markdown table (rows EVNT-01, EVNT-02, EVNT-03) is
being parsed past its end; fix it by closing the table with a blank line (an
empty newline) immediately after the final table row so the parser stops
table-mode, or alternatively add a Markdown paragraph break (e.g., a blank line)
before the following XML/phase tag. Edit the block that contains the table rows
(the EVNT-XX entries) to ensure there is exactly one empty line after the table
to prevent the next line from being treated as a table row.
.planning/phases/09-three-panel-dashboard/09-01-SUMMARY.md-126-127 (1)
126-127: ⚠️ Potential issue | 🟡 Minor

Unify emphasis style to satisfy markdownlint.

Lines 126-127 are flagged by MD049 (expected underscore emphasis style). Please align these footer lines with the configured style.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/09-three-panel-dashboard/09-01-SUMMARY.md around lines 126
- 127, The footer lines currently use asterisks for emphasis ("*Phase:
09-three-panel-dashboard*" and "*Completed: 2026-03-15*"), which triggers MD049;
change them to use underscores for emphasis instead (e.g., _Phase:
09-three-panel-dashboard_ and _Completed: 2026-03-15_) so the emphasis style in
the three-panel-dashboard summary matches the configured underscore style.
.planning/phases/10-agent-board-codex-bridge/10-01-PLAN.md-56-56 (1)
56-56: ⚠️ Potential issue | 🟡 Minor

Resolve markdownlint warnings in this plan doc.

The current content still triggers MD037/MD046/MD040 at these lines (spaces inside emphasis markers, indented code block style, and a fence without language). Please normalize those sections so docs lint stays green.

Also applies to: 171-171, 222-222
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/10-agent-board-codex-bridge/10-01-PLAN.md at line 56, The
plan doc has markdownlint violations (MD037, MD046, MD040); fix them by removing
spaces inside emphasis markers (change "* text *" or "_ text _" to "*text*" or
"_text_"), replace indented code blocks with fenced code blocks using backticks,
and ensure every fenced code block includes a language hint (e.g., ```python)
for the sections that currently show "Output: Python EventType constants,
_emit_codex_event helper, _should_overflow_to_codex method, unit tests for all
three." and the other flagged occurrences; normalize the three mentioned
positions so there are no spaces inside emphasis markers, no indented code
blocks, and all fences include a language.
.planning/phases/09-three-panel-dashboard/09-01-PLAN.md-183-183 (1)
183-183: ⚠️ Potential issue | 🟡 Minor

Convert indented snippets to fenced code blocks.

These lines are flagged by MD046. Use fenced blocks (with language where possible) to match the project markdown style and avoid lint noise.

Also applies to: 197-197, 306-306
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/09-three-panel-dashboard/09-01-PLAN.md at line 183, The
indented code snippets under the "Store file tracking tests (append to
store.test.ts):" section (and the similar occurrences flagged at the other two
locations) must be converted from indented blocks to fenced code blocks using
triple backticks and an appropriate language tag (e.g., ```ts or ```javascript)
to satisfy MD046; find the snippet text under that heading and replace each
indented snippet with a fenced block, and apply the same change to the other
flagged snippets in the document (the ones reported by the linter) so all three
are consistent.
.planning/phases/10-agent-board-codex-bridge/10-02-SUMMARY.md-56-56 (1)
56-56: ⚠️ Potential issue | 🟡 Minor

Fix emphasis marker spacing on this line.

Line 56 is still flagged by markdownlint MD037 (no-space-in-emphasis).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/10-agent-board-codex-bridge/10-02-SUMMARY.md at line 56,
The MD037 error means there are spaces inside emphasis markers on the quoted
line; edit the line containing "web/package.json, web/package-lock.json (added
`@testing-library/react`, jsdom)" to remove spaces inside any
asterisks/underscores or, better, mark filenames and packages as code using
backticks (e.g. `web/package.json`, `web/package-lock.json`,
`@testing-library/react`, `jsdom`) so there are no spaced emphasis markers
remaining.
web/src/lib/agent-events/parsers.ts-37-40 (1)
37-40: ⚠️ Potential issue | 🟡 Minor

Inconsistent runtime guards for payload fields can leak [object Object] and bypass type validation.

Payload is cast as Record<string, unknown> (line 32), making field values untyped. Unguarded String() coercions throughout the file are unsafe. Additionally, the status cast at line 66 lacks runtime validation and can accept invalid values.

The codebase already uses proper runtime guards elsewhere (e.g., typeof payload.error === "string" at line 22), so the unguarded cases should be hardened consistently.
🛡️ Suggested hardening
+function asText(value: unknown, fallback = ""): string {
+  if (typeof value === "string") return value
+  if (typeof value === "number" || typeof value === "boolean") return String(value)
+  return fallback
+}
+
 ...
-  const tool = String(payload.tool ?? raw.stage ?? "unknown")
+  const tool = asText(payload.tool, asText(raw.stage, "unknown"))
 ...
-  const path = String(
-    (isExplicitFileChange
-      ? payload.path
-      : payload.arguments
-        ? (payload.arguments as Record<string, unknown>).path
-        : undefined) ?? ""
-  )
+  const rawPath = isExplicitFileChange
+    ? payload.path
+    : (payload.arguments as Record<string, unknown> | undefined)?.path
+  const path = typeof rawPath === "string" ? rawPath : ""
   if (!path) return null
 ...
-    status: (payload.status as "created" | "modified") ?? "modified",
+    status: payload.status === "created" || payload.status === "modified" ? payload.status : "modified",
 ...
-  const task_id = String(payload.task_id ?? "")
-  const task_title = String(payload.task_title ?? "")
-  const assignee = String(payload.assignee ?? raw.agent_name ?? "")
-  const session_id = String(payload.session_id ?? "")
+  const task_id = asText(payload.task_id)
+  const task_title = asText(payload.task_title)
+  const assignee = asText(payload.assignee, asText(raw.agent_name))
+  const session_id = asText(payload.session_id)
Also applies to: 81–85, 93–98, 116, 155–159
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@web/src/lib/agent-events/parsers.ts` around lines 37 - 40, The code is using
unsafe String() coercions on untyped payload fields (e.g., payload.detail,
payload.tool, payload.result_summary) and an unchecked cast for status; replace
these with runtime guards and safe formatting: check types like typeof
payload.detail === "string" (or payload.detail == null) before using, for
objects use JSON.stringify(payload.detail) or a fallback like "[object]" when
not a string, and validate status against the expected union of values before
accepting the cast. Update the branches that reference raw.agent_name,
payload.detail, payload.tool, payload.result_summary and the status cast to
perform these guards so object values do not become "[object Object]" and
invalid status values are rejected or defaulted.
.planning/phases/08-agent-event-vocabulary/08-01-PLAN.md-112-118 (1)
112-118: ⚠️ Potential issue | 🟡 Minor

Strengthen the _audit.py regression test.

This only proves the constants appear somewhere in the file. It still passes if type="error" or type="tool_result" survives beside the new constants, so the regression you're trying to prevent can slip through. Add explicit negative assertions for the raw type= literals or inspect the relevant call site instead of doing a presence-only source check.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/08-agent-event-vocabulary/08-01-PLAN.md around lines 112 -
118, Update the test_audit_uses_constants unit test to not only assert that
"EventType.TOOL_ERROR" and "EventType.TOOL_RESULT" appear in the _audit.py
source, but also assert that the raw literals type="error" and
type="tool_result" do NOT appear; alternatively, read the _audit.py AST or
locate the specific call site (e.g., the audit/event creation call) and assert
its type argument uses EventType.TOOL_ERROR / EventType.TOOL_RESULT rather than
raw strings; keep the test synchronous, use direct file read or AST inspection,
and keep the test named test_audit_uses_constants.

🧹 Nitpick comments (21)

.planning/codebase/STRUCTURE.md (1)

7-65: Add language specifier to fenced code block.

The directory layout code block should have a language specifier for proper rendering and accessibility.

📝 Suggested fix

-```
+```text
 PaperBot/
 ├── src/paperbot/                # Python backend source code

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.planning/codebase/STRUCTURE.md around lines 7 - 65, The fenced code block
in .planning/codebase/STRUCTURE.md lacks a language specifier which can affect
rendering; update the opening fence for the directory layout (the
triple-backtick before the "PaperBot/" tree) to include a language tag such as
text (i.e., change ``` to ```text) so the block is properly highlighted and
accessible; ensure only the opening fence is modified and the rest of the ASCII
tree (starting with "PaperBot/") remains unchanged.

.planning/research/STACK.md (1)

117-138: Add language specifier to architecture integration map.

The architecture diagram should have a language specifier for consistency.

📝 Suggested fix

-```
+```text
 User Browser
     │
     ├── SSE stream (EventSource)     → FastAPI /api/agent/events  → EventBus fan-out

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.planning/research/STACK.md around lines 117 - 138, Add a language/specifier
label to the architecture integration map by inserting a short header or inline
tag (e.g., "Language: text" or "Format: text") above or beside the ASCII diagram
so it's clear what syntax/format the diagram uses; update the block containing
"User Browser", "SSE stream (EventSource) → FastAPI /api/agent/events → EventBus
fan-out" and the surrounding diagram lines (including "WebSocket (ws) → Next.js
custom server" and "HTTP fetch (chat proxy) → Next.js API route") to include
that language specifier consistently.

.planning/research/FEATURES.md (1)

75-125: Add language specifier to feature dependency graph.

The feature dependency diagram should have a language specifier for consistency with other documentation.

📝 Suggested fix

-```
+```text
 Chat input / Task dispatch
     |
     +--requires--> Agent adapter layer (ClaudeCodeAdapter / CodexAdapter / OpenCodeAdapter)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.planning/research/FEATURES.md around lines 75 - 125, The ASCII feature
dependency diagram starting with "Chat input / Task dispatch" is missing a
language specifier on its fenced code block; update the opening fence to include
a language (e.g., change ``` to ```text) so the block is consistently rendered
and matches other docs; locate the fenced block containing "Chat input / Task
dispatch" and replace the opening triple-backtick with ```text (keeping the same
closing fence).

web/src/app/api/auth/me/route.ts (1)

1-13: Missing runtime export for consistency.

Other migrated routes (e.g., studio/cwd/route.ts, runbook/revert-project/route.ts) explicitly export runtime = "nodejs". Consider adding this for consistency across the proxy routes:

+export const runtime = "nodejs"
+
 import { apiBaseUrl, proxyJson } from "@/app/api/_utils/backend-proxy"

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/app/api/auth/me/route.ts` around lines 1 - 13, Add an explicit
runtime export to match other migrated proxy routes: export const runtime =
"nodejs". Place this top-level export in the same module that defines the GET,
PATCH, and DELETE handlers so the runtime is applied to those functions (GET,
PATCH, DELETE) and keeps behavior consistent with routes like
studio/cwd/route.ts and runbook/revert-project/route.ts.

tests/unit/test_codex_overflow.py (1)

10-56: Good TDD coverage for env var parsing.

The test cases cover the primary truthy/falsy scenarios. The import-inside-test pattern is correct for capturing env var state at module load time.

Consider adding test cases for:

"false", "no", "off" (explicit falsy strings)

Case-insensitive variants like "TRUE", "Yes" if the implementation should handle them

This is optional if the implementation intentionally requires lowercase.

,
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/unit/test_codex_overflow.py` around lines 10 - 56, Add missing tests
for explicit falsy strings and case variants for _should_overflow_to_codex:
create tests using monkeypatch.setenv for "false", "no", and "off" asserting the
function returns False, and add at least one case-insensitive check (e.g.,
"TRUE" returns True and "Yes"/"FALSE" behavior) to verify whether
_should_overflow_to_codex handles casing; keep the import-inside-test pattern
and mirror existing test style and names.

web/src/components/agent-dashboard/InlineDiffPanel.tsx (1)

8-11: Consider marking props as Readonly.

Per SonarCloud's suggestion, using Readonly<InlineDiffPanelProps> ensures the component doesn't accidentally mutate its props—a TypeScript best practice for functional components.

Proposed fix

-export function InlineDiffPanel({ entry, onBack }: InlineDiffPanelProps) {
+export function InlineDiffPanel({ entry, onBack }: Readonly<InlineDiffPanelProps>) {

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/components/agent-dashboard/InlineDiffPanel.tsx` around lines 8 - 11,
Wrap the component props in Readonly to prevent mutation: change usages that
accept InlineDiffPanelProps (e.g., the InlineDiffPanel function/component
signature and any prop-typed variables) to use Readonly<InlineDiffPanelProps> so
the component receives an immutable props type; update the component declaration
that references InlineDiffPanelProps to Readonly<InlineDiffPanelProps> and
adjust any helper function/type annotations that consume the props accordingly.

src/paperbot/repro/orchestrator.py (1)

56-57: Consider narrowing the exception catch.

The bare except Exception is flagged by Ruff (BLE001). Given that os.getenv(), .strip(), and .lower() have well-defined exception behavior, this block is overly defensive. If you want to keep it for safety, catching a more specific set (e.g., (TypeError, AttributeError)) would be cleaner.
Proposed narrower exception handling
-    except Exception:
+    except (TypeError, AttributeError):
         return False
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/paperbot/repro/orchestrator.py` around lines 56 - 57, The bare except in
the block that wraps the os.getenv(...).strip().lower() call is too broad;
narrow it to only the expected exceptions (e.g., catch (TypeError,
AttributeError)) or remove the try/except and handle None explicitly before
calling .strip()/.lower(); locate the try/except surrounding the
os.getenv(...).strip().lower() expression in orchestrator.py and replace "except
Exception:" with a specific tuple of exceptions (TypeError, AttributeError) or
add an explicit None-check on the getenv result before calling
.strip()/.lower().

web/src/app/api/research/paperscool/sessions/[sessionId]/route.ts (1)

5-13: Consider using proxyJson for JSON responses.

The route uses proxyText with responseContentType: "application/json". If the backend returns structured JSON, proxyJson would be more semantically appropriate and likely handles JSON parsing/error cases more consistently with other routes.

💡 Suggested change

-import { apiBaseUrl, proxyText } from "@/app/api/_utils/backend-proxy"
+import { apiBaseUrl, proxyJson } from "@/app/api/_utils/backend-proxy"

 export async function GET(req: Request, ctx: { params: Promise<{ sessionId: string }> }) {
   const { sessionId } = await ctx.params
-  return proxyText(
+  return proxyJson(
     req,
     `${apiBaseUrl()}/api/research/paperscool/sessions/${encodeURIComponent(sessionId)}`,
     "GET",
-    {
-      cache: "no-store",
-      responseContentType: "application/json",
-    },
+    { cache: "no-store" },
   )
 }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/app/api/research/paperscool/sessions/`[sessionId]/route.ts around
lines 5 - 13, The handler is calling proxyText for an endpoint that returns
JSON; switch to proxyJson to better reflect/handle structured JSON responses.
Replace the proxyText call with proxyJson (keeping the same URL built with
apiBaseUrl() and encodeURIComponent(sessionId)) and remove or adjust the
responseContentType option as proxyJson already handles JSON; ensure the HTTP
method ("GET") and cache option ("no-store") are preserved so the call remains
functionally identical.

web/src/app/api/research/repro/context/route.ts (1)

1-2: Mixed import sources reflect intentional wrapper design but could be unified for consistency.

This file imports proxyJson from "../../_base" (which wraps the backend function to add auth: true by default) but proxyStream from "@/app/api/_utils/backend-proxy". While this pattern is intentional—_base does not wrap proxyStream—unifying both imports from @/app/api/_utils/backend-proxy is valid for consistency. If unified, explicitly pass auth: true to the proxyJson call on line 8 to preserve the current behavior.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@web/src/app/api/research/repro/context/route.ts` around lines 1 - 2, Unify
the imports by importing both proxyJson and proxyStream from
"@/app/api/_utils/backend-proxy" instead of proxyJson from "../../_base", and
when calling proxyJson (the function currently invoked to call apiBaseUrl)
explicitly pass { auth: true } to preserve the existing wrapped behavior; update
the import statements and the proxyJson call site to include the auth option
while leaving proxyStream usage unchanged.

tests/unit/test_agent_board_codex_events.py (1)

33-46: Consider using simple dataclasses instead of MagicMock for test fakes.

The coding guidelines recommend stub/fake classes over unittest.mock. While the current usage is lightweight, a simple dataclass would be more explicit and align with the project's testing conventions.

♻️ Alternative using dataclass

+from dataclasses import dataclass
+
+@dataclass
+class FakeTask:
+    id: str = "task-abc"
+    title: str = "Implement model"
+    assignee: str = "codex-a1b2"
+
+@dataclass
+class FakeSession:
+    session_id: str = "board-test-session"

-def _make_task(task_id: str = "task-abc", title: str = "Implement model", assignee: str = "codex-a1b2"):
-    """Return a minimal AgentTask-like object without importing heavy deps."""
-    task = MagicMock()
-    task.id = task_id
-    task.title = title
-    task.assignee = assignee
-    return task
+def _make_task(task_id: str = "task-abc", title: str = "Implement model", assignee: str = "codex-a1b2"):
+    """Return a minimal AgentTask-like object without importing heavy deps."""
+    return FakeTask(id=task_id, title=title, assignee=assignee)

-def _make_session(session_id: str = "board-test-session"):
-    """Return a minimal BoardSession-like object."""
-    session = MagicMock()
-    session.session_id = session_id
-    return session
+def _make_session(session_id: str = "board-test-session"):
+    """Return a minimal BoardSession-like object."""
+    return FakeSession(session_id=session_id)

Based on learnings: "Use stub/fake classes (e.g., _FakeLLMService) rather than unittest.mock for test doubles in Python tests"

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@tests/unit/test_agent_board_codex_events.py` around lines 33 - 46, Replace
the MagicMock-based test fakes with simple dataclasses: create lightweight
dataclasses (e.g., AgentTaskFake with fields id, title, assignee and
BoardSessionFake with field session_id) and update the helper functions
_make_task and _make_session to return instances of those dataclasses instead of
MagicMock; ensure field names match the expectations of the tests/code that
consumes these objects (id, title, assignee for the AgentTask-like object and
session_id for the BoardSession-like object) so no other test changes are
needed.

web/src/lib/agent-events/store.ts (1)

81-86: Eviction relies on Object.keys() insertion order.

The eviction logic assumes keys[0] is the oldest run. This works because:

Modern JS guarantees string key order matches insertion order
Entries are only added, never re-inserted for existing run_ids

Consider adding a brief comment clarifying this assumption for future maintainers.

📝 Suggested clarifying comment

       // Evict oldest run_id when exceeding 20 runs
+      // Object.keys() preserves insertion order for string keys,
+      // so keys[0] is the earliest-added run_id
       const keys = Object.keys(updated)
       if (keys.length > 20) {
         delete updated[keys[0]]
       }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/lib/agent-events/store.ts` around lines 81 - 86, Eviction uses
Object.keys(updated)[0] to remove the oldest run but relies on JavaScript’s
insertion-order guarantee for string keys and the fact that run_ids are only
ever added (never re-inserted); add a concise comment next to the eviction block
(around the updated/keys/filesTouched logic) stating this assumption and why
it's safe (JS preserves insertion order for string keys and we don't reinsert
existing run_ids), so future maintainers understand why keys[0] is the oldest
and avoid accidental reordering changes.

web/src/components/agent-dashboard/TasksPanel.tsx (1)

13-23: Consider memoizing the derived runs list.

The runs derivation logic runs on every render. Since feed can contain up to 200 items (per FEED_MAX in store), consider wrapping this in useMemo to avoid recalculating on unrelated re-renders.

♻️ Proposed memoization

+import { useMemo } from "react"
 import { useAgentEventStore } from "@/lib/agent-events/store"
 ...

-  // Derive unique run_ids from feed, most recent first, limit 20
-  const runs: { run_id: string; agent_name: string }[] = []
-  const seen = new Set<string>()
-  for (const item of feed) {
-    const run_id = String(item.raw.run_id ?? "")
-    if (run_id && !seen.has(run_id)) {
-      seen.add(run_id)
-      runs.push({ run_id, agent_name: item.agent_name })
-      if (runs.length >= 20) break
-    }
-  }
+  // Derive unique run_ids from feed, most recent first, limit 20
+  const runs = useMemo(() => {
+    const result: { run_id: string; agent_name: string }[] = []
+    const seen = new Set<string>()
+    for (const item of feed) {
+      const run_id = String(item.raw.run_id ?? "")
+      if (run_id && !seen.has(run_id)) {
+        seen.add(run_id)
+        result.push({ run_id, agent_name: item.agent_name })
+        if (result.length >= 20) break
+      }
+    }
+    return result
+  }, [feed])

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/components/agent-dashboard/TasksPanel.tsx` around lines 13 - 23, The
runs derivation in the TasksPanel component recomputes on every render; wrap the
logic that builds runs (the seen Set, run_id extraction, and runs.push) in a
useMemo so it only recalculates when feed changes. Import useMemo from React if
needed, replace the current for-loop with a const runs = useMemo(() => { /*
existing loop creating runs array using seen and run_id, limit 20 */; return
runs; }, [feed]) so the memo depends on feed and preserves the same run
objects/shape.

web/src/components/agent-dashboard/KanbanBoard.test.tsx (2)

62-69: The count-badge assertion is too broad.

Asserting global text "2" is prone to false positives. Prefer asserting the badge within the intended column header.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/components/agent-dashboard/KanbanBoard.test.tsx` around lines 62 -
69, The test "column header shows task count badge" asserts a global "2" which
is too broad; update the test for KanbanBoard to scope the assertion to the
specific column header DOM node: locate the column header element for status
"planning" (e.g., findHeading/getByRole or getByText for the column title) and
then use testing-library's within(headerElement).getByText("2") or
expect(within(headerElement).queryByText("2")).toBeInTheDocument() so the badge
count is asserted only inside that column header.

32-54: Column-placement tests are not actually validating column placement.

These assertions only prove the task title exists somewhere on the board, so they can still pass if a task is rendered in the wrong column. Scope assertions to the specific column container (e.g., via within(...) on the target column).

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@web/src/components/agent-dashboard/KanbanBoard.test.tsx` around lines 32 -
54, The tests for KanbanBoard (tests that create tasks via makeTask and render
<KanbanBoard />) currently assert only that a title exists on the page; change
each test to scope the assertion to the specific column container by locating
the column (e.g., find the "Planned", "In Progress", "Done", or "Blocked" column
using screen.getByRole/getByText or a test id on the column) and then use
within(column).getByText("...") instead of screen.getAllByText(...).length
checks so each test verifies the task appears in the correct column.

src/paperbot/api/routes/agent_board.py (2)

828-833: Use EventType constants instead of raw "codex_*" strings.

Lines 829, 2449, 2503, and 2537 pass raw literals even though this flow is tied to the shared event vocabulary. Using constants avoids typo regressions and keeps backend/frontend contracts aligned.

Also applies to: 2448-2453, 2502-2511, 2536-2545

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/paperbot/api/routes/agent_board.py` around lines 828 - 833, Replace raw
string event names with the shared EventType constants: in calls to
_emit_codex_event (and other emit sites) swap the literal "codex_dispatched"
(and other "codex_*" literals) for the corresponding EventType member (e.g.,
EventType.codex_dispatched or the project's canonical EventType name). Update
all occurrences that pass raw "codex_*" strings (including the other emit calls
around the same flow) so they import and use EventType instead of hard-coded
literals to keep the shared event vocabulary consistent.

73-78: Remove the unnecessary async wrapper around _emit_codex_event.

The function is declared as async at line 73 but never awaits, adding unnecessary coroutine overhead and forcing await at every call site without benefit. Convert to a synchronous function.

Proposed refactor

-async def _emit_codex_event(
+def _emit_codex_event(
     event_type: str,
     task,
     session,
     extra: dict,
 ) -> None:
@@
-    await _emit_codex_event(
+    _emit_codex_event(
         "codex_dispatched",
         task,
         session,
         {"assignee": task.assignee},
     )
@@
-    await _emit_codex_event(
+    _emit_codex_event(
         "codex_dispatched",
         task,
         session,
         {"assignee": task.assignee},
     )
@@
-    await _emit_codex_event(
+    _emit_codex_event(
         "codex_accepted",
         task,
         session,
         {"assignee": task.assignee, "model": "codex"},
     )
@@
-        await _emit_codex_event(
+        _emit_codex_event(
             "codex_failed",
             task,
             session,
             {
                 "assignee": task.assignee,
                 "reason_code": result.diagnostics.get("reason_code", "unknown"),
                 "error": str(result.error or ""),
             },
         )
@@
-    await _emit_codex_event(
+    _emit_codex_event(
         "codex_completed",
         task,
         session,
         {
             "assignee": task.assignee,
             "files_generated": task.generated_files,
             "output_preview": (result.output or "")[:200],
         },
     )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/paperbot/api/routes/agent_board.py` around lines 73 - 78,
_emit_codex_event is declared async but contains no awaits, so convert it to a
synchronous function: remove the async keyword and keep signature as def
_emit_codex_event(event_type: str, task, session, extra: dict) -> None, adjust
its return typing if needed, and update all call sites that currently use await
_emit_codex_event(...) to call it directly (drop await); ensure any imports or
annotations expecting a coroutine are updated accordingly.

.planning/research/ARCHITECTURE.md (1)

13-13: Add language tags to fenced blocks for lint/tooling consistency.

This file has repeated unlabeled fences; labeling them (text, python, etc.) will remove MD040 noise and improve rendering in editors.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/research/ARCHITECTURE.md at line 13, The markdown file contains
unlabeled fenced code blocks (``` ... ```) causing MD040 lint noise; update each
unlabeled fence in .planning/research/ARCHITECTURE.md to include an appropriate
language tag (e.g., ```text, ```python, ```json) matching the block content so
tooling and linters correctly recognize the language and remove MD040 warnings.

.planning/phases/10-agent-board-codex-bridge/10-03-PLAN.md (1)

155-199: Use repo-relative paths in verification commands.

Hardcoded local paths reduce portability and can break reproducibility across environments.

Proposed command updates

-<automated>test -f /home/master1/PaperBot/.claude/agents/codex-worker.md && head -3 /home/master1/PaperBot/.claude/agents/codex-worker.md && cd /home/master1/PaperBot/web && npm run build 2>&1 | tail -5</automated>
+<automated>test -f .claude/agents/codex-worker.md && head -3 .claude/agents/codex-worker.md && cd web && npm run build 2>&1 | tail -5</automated>

-<automated>cd /home/master1/PaperBot/web && npm run build 2>&1 | tail -3</automated>
+<automated>cd web && npm run build 2>&1 | tail -3</automated>

-cd /home/master1/PaperBot/web && npm run build 2>&1 | tail -5
+cd web && npm run build 2>&1 | tail -5

-cd /home/master1/PaperBot/web && npm test -- KanbanBoard agent-events 2>&1 | tail -10
+cd web && npm test -- KanbanBoard agent-events 2>&1 | tail -10

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.planning/phases/10-agent-board-codex-bridge/10-03-PLAN.md around lines 155
- 199, The verification commands in the
.planning/phases/10-agent-board-codex-bridge/10-03-PLAN.md file use hardcoded
absolute paths (e.g., /home/master1/PaperBot/web and
/home/master1/PaperBot/.claude) which reduces portability; update the automated
and verification command snippets (the lines under <automated> and the fenced
bash block in <verification>) to use repo-relative paths (e.g., cd web && npm
run build, test -f .claude/agents/codex-worker.md, etc.) so they run correctly
in different environments; locate these commands in the PLAN.md content (search
for the <automated> blocks and the fenced verification bash block) and replace
absolute paths with relative ones while keeping the same checks and outputs.

.planning/phases/08-agent-event-vocabulary/08-RESEARCH.md (1)

69-69: Add language identifiers to fenced code blocks.

Unlabeled fences reduce readability/tooling support and trigger repeated MD040 warnings.

Example fix pattern

-```
+```text
 src/paperbot/
 ├── application/
 │   └── collaboration/
 ...
-```
+```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.planning/phases/08-agent-event-vocabulary/08-RESEARCH.md at line 69, The
fenced code block at the shown snippet uses unlabeled backticks (```) which
triggers MD040 and reduces tooling support; update the opening fence to include
a language identifier (e.g., change ``` to ```text or ```bash as appropriate)
for all unlabeled blocks in the document (look for occurrences of the
triple-backtick fences such as the example around the project tree) so each
fenced block is labeled and no longer triggers the MD040 warnings.

web/src/lib/agent-events/types.ts (1)

20-21: Include tool_call in the typed tool-event surface.

This file says it mirrors the backend event vocabulary, but ToolCallEvent.type only allows terminal tool events. Any consumer that wants to type the pre-result tool_call event still has to fall back to AgentEventEnvelopeRaw, which defeats the point of exporting a dedicated tool event type.
Suggested fix
 export type ToolCallEvent = {
-  type: "tool_result" | "tool_error"
+  type: "tool_call" | "tool_result" | "tool_error"
   run_id: string
   trace_id: string
   agent_name: string
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@web/src/lib/agent-events/types.ts` around lines 20 - 21, ToolCallEvent
currently only allows "tool_result" | "tool_error" and thus excludes the
pre-result "tool_call" event; update the exported ToolCallEvent type so its type
union includes "tool_call" (i.e., change the type literal union on
ToolCallEvent.type to "tool_call" | "tool_result" | "tool_error") so consumers
can statically type pre-result tool events without falling back to
AgentEventEnvelopeRaw.

.planning/phases/10-agent-board-codex-bridge/10-RESEARCH.md (1)

77-77: Add language specifier to fenced code block.

For better readability and to follow Markdown best practices, specify a language for the fenced code block. For file tree structures, use text or plaintext.
📝 Proposed fix
-```
+```text
 web/src/
 ├── app/
 │   └── agent-dashboard/
As per coding guidelines flagged by markdownlint: fenced code blocks should have a language specified (MD040).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.planning/phases/10-agent-board-codex-bridge/10-RESEARCH.md at line 77, The
fenced code block showing the file tree (the block containing "web/src/ ├── app/
└── agent-dashboard/") is missing a language specifier which triggers MD040;
update that fenced code block to include a language tag such as "text" or
"plaintext" (e.g., ```text) so markdownlint passes and the tree renders with the
correct formatting.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: c724e8aa-4396-4a83-b08e-6a526ce3ef65

📥 Commits

Reviewing files that changed from the base of the PR and between 751bcb2 and f899732.

⛔ Files ignored due to path filters (1)

web/package-lock.json is excluded by !**/package-lock.json

📒 Files selected for processing (100)

.planning/PROJECT.md
.planning/REQUIREMENTS.md
.planning/ROADMAP.md
.planning/STATE.md
.planning/codebase/ARCHITECTURE.md
.planning/codebase/CONCERNS.md
.planning/codebase/CONVENTIONS.md
.planning/codebase/INTEGRATIONS.md
.planning/codebase/STACK.md
.planning/codebase/STRUCTURE.md
.planning/codebase/TESTING.md
.planning/phases/08-agent-event-vocabulary/08-01-PLAN.md
.planning/phases/08-agent-event-vocabulary/08-01-SUMMARY.md
.planning/phases/08-agent-event-vocabulary/08-02-PLAN.md
.planning/phases/08-agent-event-vocabulary/08-02-SUMMARY.md
.planning/phases/08-agent-event-vocabulary/08-RESEARCH.md
.planning/phases/08-agent-event-vocabulary/08-VALIDATION.md
.planning/phases/08-agent-event-vocabulary/08-VERIFICATION.md
.planning/phases/09-three-panel-dashboard/09-01-PLAN.md
.planning/phases/09-three-panel-dashboard/09-01-SUMMARY.md
.planning/phases/09-three-panel-dashboard/09-02-PLAN.md
.planning/phases/09-three-panel-dashboard/09-02-SUMMARY.md
.planning/phases/09-three-panel-dashboard/09-RESEARCH.md
.planning/phases/09-three-panel-dashboard/09-VALIDATION.md
.planning/phases/09-three-panel-dashboard/09-VERIFICATION.md
.planning/phases/10-agent-board-codex-bridge/10-01-PLAN.md
.planning/phases/10-agent-board-codex-bridge/10-01-SUMMARY.md
.planning/phases/10-agent-board-codex-bridge/10-02-PLAN.md
.planning/phases/10-agent-board-codex-bridge/10-02-SUMMARY.md
.planning/phases/10-agent-board-codex-bridge/10-03-PLAN.md
.planning/phases/10-agent-board-codex-bridge/10-RESEARCH.md
.planning/phases/10-agent-board-codex-bridge/10-VALIDATION.md
.planning/research/ARCHITECTURE.md
.planning/research/FEATURES.md
.planning/research/PITFALLS.md
.planning/research/STACK.md
.planning/research/SUMMARY.md
src/paperbot/api/routes/agent_board.py
src/paperbot/application/collaboration/agent_events.py
src/paperbot/application/collaboration/message_schema.py
src/paperbot/mcp/tools/_audit.py
src/paperbot/repro/orchestrator.py
tests/unit/test_agent_board_codex_events.py
tests/unit/test_agent_events_vocab.py
tests/unit/test_codex_overflow.py
tests/unit/test_next_backend_proxy_contracts.py
web/package.json
web/src/app/agent-dashboard/page.tsx
web/src/app/agent-events/page.tsx
web/src/app/api/_utils/auth-headers.ts
web/src/app/api/_utils/backend-proxy.test.ts
web/src/app/api/_utils/backend-proxy.ts
web/src/app/api/auth/forgot-password/route.ts
web/src/app/api/auth/login-check/route.ts
web/src/app/api/auth/me/change-password/route.ts
web/src/app/api/auth/me/route.ts
web/src/app/api/auth/register/route.ts
web/src/app/api/auth/reset-password/route.ts
web/src/app/api/gen-code/route.ts
web/src/app/api/newsletter/unsubscribe/[token]/route.ts
web/src/app/api/papers/export/route.ts
web/src/app/api/research/_base.ts
web/src/app/api/research/paperscool/analyze/route.ts
web/src/app/api/research/paperscool/daily/route.ts
web/src/app/api/research/paperscool/sessions/[sessionId]/route.ts
web/src/app/api/research/repro/context/route.ts
web/src/app/api/runbook/allowed-dirs/route.ts
web/src/app/api/runbook/changes/route.ts
web/src/app/api/runbook/delete/route.ts
web/src/app/api/runbook/diff/route.ts
web/src/app/api/runbook/file/route.ts
web/src/app/api/runbook/files/route.ts
web/src/app/api/runbook/project-dir/prepare/route.ts
web/src/app/api/runbook/revert-hunks/route.ts
web/src/app/api/runbook/revert-project/route.ts
web/src/app/api/runbook/revert/route.ts
web/src/app/api/runbook/runs/[runId]/route.ts
web/src/app/api/runbook/smoke/route.ts
web/src/app/api/runbook/snapshots/[snapshotId]/route.ts
web/src/app/api/runbook/snapshots/route.ts
web/src/app/api/sandbox/status/route.ts
web/src/app/api/studio/chat/route.ts
web/src/app/api/studio/cwd/route.ts
web/src/app/api/studio/status/route.ts
web/src/components/agent-dashboard/FileListPanel.tsx
web/src/components/agent-dashboard/InlineDiffPanel.tsx
web/src/components/agent-dashboard/KanbanBoard.test.tsx
web/src/components/agent-dashboard/KanbanBoard.tsx
web/src/components/agent-dashboard/TasksPanel.tsx
web/src/components/agent-events/ActivityFeed.tsx
web/src/components/agent-events/AgentStatusPanel.tsx
web/src/components/agent-events/ToolCallTimeline.tsx
web/src/components/layout/Sidebar.tsx
web/src/lib/agent-events/parsers.test.ts
web/src/lib/agent-events/parsers.ts
web/src/lib/agent-events/store.test.ts
web/src/lib/agent-events/store.ts
web/src/lib/agent-events/types.ts
web/src/lib/agent-events/useAgentEvents.ts
web/vitest.config.ts

💤 Files with no reviewable changes (2)

web/src/app/api/runbook/smoke/route.ts
web/src/app/api/runbook/runs/[runId]/route.ts

coderabbitai · 2026-03-15T04:57:07Z

.planning/phases/08-agent-event-vocabulary/08-02-PLAN.md

+1. **Create web/src/lib/agent-events/useAgentEvents.ts** — SSE consumer hook as specified in 08-RESEARCH.md Pattern 5. "use client" at top. Connect to `${BACKEND_URL}/api/events/stream` via fetch + readSSE(). For each incoming message: call parseActivityItem/parseAgentStatus/parseToolCall and dispatch to store. AbortController for cleanup. 3-second reconnect on error (not on AbortError). BACKEND_URL from `process.env.NEXT_PUBLIC_API_URL ?? "http://localhost:8000"`.
+
+    IMPORTANT: readSSE() yields SSEMessage objects, but the SSE endpoint sends raw AgentEventEnvelope dicts as JSON. The readSSE() parser returns whatever JSON.parse produces. Since the events endpoint sends `data: {envelope_dict}\n\n`, readSSE() will yield the parsed envelope dict directly. Cast `msg as AgentEventEnvelopeRaw` and check for `raw?.type` before dispatching.


⚠️ Potential issue | 🟠 Major

Use the proxied /api/events/stream route here.

Pointing the browser hook at ${BACKEND_URL}/api/events/stream bypasses the same-origin Next API layer this refactor is trying to standardize. That reintroduces CORS/cookie/auth drift and contradicts the /api/events/stream contract described earlier in the same plan. Prefer a relative /api/events/stream unless there is a deliberate exception.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In @.planning/phases/08-agent-event-vocabulary/08-02-PLAN.md around lines 181 - 183, The hook useAgentEvents.ts should not call `${BACKEND_URL}/api/events/stream`; update the fetch target to the relative path "/api/events/stream" so the browser uses the Next API proxy and preserves same-origin cookies/auth; keep the rest of the logic (readSSE(), casting msg as AgentEventEnvelopeRaw and checking raw?.type, calling parseActivityItem/parseAgentStatus/parseToolCall and dispatching to store, using AbortController and 3s reconnect on non-AbortError) unchanged.

coderabbitai · 2026-03-15T04:57:07Z

.planning/phases/09-three-panel-dashboard/09-02-PLAN.md

+    - Clicking a run button calls `setSelectedRunId(id)`. Clicking the same run again deselects (passes `null`)
+    - Highlight the selected run with `bg-accent` class


⚠️ Potential issue | 🟠 Major

Clear selectedFile when the active run changes.

With this flow, once a file is open the right pane stays on InlineDiffPanel even after the user picks a different run, so the promised “file list filters to that run’s files” behavior never happens until they manually go back. Have the run-selection path clear selectedFile, or ignore a selected file that does not belong to the active run.

Also applies to: 201-205

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In @.planning/phases/09-three-panel-dashboard/09-02-PLAN.md around lines 194 - 195, When the active run changes via setSelectedRunId(id) ensure the selected file state is reset: call setSelectedFile(null) (or equivalent) inside the run-selection handler (the same place that toggles selected run) so the right pane doesn't remain showing InlineDiffPanel for the previous run; alternatively implement a check in the render path for InlineDiffPanel to ignore or clear selectedFile if selectedFile.runId !== selectedRunId so a file that doesn't belong to the active run is not shown. Update the handler and/or InlineDiffPanel selection validation to enforce this behavior.

coderabbitai · 2026-03-15T04:57:07Z

.planning/phases/10-agent-board-codex-bridge/10-VALIDATION.md

+| **Framework** | vitest 2.1.4 (frontend); pytest + pytest-asyncio (backend) |
+| **Config file** | `web/vitest.config.ts` — environment: "node", alias: "@" → "./src" |
+| **Quick run command** | `cd web && npm test -- KanbanBoard agent-events` |
+| **Full suite command** | `cd web && npm test -- KanbanBoard agent-events && PYTHONPATH=src pytest tests/unit/test_agent_board_route.py tests/unit/test_codex_overflow.py -q` |


⚠️ Potential issue | 🔴 Critical

Run backend pytest from repo root, not inside web.

Line 23 and Line 31 chain cd web && ... && pytest ..., so pytest executes from web/. That can break tests/unit/... path resolution and PYTHONPATH=src targeting for backend tests.

Suggested fix

-| **Full suite command** | `cd web && npm test -- KanbanBoard agent-events && PYTHONPATH=src pytest tests/unit/test_agent_board_route.py tests/unit/test_codex_overflow.py -q` | +| **Full suite command** | `(cd web && npm exec vitest run src/components/agent-dashboard/KanbanBoard.test.tsx src/lib/agent-events/parsers.test.ts) && PYTHONPATH=src pytest tests/unit/test_agent_board_route.py tests/unit/test_codex_overflow.py -q` |

-- **After every plan wave:** Run `cd web && npm test -- KanbanBoard agent-events && PYTHONPATH=src pytest tests/unit/test_agent_board_route.py tests/unit/test_codex_overflow.py -q` +- **After every plan wave:** Run `(cd web && npm exec vitest run src/components/agent-dashboard/KanbanBoard.test.tsx src/lib/agent-events/parsers.test.ts) && PYTHONPATH=src pytest tests/unit/test_agent_board_route.py tests/unit/test_codex_overflow.py -q`

Also applies to: 31-31

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In @.planning/phases/10-agent-board-codex-bridge/10-VALIDATION.md at line 23, The full-suite command currently chains "cd web && npm test -- KanbanBoard agent-events && PYTHONPATH=src pytest tests/unit/..." which causes pytest to run from web/; change it so npm test runs inside web but pytest runs from the repo root, e.g. split the chain so "(cd web && npm test -- KanbanBoard agent-events) && PYTHONPATH=src pytest tests/unit/test_agent_board_route.py tests/unit/test_codex_overflow.py -q" (ensure the command string labeled "Full suite command" is updated accordingly).

coderabbitai · 2026-03-15T04:57:07Z

web/package.json

    "framer-motion": "^12.23.26",
    "lucide-react": "^0.562.0",
    "next": "16.1.0",
+    "next-auth": "5.0.0-beta.30",


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🌐 Web query:

next-auth 5.0 stable release status 2026

💡 Result:

As of March 15, 2026, NextAuth/Auth.js v5 has not had a “stable” (non-beta) 5.0.0 release—it’s still distributed via the next-auth@beta tag (the official v5 migration/upgrade guide still tells you to install v5 using next-auth@beta). [1]

The latest stable release line remains v4 (e.g., 4.24.13, dated Oct 29, 2025), while the latest v5 published version is still a beta (e.g., 5.0.0-beta.30, dated Oct 27, 2025). [2][3]

Use of next-auth beta version requires production readiness assessment.

Version 5.0.0-beta.30 remains in beta as of March 2026 with no stable v5.0.0 release available. The stable release line is still v4 (latest: 4.24.13). Using a beta authentication library poses security and stability risks for production. Consider either:

Migrating to NextAuth v4 for production stability, or

Documenting acceptance of v5 beta risks and planning migration timeline before production deployment.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/package.json` at line 46, The package.json currently pins "next-auth": "5.0.0-beta.30" which is a beta release; either revert to the stable v4 line or explicitly document acceptance of beta risk: update the dependency entry "next-auth" to a stable v4 version (e.g., "4.24.13" or a caret range like "^4.24.13") in package.json to ensure production stability, or if you intentionally keep v5 beta, add a clear note in the repo (README or DEPLOYMENT.md) referencing "next-auth": "5.0.0-beta.30" and an approved risk/mitigation plan and migration timeline before production rollout.

coderabbitai · 2026-03-15T04:57:07Z

web/src/app/api/_utils/backend-proxy.ts

+  try {
+    const upstream = await fetchUpstream(req, upstreamUrl, method, options)
+    const body = await upstream.arrayBuffer()
+    const headers = new Headers(options.responseHeaders)
+    headers.set("Cache-Control", "no-cache")
+    headers.set(
+      "Content-Type",
+      upstream.headers.get("content-type") || "application/octet-stream",
+    )
+
+    const contentDisposition = upstream.headers.get("content-disposition")
+    if (contentDisposition) {
+      headers.set("Content-Disposition", contentDisposition)
+    }
+
+    return new Response(body, {
+      status: upstream.status,
+      headers,
+    })


⚠️ Potential issue | 🟠 Major

Stream binary downloads instead of buffering the whole response.

arrayBuffer() forces the entire upstream payload into memory before the first byte is sent. That's a bad fit for export/download routes and also drops useful upstream metadata like length/range headers unless you copy them back manually.

Suggested fix

try { const upstream = await fetchUpstream(req, upstreamUrl, method, options) - const body = await upstream.arrayBuffer() - const headers = new Headers(options.responseHeaders) + const headers = new Headers(upstream.headers) + headers.delete("content-length") + headers.delete("content-encoding") + headers.delete("connection") + headers.delete("transfer-encoding") + for (const [key, value] of new Headers(options.responseHeaders)) { + headers.set(key, value) + } headers.set("Cache-Control", "no-cache") - headers.set( - "Content-Type", - upstream.headers.get("content-type") || "application/octet-stream", - ) - - const contentDisposition = upstream.headers.get("content-disposition") - if (contentDisposition) { - headers.set("Content-Disposition", contentDisposition) + if (!headers.has("Content-Type")) { + headers.set("Content-Type", "application/octet-stream") } - return new Response(body, { + return new Response(upstream.body, { status: upstream.status, headers, })

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

try {

const upstream = await fetchUpstream(req, upstreamUrl, method, options)

const body = await upstream.arrayBuffer()

const headers = new Headers(options.responseHeaders)

headers.set("Cache-Control", "no-cache")

headers.set(

"Content-Type",

upstream.headers.get("content-type") || "application/octet-stream",

)

const contentDisposition = upstream.headers.get("content-disposition")

if (contentDisposition) {

headers.set("Content-Disposition", contentDisposition)

}

return new Response(body, {

status: upstream.status,

headers,

})

try {

const upstream = await fetchUpstream(req, upstreamUrl, method, options)

const headers = new Headers(upstream.headers)

headers.delete("content-length")

headers.delete("content-encoding")

headers.delete("connection")

headers.delete("transfer-encoding")

for (const [key, value] of new Headers(options.responseHeaders)) {

headers.set(key, value)

}

headers.set("Cache-Control", "no-cache")

if (!headers.has("Content-Type")) {

headers.set("Content-Type", "application/octet-stream")

}

return new Response(upstream.body, {

status: upstream.status,

headers,

})

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/src/app/api/_utils/backend-proxy.ts` around lines 84 - 102, The handler currently buffers the entire upstream response via upstream.arrayBuffer() which uses excessive memory and drops streaming headers; change the return to stream the upstream.body instead: use upstream.body as the Response body (or a transformed ReadableStream if you need to inject behavior) and avoid arrayBuffer(); preserve and copy upstream headers like content-length, content-range, accept-ranges (in addition to content-disposition and content-type) into the Headers built from options.responseHeaders, still overriding Cache-Control to "no-cache"; update the code around fetchUpstream/const upstream to return new Response(upstream.body, { status: upstream.status, headers }) so downloads and range requests work and memory usage stays low.

coderabbitai · 2026-03-15T04:57:08Z

web/src/components/agent-events/AgentStatusPanel.tsx

+export function AgentStatusPanel({ compact = false }: { compact?: boolean }) {
+  const agentStatuses = useAgentEventStore((s) => s.agentStatuses)
+  const connected = useAgentEventStore((s) => s.connected)
+  const entries = Object.values(agentStatuses)
+
+  return (
+    <div className="flex flex-col gap-2 p-3">
+      {!compact && (
+        <div className="flex items-center justify-between">
+          <h3 className="text-sm font-semibold text-gray-200">Agent Status</h3>
+          <div className="flex items-center gap-1.5">
+            {connected ? (
+              <>
+                <Wifi size={12} className="text-green-400" />
+                <span className="text-xs text-green-400">Connected</span>
+              </>
+            ) : (
+              <>
+                <WifiOff size={12} className="text-amber-400" />
+                <span className="text-xs text-amber-400">Connecting...</span>
+              </>
+            )}
+          </div>
+        </div>
+      )}
+      {entries.length === 0 ? (
+        <div className="text-xs text-gray-500">No agents active</div>
+      ) : (
+        <div className={compact ? "grid grid-cols-1 gap-2" : "grid grid-cols-2 gap-2 sm:grid-cols-3 lg:grid-cols-4"}>
+          {entries.map((entry) => (
+            <AgentStatusBadge key={entry.agent_name} entry={entry} compact={compact} />
+          ))}
+        </div>
+      )}
+    </div>
+  )
+}


⚠️ Potential issue | 🟠 Major

Add tests for AgentStatusPanel rendering and state mapping.

This introduces meaningful UI behavior (status label/icon mapping, compact truncation, connection state, empty state), but no tests are included here to protect it from regressions.

As per coding guidelines If behavior changes, add or update tests.

🧰 Tools

🪛 GitHub Check: SonarCloud Code Analysis

[warning] 67-67: Mark the props of the component as read-only.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeTCuzevaEjDqEMH&open=AZzvzeTCuzevaEjDqEMH&pullRequest=395

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/src/components/agent-events/AgentStatusPanel.tsx` around lines 67 - 103, Add unit tests for the AgentStatusPanel React component to cover rendering and state mapping: mount AgentStatusPanel with mocked useAgentEventStore values to assert (1) empty state shows "No agents active", (2) when entries exist it renders an AgentStatusBadge per agent (keyed by agent_name) and grid classes change when compact=true (compact truncation), (3) connection indicator shows Wifi/WifiOff icon and "Connected"/"Connecting..." text based on connected boolean, and (4) agent status-to-label/icon mapping is passed through to AgentStatusBadge (mock or spy the component to verify props). Use the test renderer/RTL and mock useAgentEventStore to return controlled agentStatuses and connected states for each scenario.

coderabbitai · 2026-03-15T04:57:08Z

web/src/components/agent-events/ToolCallTimeline.tsx

+"use client"
+
+import { useAgentEventStore } from "@/lib/agent-events/store"
+import type { ToolCallEntry } from "@/lib/agent-events/types"
+
+function formatDuration(ms: number): string {
+  if (ms === 0) return "—"
+  if (ms >= 1000) return `${(ms / 1000).toFixed(1)}s`
+  return `${Math.round(ms)}ms`
+}
+
+function formatArgs(args: Record<string, unknown>): string {
+  const keys = Object.keys(args)
+  if (keys.length === 0) return "(no args)"
+  return keys.join(", ")
+}
+
+function truncate(text: string, max = 100): string {
+  if (text.length <= max) return text
+  return text.slice(0, max) + "…"
+}
+
+function formatTimestamp(ts: string): string {
+  try {
+    const d = new Date(ts)
+    return d.toTimeString().slice(0, 8)
+  } catch {
+    return ts.slice(0, 8)
+  }
+}
+
+function ToolCallRow({ entry }: { entry: ToolCallEntry }) {
+  const isError = entry.status === "error"
+
+  return (
+    <li className="flex items-start gap-3 py-2 border-b border-gray-800 last:border-0">
+      <div
+        className={`w-2 h-2 rounded-full mt-1.5 shrink-0 ${isError ? "bg-red-500" : "bg-green-500"}`}
+      />
+      <div className="flex-1 min-w-0">
+        <div className="flex items-center justify-between gap-2">
+          <span className="text-sm font-semibold text-gray-100 truncate">{entry.tool}</span>
+          <div className="flex items-center gap-2 shrink-0">
+            {isError && (
+              <span className="text-xs px-1.5 py-0.5 rounded bg-red-950/60 text-red-400 border border-red-800/50">
+                error
+              </span>
+            )}
+            <span className="text-xs text-gray-400 font-mono">{formatDuration(entry.duration_ms)}</span>
+            <span className="text-xs text-gray-600 font-mono">{formatTimestamp(entry.ts)}</span>
+          </div>
+        </div>
+        <div className="text-xs text-gray-500 mt-0.5">
+          <span className="text-gray-600">args: </span>
+          <span>{formatArgs(entry.arguments)}</span>
+        </div>
+        {entry.result_summary && !isError && (
+          <div className="text-xs text-gray-400 mt-0.5 truncate">
+            {truncate(entry.result_summary)}
+          </div>
+        )}
+        {isError && entry.error && (
+          <div className="text-xs text-red-400 mt-0.5 truncate">{entry.error}</div>
+        )}
+      </div>
+    </li>
+  )
+}
+
+export function ToolCallTimeline() {
+  const toolCalls = useAgentEventStore((s) => s.toolCalls)
+
+  return (
+    <div className="flex flex-col h-full">
+      <div className="flex items-center justify-between px-3 py-2 border-b border-gray-700">
+        <h3 className="text-sm font-semibold text-gray-200">Tool Calls</h3>
+        <span className="text-xs text-gray-500">{toolCalls.length} calls</span>
+      </div>
+      {toolCalls.length === 0 ? (
+        <div className="flex items-center justify-center h-20 text-gray-500 text-sm">
+          No tool calls yet
+        </div>
+      ) : (
+        <div className="overflow-y-auto flex-1">
+          <ul className="px-3 py-1">
+            {toolCalls.map((entry) => (
+              <ToolCallRow key={entry.id} entry={entry} />
+            ))}
+          </ul>
+        </div>
+      )}
+    </div>
+  )
+}


⚠️ Potential issue | 🟠 Major

Please add tests for this new component’s behavior branches.

I don’t see coverage here for duration formatting, invalid timestamp handling, args rendering, and success/error row variants. This should be locked down with a dedicated test file.

As per coding guidelines If behavior changes, add or update tests.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/src/components/agent-events/ToolCallTimeline.tsx` around lines 1 - 94, Add unit tests for the ToolCallTimeline/ToolCallRow behaviors: write tests that import ToolCallTimeline (or render ToolCallRow directly) and assert formatDuration outputs for 0ms, <1000ms, and >=1000ms; that formatTimestamp falls back gracefully for invalid ts values; that formatArgs renders "(no args)" for empty objects and lists keys for non-empty args; and that UI shows the success variant (result_summary visible, green dot) vs error variant (red dot, error badge and error text shown). Target the functions formatDuration, formatTimestamp, formatArgs, truncate and the ToolCallRow component so each branch (zero duration, ms vs s, invalid timestamp, empty vs non-empty args, success vs error) has a dedicated test assertion.

coderabbitai · 2026-03-15T04:57:08Z

web/src/lib/agent-events/parsers.ts

+function deriveHumanSummary(raw: AgentEventEnvelopeRaw): string {
+  const t = raw.type ?? ""
+  const payload = (raw.payload ?? {}) as Record<string, unknown>
+
+  if (t === "agent_started") return `${raw.agent_name} started: ${raw.stage}`
+  if (t === "agent_working") return `${raw.agent_name} working on: ${raw.stage}`
+  if (t === "agent_completed") return `${raw.agent_name} completed: ${raw.stage}`
+  if (t === "agent_error") return `${raw.agent_name} error: ${String(payload.detail ?? "")}`
+  if (t === "tool_result" || t === "tool_error" || t === "tool_call") {
+    const tool = String(payload.tool ?? t)
+    return `Tool: ${tool} — ${String(payload.result_summary ?? "").slice(0, 80)}`
+  }
+  if (t === "codex_dispatched") {
+    const assignee = String(payload.assignee ?? raw.agent_name ?? "Codex")
+    const title = String(payload.task_title ?? "")
+    return `Task dispatched to ${assignee}: ${title}`
+  }
+  if (t === "codex_accepted") {
+    const title = String(payload.task_title ?? "")
+    return `Codex accepted task: ${title}`
+  }
+  if (t === "codex_completed") {
+    const title = String(payload.task_title ?? "")
+    const files = Array.isArray(payload.files_generated) ? payload.files_generated.length : 0
+    return `Codex completed: ${title} (${files} files)`
+  }
+  if (t === "codex_failed") {
+    const title = String(payload.task_title ?? "")
+    const reason = String(payload.reason_code ?? "unknown")
+    return `Codex failed: ${title} (${reason})`
+  }
+  if (t === "job_start") return `Job started: ${raw.stage}`
+  if (t === "job_result") return `Job finished: ${raw.stage}`
+  if (t === "source_record") return `Source record: ${raw.workflow}/${raw.stage}`
+  if (t === "score_update") return `Score update from ${raw.agent_name}`
+  if (t === "insight") return `Insight from ${raw.agent_name}`
+  return `${t}: ${raw.agent_name ?? ""} / ${raw.stage ?? ""}`
+}


⚠️ Potential issue | 🟠 Major

deriveHumanSummary exceeds configured cognitive-complexity threshold.

This function is currently over the Sonar limit and is reported as a failing check. Refactor to a type→formatter map to keep the quality gate green.

♻️ Suggested refactor shape

+type SummaryFormatter = ( + raw: AgentEventEnvelopeRaw, + payload: Record<string, unknown> +) => string + +const SUMMARY_FORMATTERS: Record<string, SummaryFormatter> = { + agent_started: (raw) => `${raw.agent_name} started: ${raw.stage}`, + agent_working: (raw) => `${raw.agent_name} working on: ${raw.stage}`, + agent_completed: (raw) => `${raw.agent_name} completed: ${raw.stage}`, + agent_error: (raw, payload) => `${raw.agent_name} error: ${String(payload.detail ?? "")}`, + // ...codex/tool/job mappings... +} + function deriveHumanSummary(raw: AgentEventEnvelopeRaw): string { const t = raw.type ?? "" const payload = (raw.payload ?? {}) as Record<string, unknown> - - if (t === "agent_started") return `${raw.agent_name} started: ${raw.stage}` - if (t === "agent_working") return `${raw.agent_name} working on: ${raw.stage}` - ... + const formatter = SUMMARY_FORMATTERS[t] + if (formatter) return formatter(raw, payload) return `${t}: ${raw.agent_name ?? ""} / ${raw.stage ?? ""}` }

🧰 Tools

🪛 GitHub Check: SonarCloud Code Analysis

[warning] 48-48: 'payload.task_title ?? ""' will use Object's default stringification format ('[object Object]') when stringified.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMa&open=AZzvzeUSuzevaEjDqEMa&pullRequest=395

[warning] 37-37: 'payload.detail ?? ""' will use Object's default stringification format ('[object Object]') when stringified.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMV&open=AZzvzeUSuzevaEjDqEMV&pullRequest=395

[warning] 39-39: 'payload.tool ?? t' will use Object's default stringification format ('[object Object]') when stringified.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMW&open=AZzvzeUSuzevaEjDqEMW&pullRequest=395

[warning] 40-40: 'payload.result_summary ?? ""' will use Object's default stringification format ('[object Object]') when stringified.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMX&open=AZzvzeUSuzevaEjDqEMX&pullRequest=395

[warning] 58-58: 'payload.reason_code ?? "unknown"' will use Object's default stringification format ('[object Object]') when stringified.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMd&open=AZzvzeUSuzevaEjDqEMd&pullRequest=395

[warning] 52-52: 'payload.task_title ?? ""' will use Object's default stringification format ('[object Object]') when stringified.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMb&open=AZzvzeUSuzevaEjDqEMb&pullRequest=395

[failure] 30-30: Refactor this function to reduce its Cognitive Complexity from 16 to the 15 allowed.

See more on https://sonarcloud.io/project/issues?id=jerry609_PaperBot&issues=AZzvzeUSuzevaEjDqEMT&open=AZzvzeUSuzevaEjDqEMT&pullRequest=395

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/src/lib/agent-events/parsers.ts` around lines 30 - 67, The deriveHumanSummary function is too complex; replace the big if/else chain with a map from event type string to formatter functions: create a Record<string, (raw: AgentEventEnvelopeRaw, payload: Record<string,unknown>) => string> where each key (e.g., "agent_started", "tool_result", "codex_completed", "codex_failed", "job_start", "source_record", etc.) implements the corresponding logic currently in deriveHumanSummary (use raw.agent_name, raw.stage, payload fields like payload.tool, payload.result_summary, payload.task_title, payload.files_generated, payload.reason_code, etc.), preserve existing behaviors (slice result_summary to 80 chars for tool_* types, default assignee fallback for codex_dispatched), and use a default formatter for unknown types that returns `${t}: ${raw.agent_name ?? ""} / ${raw.stage ?? ""}`; then have deriveHumanSummary simply look up the formatter by t (with payload typed as Record<string, unknown>) and return the result.

coderabbitai · 2026-03-15T04:57:08Z

web/src/lib/agent-events/useAgentEvents.ts

+import { parseActivityItem, parseAgentStatus, parseToolCall, parseFileTouched, parseCodexDelegation } from "./parsers"
+import type { AgentEventEnvelopeRaw } from "./types"
+
+const BACKEND_URL = process.env.NEXT_PUBLIC_API_URL ?? "http://localhost:8000"


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

cat -n web/src/lib/agent-events/useAgentEvents.ts | head -70

Repository: jerry609/PaperBot

Length of output: 2592

🏁 Script executed:

find web/src -name "*.ts" -o -name "*.tsx" | head -20

Repository: jerry609/PaperBot

Length of output: 903

🏁 Script executed:

rg "NEXT_PUBLIC_API_URL" web/src --type ts --type tsx -B 2 -A 2

Repository: jerry609/PaperBot

Length of output: 88

🏁 Script executed:

rg "/api/" web/src --type ts --type tsx -l | head -10

Repository: jerry609/PaperBot

Length of output: 88

🏁 Script executed:

cat web/src/app/api/_utils/backend-proxy.ts

Repository: jerry609/PaperBot

Length of output: 6600

🏁 Script executed:

cat web/src/app/api/_utils/auth-headers.ts

Repository: jerry609/PaperBot

Length of output: 1157

🏁 Script executed:

rg "fetch.*api/" web/src --type ts -B 2 -A 2 | head -50

Repository: jerry609/PaperBot

Length of output: 3783

🏁 Script executed:

find web/src -path "*/test*" -o -path "*.test.*" | head -10

Repository: jerry609/PaperBot

Length of output: 529

🏁 Script executed:

find web/src/app/api -name "*event*" -o -name "*stream*"

Repository: jerry609/PaperBot

Length of output: 43

🏁 Script executed:

cat web/src/lib/agent-events/parsers.test.ts

Repository: jerry609/PaperBot

Length of output: 11930

🏁 Script executed:

rg "events/stream" web/src --type ts

Repository: jerry609/PaperBot

Length of output: 296

Direct backend fetch from browser cannot authenticate and violates the established proxy pattern.

The hook calls fetch(${BACKEND_URL}/api/events/stream) directly from the browser, which bypasses the server-side auth layer. NextAuth tokens are available only on the server via the auth() function—the browser cannot access them. This fetch will fail authentication in any backend that requires a bearer token. Additionally, the http://localhost:8000 fallback on line 9 will route real users to their own localhost if the environment is missing.

Create a same-origin /api/events/stream route instead, using the proxyStream() utility with auth: true from backend-proxy.ts. This follows the established pattern where browser requests go through Next.js API routes, which inject credentials server-side before proxying to the backend.

While addressing this: note that the current retry logic (lines 47–50) only retries on network errors (caught exception). HTTP error responses like 401, 403, or 5xx return silently without reconnection, leaving the hook offline until reload.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/src/lib/agent-events/useAgentEvents.ts` at line 9, The hook is calling fetch(`${BACKEND_URL}/api/events/stream`) from the browser (BACKEND_URL constant) which bypasses server-side auth and uses a dangerous localhost fallback; instead create a same-origin Next.js API route at /api/events/stream that uses proxyStream({ auth: true }) from backend-proxy.ts, then change useAgentEvents to fetch '/api/events/stream' (remove BACKEND_URL fallback). Also update the reconnection logic in useAgentEvents so non-2xx HTTP responses (401/403/5xx) are treated as errors that trigger the retry path (not just network exceptions), ensuring the hook attempts reconnects rather than silently staying offline.

coderabbitai · 2026-03-15T04:57:08Z

web/src/lib/agent-events/useAgentEvents.ts

+        const res = await fetch(`${BACKEND_URL}/api/events/stream`, {
+          signal: controller.signal,
+          headers: { Accept: "text/event-stream" },
+        })
+        if (!res.ok || !res.body) return


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

cat -n web/src/lib/agent-events/useAgentEvents.ts

Repository: jerry609/PaperBot

Length of output: 2592

Missing retry logic for HTTP failures and clean stream closes.

fetch only throws on network or abort failures. A non-OK response at line 25 returns early, and a normal completion of the SSE loop at lines 28-46 also bypasses the error handler. This means a 401/503 or server-side stream closure leaves the dashboard permanently offline until reload. The retry logic at line 50 only handles exceptions, not these cases.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@web/src/lib/agent-events/useAgentEvents.ts` around lines 21 - 25, The SSE connection abandons retries when fetch returns non-OK or when the stream cleanly ends; update useAgentEvents.ts so the code treats non-OK responses and normal stream completion as errors that trigger the existing retry/backoff logic: after fetch(`${BACKEND_URL}/api/events/stream`, { signal: controller.signal, ... }) if (!res.ok || !res.body) throw a descriptive Error (include status/text) instead of returning, and inside the SSE reading loop detect EOF/closed stream and throw or reject (unless controller.signal.aborted) so the outer catch/retry path at the retry/backoff handler runs; keep the abort behavior intact so controller.signal.abort still stops retries.

Copilot AI review requested due to automatic review settings March 15, 2026 04:41

Copilot started reviewing on behalf of jerry609 March 15, 2026 04:41 View session

vercel bot deployed to Preview March 15, 2026 04:42 View deployment

gemini-code-assist bot reviewed Mar 15, 2026

View reviewed changes

Copilot AI reviewed Mar 15, 2026

View reviewed changes

refactor: unify next api backend proxies

0450c4d

jerry609 force-pushed the refactor/pr1-next-api-proxy branch from f899732 to 0450c4d Compare March 15, 2026 04:52

vercel bot deployed to Preview March 15, 2026 04:54 View deployment

coderabbitai bot reviewed Mar 15, 2026

View reviewed changes

		except Exception as exc: # pragma: no cover
		log.debug("_emit_codex_event failed silently: %s", exc)

		1. Create web/src/lib/agent-events/useAgentEvents.ts — SSE consumer hook as specified in 08-RESEARCH.md Pattern 5. "use client" at top. Connect to `${BACKEND_URL}/api/events/stream` via fetch + readSSE(). For each incoming message: call parseActivityItem/parseAgentStatus/parseToolCall and dispatch to store. AbortController for cleanup. 3-second reconnect on error (not on AbortError). BACKEND_URL from `process.env.NEXT_PUBLIC_API_URL ?? "http://localhost:8000"`.

		IMPORTANT: readSSE() yields SSEMessage objects, but the SSE endpoint sends raw AgentEventEnvelope dicts as JSON. The readSSE() parser returns whatever JSON.parse produces. Since the events endpoint sends `data: {envelope_dict}\n\n`, readSSE() will yield the parsed envelope dict directly. Cast `msg as AgentEventEnvelopeRaw` and check for `raw?.type` before dispatching.

		- Clicking a run button calls `setSelectedRunId(id)`. Clicking the same run again deselects (passes `null`)
		- Highlight the selected run with `bg-accent` class

Conversation

jerry609 commented Mar 15, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changed

Why

Validation

Notes

Summary by CodeRabbit

Release Notes

Uh oh!

vercel bot commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai bot commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

github-actions bot commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Vercel Preview

Uh oh!

gemini-code-assist bot commented Mar 15, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

sonarqubecloud bot commented Mar 15, 2026

Quality Gate passed

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 15, 2026

Choose a reason for hiding this comment

Uh oh!

jerry609 commented Mar 15, 2026 •

edited by coderabbitai bot

Loading

vercel bot commented Mar 15, 2026 •

edited

Loading

coderabbitai bot commented Mar 15, 2026 •

edited

Loading

github-actions bot commented Mar 15, 2026 •

edited

Loading