feat(web): pipeline filters, drift KPIs, and trace prompt inspector by cursor[bot] · Pull Request #37 · eamonboyle/llm-debate-engine

cursor · 2026-06-12T08:07:58Z

Summary

Adds four product improvements to the LLM debate research dashboard, grounded in existing patterns and data already present in run artifacts.

Features

1. Pipeline insight filters (Agents + Timing)

/agents and /timing now use InsightFilterCard with the same search/model/preset/fast/date filters as other insight pages
Stats and timing tables respect the filtered run subset, with clear empty states when filters match nothing

2. Preset leaderboard filter parity

/presets upgraded from fast-mode-only filtering to full InsightFilterCard support via applyIndexFilters
"Filter runs" links preserve active filter context (matching model leaderboard behavior)

3. Overview confidence drift KPIs

Overview dashboard now shows all three aggregate confidence drift means: solver→revision, revision→synth, and calibrated−synth
Reorganizes KPI rows to avoid duplication and surface evidence-plan risk alongside drift metrics

4. Trace prompt & retry inspector

Run trace steps expose collapsible LLM request (prompt) and Parse retries panels when artifact steps include request and rawAttempts
Human-readable summary of model, temperature, schema, and message previews before raw JSON

Testing

pnpm test — 172 tests passed
pnpm typecheck — passed
pnpm web:typecheck — passed
pnpm web:build — passed
pnpm format:check — passed

Manual verification

Visit /agents or /timing and apply model/preset filters — table counts should update
Visit /presets and compare filter behavior with /leaderboard
Visit / with analysis index loaded — confirm three drift KPI cards
Open any run trace (/runs/[id]) and expand "LLM request (prompt)" on agent steps

- Add InsightFilterCard to agent stats and pipeline timing pages - Give preset leaderboard full filter parity with model leaderboard - Show all three confidence drift means on the overview dashboard - Expose LLM request prompts and parse retries in run trace steps Co-authored-by: Eamon Boyle <eamonboyle@users.noreply.github.com>

vercel · 2026-06-12T08:08:00Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
llm-debate-research	Ready	Preview, Comment	Jun 12, 2026 8:08am

vercel Bot deployed to Preview June 12, 2026 08:08 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(web): pipeline filters, drift KPIs, and trace prompt inspector#37

feat(web): pipeline filters, drift KPIs, and trace prompt inspector#37
cursor[bot] wants to merge 1 commit into
mainfrom
cursor/product-feature-opportunities-1538

cursor Bot commented Jun 12, 2026

Uh oh!

vercel Bot commented Jun 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

cursor Bot commented Jun 12, 2026

Summary

Features

1. Pipeline insight filters (Agents + Timing)

2. Preset leaderboard filter parity

3. Overview confidence drift KPIs

4. Trace prompt & retry inspector

Testing

Manual verification

Uh oh!

vercel Bot commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented Jun 12, 2026 •

edited

Loading