Skip to content
Open
No due date
Last updated May 27, 2026

Make documents, images, search, execution, previews, dependency probes, and safe approvals feel native to the workbench instead of like loose subprocesses. Tools become first-class workbench objects with inspectors, properties, previews, and copy/select. Preview verifier-native completion hooks behind an opt-in flag so the v0.9.0 surface lands with field experience accumulated.

In scope

  • Tool inspectors. Every tool call has a properties pane: inputs, outputs, duration, exit, environment, sandbox decision. Tool results render in a preview panel (image, table, diff, doc) rather than as raw text.
  • Copy / select mode. First-class text selection inside the workbench (transcript, properties, previews) without breaking terminal-line semantics. The terminal-copy-includes-wrapped-lines bug (#1853) gets fixed structurally.
  • Provider / model diagnostics. Inspectable view of provider state: active provider + model, last-call latency, last error, prompt-cache hit rate. Surfaces from doctor + /provider slash command + properties pane. When the active provider is Hugging Face Inference Providers (via the v0.8.47 first-class integration when that lands first, or via the OpenAI-compatible path in the interim), the pane previews model passport metadata pulled from the HF Hub: license, base model, context length, chat template, tool-call support, reasoning support, gated / private status.
  • Tool surface polish. PDF / image / search / exec ergonomics; dependency probe + skip-on-miss (existing pattern); approval-flow clarity. Ship the small, independent RuntimeTool Windows .cmd fallback as a standalone PR.
  • Plugin tool design RFC. Open docs/proposals/plugin-tools.md defining how // approval: auto frontmatter interacts with the execution modes (Plan / Agent / YOLO), the approval policy, trust, and --model auto model routing. Implementation deferred until design lands.
  • Verifier preview (general). Opt-in via [verifier] enabled = true. On claim-of-done, auto-spawn a verifier sub-agent with fresh context and read-only tools, time-boxed. Verdict-as-gate: pass / partial / fail. /force-complete override is audit-logged. /tasks shows a "verified" badge. Extends the Goal-mode-specific Fin wakeup from v0.8.43 to the general claim-of-done case.

Out of scope

  • Full ExternalTool / RuntimeTool 65-call-site migration (defer; ship per-tool migration PRs in v0.8.47).
  • Plugin tool runtime implementation (gated on the RFC).
  • Verifier as default (preview only — explicit opt-in).
  • New execution modes — Plan / Agent / YOLO remain the three modes.
  • Hugging Face Workset implementation (Hub registry / datasets / adapters / Jobs — see #1977). v0.8.46 only previews HF model-passport metadata in the provider pane; the workset surface ships post-v0.9.0.

Definition of done

  • Every tool surfaces an inspectable properties pane and a result preview.
  • Copy / select mode ships with terminal-line semantics preserved.
  • /provider slash command and provider properties pane in place; HF model passport preview works against at least three model IDs (Qwen/*, deepseek-ai/*, meta-llama/*).
  • Verifier preview enables via config; verdict appears on completed tasks; /force-complete writes to audit log.
  • Plugin tool design RFC lands as draft; one round of maintainer review captured in PR.

Release gate

  • Parity gates green.
  • CHANGELOG.md [0.8.46] entry marks verifier preview as experimental and tool inspectors as the headline.
  • docs/verifier-preview.md explains opt-in + override semantics.
93% complete

List view