Skip to content

v0.8.41 — hostability, orientation cache, hardening, cockpit UX, tool-calling accuracy#1875

Draft
Hmbown wants to merge 2 commits into
mainfrom
work/v0.8.41
Draft

v0.8.41 — hostability, orientation cache, hardening, cockpit UX, tool-calling accuracy#1875
Hmbown wants to merge 2 commits into
mainfrom
work/v0.8.41

Conversation

@Hmbown
Copy link
Copy Markdown
Owner

@Hmbown Hmbown commented May 21, 2026

v0.8.41 umbrella

Closes #1849.

Tracks

  • Hostability — Tencent Lighthouse + Feishu/Lark smoke + docs
  • Orientation cache — native PEEK/Aleph-style context, no MCP dep
  • Hardening tail — Windows containment, sync tool cancel, sub-agent/RLM output reliability, doctor-noise cleanup
  • Cockpit UX — sidebar + presentation of state so the user feels like they're driving
  • Tool-calling spike — strict mode on /beta, prefix-as-repair, prompt-rhythm experiment

Status

Work in progress. Individual slices land here as they complete.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the CHANGELOG.md for version 0.8.41, adding entries for new features like hostability improvements, orientation caching, and UX enhancements. The reviewer identified several inaccuracies in the changelog, including incorrect UI panel names, inconsistent environment variable prefixes, and the need to categorize bug fixes under a 'Fixed' section rather than 'Added'.

Comment thread CHANGELOG.md
Comment on lines +10 to +21
### Added

- **Hostability.** Tencent Lighthouse deployment flow, Feishu/Lark smoke-tested integration,
and companion documentation for self-host and cloud install paths.
- **Orientation cache.** Native PEEK/Aleph-style workspace context injection — no MCP
dependency — giving the model a persistent, low-latency map of the project.
- **Hardening.** Fixes for macOS file writes, Windows input/composer edge cases, sync
tool-cancel reliability, sub-agent/RLM output robustness, and doctor-noise cleanup.
- **Cockpit UX.** Sidebar and presentation panels (Now, Recent, Pending, Budget, Levers)
so the user feels like they're driving the agent, not watching from the back seat.
- **Tool-calling accuracy.** Strict-mode (`/beta`) tool schema classification and
adaptation behind `DS_TUI_STRICT_TOOLS`, measured on `deepseek eval`.
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The changelog entry for v0.8.41 should be refined for accuracy and consistency with the project's standards:

  • Section Grouping: The "Hardening" bullet describes bug fixes (macOS file writes, Windows edge cases, etc.). Following the "Keep a Changelog" format used in previous versions (e.g., line 52), these should be moved to a ### Fixed section.
  • Panel Name Accuracy: The "Cockpit UX" entry mentions panels named "Now, Recent, Pending, Budget, Levers". However, the implementation in crates/tui/src/tui/sidebar.rs (lines 48-53) uses "Work, Tasks, Agents, Context". The changelog should use the actual UI labels.
  • Environment Variable Wiring: The entry mentions DS_TUI_STRICT_TOOLS. However, this variable is not wired up in crates/tui/src/config.rs, and the DS_ prefix is inconsistent with the DEEPSEEK_ prefix used for all other environment variables in the project. Also, the PR description mentions "Windows containment" which is missing from the changelog entry.
  • Formatting: Commands and environment variables like DS_TUI_STRICT_TOOLS and deepseek eval should be wrapped in backticks for consistency with previous entries.
Suggested change
### Added
- **Hostability.** Tencent Lighthouse deployment flow, Feishu/Lark smoke-tested integration,
and companion documentation for self-host and cloud install paths.
- **Orientation cache.** Native PEEK/Aleph-style workspace context injection — no MCP
dependency — giving the model a persistent, low-latency map of the project.
- **Hardening.** Fixes for macOS file writes, Windows input/composer edge cases, sync
tool-cancel reliability, sub-agent/RLM output robustness, and doctor-noise cleanup.
- **Cockpit UX.** Sidebar and presentation panels (Now, Recent, Pending, Budget, Levers)
so the user feels like they're driving the agent, not watching from the back seat.
- **Tool-calling accuracy.** Strict-mode (`/beta`) tool schema classification and
adaptation behind `DS_TUI_STRICT_TOOLS`, measured on `deepseek eval`.
### Added
- **Hostability.** Tencent Lighthouse deployment flow, Feishu/Lark smoke-tested integration,
and companion documentation for self-host and cloud install paths.
- **Orientation cache.** Native PEEK/Aleph-style workspace context injection — no MCP
dependency — giving the model a persistent, low-latency map of the project.
- **Cockpit UX.** Sidebar and presentation panels (Work, Tasks, Agents, Context)
so the user feels like they're driving the agent, not watching from the back seat.
- **Tool-calling accuracy.** Strict-mode (`/beta`) tool schema classification and
adaptation behind `DS_TUI_STRICT_TOOLS`, measured on `deepseek eval`.
### Fixed
- **Hardening.** Fixes for macOS file writes, Windows input/composer edge cases, sync
tool-cancel reliability, sub-agent/RLM output robustness, and doctor-noise cleanup.

@Hmbown Hmbown added this to the v0.8.41 milestone May 21, 2026
@Hmbown
Copy link
Copy Markdown
Owner Author

Hmbown commented May 21, 2026

Maintainer review note from the v0.8.41 cockpit pass:

Current local check on work/v0.8.41 fails:

cargo check --locked -p deepseek-tui
error[E0004]: non-exhaustive patterns: `SidebarFocus::Now` and `SidebarFocus::Budget` not covered
  - crates/tui/src/config_ui.rs:972
  - crates/tui/src/tui/sidebar.rs:48

The deeper issue is architectural, not just adding two match arms. The cockpit patch currently adds Now/Budget as new sidebar focus variants and separate render functions, but those functions duplicate existing sidebar/session signal sources and recompute data less accurately. Before this lands:

  • Budget should reuse the existing context/cost source path from the Session/Context panel: session.total_conversation_tokens, model context window, compact_threshold, auto_compact, displayed_session_cost_for_currency(app.cost_currency), and app.format_cost_amount(...). No hardcoded $, no message-count token estimate.
  • Now should reuse or extract from task_panel_lines, active_tool_rows, recent_tool_rows, and subagent_panel_lines; it should not re-derive tool/subagent status from raw history with a weaker model.
  • If SidebarFocus::Now / Budget remain, they must be fully threaded through config UI, settings normalization, docs, tests, focus cycling, and render_sidebar. If they are only cockpit projections, prefer not adding new persisted focus variants yet.
  • Auto mode still renders Work/Tasks/Agents/Context, so the cockpit implementation needs to decide whether cockpit is the new auto stack or only focused detail panels.

Suggested acceptance before v0.8.41 merges: one source of truth per datum, cockpit as projection/orientation cache, cargo check --locked -p deepseek-tui green, and focused tests for currency formatting plus the /compact hint.

@Hmbown
Copy link
Copy Markdown
Owner Author

Hmbown commented May 23, 2026

v0.8.41 overnight follow-up summary:

Opened 6 review-only PRs from tonight:

Community PR harvest verdicts verified locally:

No merges, tags, publishes, force-pushes, or issue closures performed.

@Hmbown
Copy link
Copy Markdown
Owner Author

Hmbown commented May 24, 2026

v0.8.42 triage note: I did not find remaining unique release-blocker scope here for the current local stabilization branch. The actionable v0.8.41 slices were already harvested or routed in later PRs/issues, and v0.8.42 now has its own release-prep branch with changelog/version bumps.

Recommendation after v0.8.42 lands: close this draft as superseded by the harvested PRs and the current v0.8.42/v0.8.43+ milestone split, unless Hunter wants to preserve it as an archival umbrella.

@Hmbown Hmbown modified the milestones: v0.8.41, v0.8.43 May 24, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

v0.8.41 tracker: hostability, long-session hardening, and orientation cache

1 participant