feat: Guardian Command — Phase 2 multi-agent platform rewrite by rz1989s · Pull Request #70 · sip-protocol/sipher

rz1989s · 2026-04-09T08:45:55Z

Summary

Full rewrite of Sipher from a hand-rolled REST API + chat agent to a Pi SDK-native multi-agent platform with a world-class frontend.

Plan A: Infrastructure + Pi Migration (16 tasks)

Pi SDK (pi-agent-core + pi-ai) replaces Anthropic SDK
AgentPool (multi-tenant, 30min idle eviction)
Dynamic tool loading (21 tools in 4 groups + routeIntent)
EventBus + ActivityLogger (typed events, level filtering)
Wallet auth (nonce → JWT), SSE activity stream
6 new DB tables, 10 new API routes
COURIER formalized with EventBus events

Plan B: HERALD — X Agent (11 tasks)

9 X API tools (readMentions, readDMs, postTweet, replyTweet, likeTweet, searchPosts, readUserProfile, sendDM, schedulePost)
Budget tracker with circuit breaker ($150/mo cap, 4 gates)
Intent classifier (command/question/engagement/spam)
Post approval queue (auto-approve after 30min)
Adaptive poller (10min default, backoff after 3 empty polls)

Plan C: SENTINEL — Blockchain Monitor (6 tasks)

Scanner: vault state, stealth payments, balance changes
Detector: 6 event types (unclaimed, expired, threat, large transfer, balance, RPC error)
Refund guard: threshold check, double-processing prevention, idempotency
Adaptive worker: 60s idle, 15s active, exponential backoff on RPC errors

Plan D: Guardian Command UI (10 tasks)

AI Designer-generated designs (4 approved mockups)
Activity stream (SSE real-time feed)
Command bar (bottom sheet chat, Cmd+K, confirmation cards)
Vault view (balance, deposit/withdraw, pending ops, history)
HERALD view (budget bar, activity timeline, approval queue, DMs)
Squad view (agent status grid, stats, coordination log, kill switch)
Tailwind CSS 4, dark design system, mobile-first

Stats

45 commits, 588 backend tests, ~160 new tests
Frontend: 27KB CSS + 625KB JS (191KB gzip)
Spec: docs/superpowers/specs/2026-04-09-sipher-phase2-guardian-command-design.md

Test plan

Backend: pnpm test -- --run (588 passing)
Frontend: cd app && pnpm build (builds clean)
Visual QA: open app/designs/*.html for design reference
API routes: verify /api/stream, /api/command, /api/vault, /api/squad, /api/herald
Full integration: start backend + frontend, connect wallet, test activity stream

…and threat checking

…in stats Add getPaymentLinksBySession, expireStaleLinks, getPaymentLinkStats, getAuditStats, and getSessionStats to db.ts with full test coverage (15 new tests, 240 passing). Sort by rowid DESC as tie-breaker for stable ordering when created_at timestamps collide within same tick.

…screening

…data Implements the invoice tool with required amount, 7-day default expiry, and invoice_meta JSON (description, dueDate, reference). Reuses payment_links table with type='invoice'. 8 tests covering full metadata, DB storage, validation, and expiry defaults.

Express router for payment link pages with dark-theme Tailwind HTML templates, XSS-safe escaping, open-amount support, expiry/paid/404 states, and confirm endpoint.

…ls + /pay and /admin routes Registers 4 new tools in TOOLS array, TOOL_EXECUTORS, and SYSTEM_PROMPT. Mounts /pay and /admin route groups in index.ts with stale link expiry on the 5-minute purge interval. Updates tools.test.ts to assert 14 tools and stream.test.ts mock to include all new exports.

…kie, tx sig validation, devnet→env

Pure-math synchronous tool that floors amounts to common denominations (10, 50, 100, 500, 1000, 5000, 10000) to reduce amount correlation in privacy-preserving transactions. Remainder stays in vault.

Implements crankTick with expiry, miss-window, max_exec completion, recurring re-schedule via intervalMs, and per-op error isolation. 8 tests covering all branches (executed, expired, missed, failed, skipped).

Creates a single scheduled_op with action='send' and max_exec=1. Supports exact delay, random range, or default 30-60 min. Expiry set to scheduled time + 1 hour. 8 tests, all passing.

…ed timing

…otonic delays)

… tools)

Add activity_stream, herald_queue, herald_dms, execution_links, cost_log, and agent_events tables with indexes for Phase 2 Guardian/Command layer. Exports: insertActivity, getActivity, dismissActivity, logCost, getCostTotals, logAgentEvent, getAgentEvents, createExecutionLink, getExecutionLink, updateExecutionLink — all ULID-keyed, ISO 8601 timestamps. TDD: 34 tests in db-schema.test.ts, all passing, no regressions.

…ding

…ool routing

…tion (closes #72)

, closes #74, closes #75)

…loses #77)

…nt backoff (closes #78)

…ls (closes #81, closes #82, closes #83)

Root tests/coordination/event-bus.test.ts was a duplicate of packages/agent/tests/coordination/event-bus.test.ts (identical logic, different import path). Deleted the root copy. Root tests/pi/tool-adapter.test.ts had no counterpart in packages/agent — moved it to packages/agent/tests/pi/tool-adapter.test.ts with corrected import paths so it runs with the agent package test suite where it belongs.

…devnet (closes #85) Scanner was hardcoded to createConnection('devnet') which would connect to devnet in production. Now reads SOLANA_NETWORK env var, defaulting to mainnet-beta for production safety. Devnet must be explicitly opted into via SOLANA_NETWORK=devnet.

#86) getReadyToPublish() had a TOCTOU race: it queried pending posts then updated them individually, allowing concurrent callers to pick up the same posts. Now wraps the select+update in db.transaction() and adds a CAS guard (AND status = 'pending') on the UPDATE WHERE clause so only the first caller wins the update.

…loses #87) VaultView always rendered MOCK_BALANCE, MOCK_USD, MOCK_FEES, and MOCK_PENDING regardless of API response. Now derives balance, usd, fees, and pendingOps from data when available, falling back to mocks only when data is null (loading or not connected). Extended VaultData interface with optional balance/usd/fees/pending_ops fields and added TODO noting the vault API should be extended to return these fields.

…d handler, type mismatches, intent classifier (closes #88, closes #89, closes #90, closes #91, closes #92)

…loses #93)

#94)

Store originating wallet in pending map and validate req.wallet matches before resolving. Prevents unauthorized confirmation of fund-moving operations by other authenticated wallets.

…96, closes #103)

likeTweet, replyTweet, sendDM, and publishTweet now check the budget gate before executing. Prevents overspending when gate is dm-only or paused.

)

…#105)

…prompt, confirm cap, admin tokens (closes #106, closes #107, closes #108, closes #109, closes #110)

#113, closes #114, closes #116) - Replace `as any` with `as Tool['parameters']` across all 9 HERALD tool files - Add message length validation (max 4000 chars) on /api/command - Add per-IP rate limiter on /api/auth/verify (10 req/min) to prevent ed25519 CPU amplification

…loses #111, closes #112, closes #115) - Add error state and banner to StreamView and VaultView when API fetch fails - Replace hardcoded mock balance/pending data with placeholders when API has no real data - Convert useSSE connected tracking from ref to useState for proper re-renders

…client cache (closes #117, closes #118, closes #121, closes #122)

…124)

rz1989s added 30 commits April 9, 2026 00:24

feat(agent): add bundled known-addresses dataset for privacy scoring …

b58c74c

…and threat checking

feat(agent): add paymentLink tool — one-time stealth receive URLs

b8187c6

feat(agent): add privacyScore tool — wallet exposure analysis (0-100)

a8cfea4

feat(agent): add threatCheck tool — OFAC, exchange, and scam address …

6ac3a3e

…screening

feat(agent): add /pay/:id route with server-rendered payment pages

7d3cfb7

Express router for payment link pages with dark-theme Tailwind HTML templates, XSS-safe escaping, open-amount support, expiry/paid/404 states, and confirm endpoint.

feat(agent): add admin dashboard with auth, stats, and HTML templates

665cd8f

fix(agent): address code review — expiry check on confirm, Secure coo…

1957958

…kie, tx sig validation, devnet→env

feat(agent): add scheduled ops CRUD helpers for crank engine

f101209

feat(agent): add roundAmount tool — denomination rounding for privacy

7c83ae8

Pure-math synchronous tool that floors amounts to common denominations (10, 50, 100, 500, 1000, 5000, 10000) to reduce amount correlation in privacy-preserving transactions. Remainder stays in vault.

feat(agent): add crank engine — 60s scheduled ops executor

290f03a

Implements crankTick with expiry, miss-window, max_exec completion, recurring re-schedule via intervalMs, and per-op error isolation. 8 tests covering all branches (executed, expired, missed, failed, skipped).

feat(agent): add scheduleSend tool — delayed private sends

b795761

Creates a single scheduled_op with action='send' and max_exec=1. Supports exact delay, random range, or default 30-60 min. Expiry set to scheduled time + 1 hour. 8 tests, all passing.

feat(agent): add splitSend tool — random chunk splitting with stagger…

9a54206

…ed timing

feat(agent): add drip tool — DCA-style private distribution

b9ab263

feat(agent): add recurring tool — repeating private payments with jitter

c1a8511

feat(agent): add sweep tool — auto-shield incoming wallet funds

4716b04

feat(agent): add consolidate tool — staggered stealth balance merging

058fce9

fix(agent): fix flaky timing tests in consolidate and split-send (mon…

8db2b26

…otonic delays)

feat(agent): wire 7 time-based privacy tools + crank engine (21 total…

faadbd1

… tools)

chore: swap anthropic sdk for pi-agent-core + pi-ai + infra deps

214f066

feat: add tool schema adapter — Anthropic format to Pi AI format

bc1ecec

feat: add EventBus — typed agent coordination layer

0144a5d

feat: add Pi AI provider config — OpenRouter for SIPHER + HERALD

1e5e74d

feat: add tool groups + routeIntent meta-tool — 4 groups, dynamic loa…

3693882

…ding

feat: add AgentPool — multi-tenant agent lifecycle with eviction

305c719

feat: add SIPHER Pi agent factory — system prompt, fund-moving set, t…

34eb799

…ool routing

fix: move service-sipher test to packages/agent/tests/

1f36d79

rz1989s added 26 commits April 9, 2026 16:06

fix: verify ed25519 wallet signature before JWT issuance (closes #71)

1f8d544

fix: add column allowlist to updateExecutionLink to prevent SQL injec…

512d84d

…tion (closes #72)

fix: add JWT auth to squad, herald, chat, and tool endpoints (closes #73

6820fce

, closes #74, closes #75)

fix: add /api/activity route and extend SSE event types (closes #76, c…

9e0e3a6

…loses #77)

fix: rewrite herald poller to recursive setTimeout to prevent permane…

58f7c9b

…nt backoff (closes #78)

fix: add .unref() to session purge and crank timers (closes #79)

8fef5ad

fix: replace as-any casts with proper types in herald, pi, and DM too…

d5e306f

…ls (closes #81, closes #82, closes #83)

fix: address low-priority issues — budget reset, refund guard, comman…

46c6cae

…d handler, type mismatches, intent classifier (closes #88, closes #89, closes #90, closes #91, closes #92)

fix: wire kill switch to all agents — chat, crank, sentinel, herald (c…

8d00310

…loses #93)

fix: add requireOwner authorization middleware for admin routes (closes

7b8e4ba

#94)

fix: add graceful shutdown handler for SIGTERM/SIGINT (closes #98)

2b9184f

fix: add wallet-scoping to confirmation endpoint (closes #95)

518809d

Store originating wallet in pending map and validate req.wallet matches before resolving. Prevents unauthorized confirmation of fund-moving operations by other authenticated wallets.

fix: pin JWT algorithm to HS256 and fix Bearer prefix parsing (closes #…

014b39a

…96, closes #103)

fix: add canMakeCall budget guard to write HERALD tools (closes #97)

cc7dbaf

likeTweet, replyTweet, sendDM, and publishTweet now check the budget gate before executing. Prevents overspending when gate is dm-only or paused.

fix: use JWT wallet as approver in herald approval endpoint (closes #102

f148acc

)

fix: set EventBus maxListeners to prevent warning spam (closes #104)

db174e1

fix: cap conversation messages at 100 to prevent memory growth (closes …

364612f

…#105)

fix: harden backend state management — nonce cap, JWT secret, system …

423aaf5

…prompt, confirm cap, admin tokens (closes #106, closes #107, closes #108, closes #109, closes #110)

fix: low-priority hardening — base58 guard, domain nonce, SSE IDs, X …

4b70cc1

…client cache (closes #117, closes #118, closes #121, closes #122)

fix: add isStealth type and useAuth error state (closes #123, closes #…

da54368

…124)

rz1989s merged commit 01d3c1f into main Apr 9, 2026
3 checks passed

This was referenced Apr 9, 2026

HIGH: Chat endpoints use body wallet instead of JWT-authenticated wallet #99

Closed

HIGH: /api/tools/:name bypasses confirmation flow — direct tool execution #100

Closed

LOW: console.log includes full wallet addresses — privacy concern #119

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Guardian Command — Phase 2 multi-agent platform rewrite#70

feat: Guardian Command — Phase 2 multi-agent platform rewrite#70
rz1989s merged 92 commits intomainfrom
feat/phase2-guardian-command

rz1989s commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rz1989s commented Apr 9, 2026

Summary

Plan A: Infrastructure + Pi Migration (16 tasks)

Plan B: HERALD — X Agent (11 tasks)

Plan C: SENTINEL — Blockchain Monitor (6 tasks)

Plan D: Guardian Command UI (10 tasks)

Stats

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant