garrytan · rapidstartup · Mar 28, 2026 · Mar 28, 2026 · Mar 28, 2026 · Mar 28, 2026
diff --git a/.gitattributes b/.gitattributes
@@ -0,0 +1,17 @@
+# Store and checkout text as LF (cross-platform; matches web/TS conventions).
+* text=auto eol=lf
+
+# Binary — never apply line-ending conversion
+*.png binary
+*.jpg binary
+*.jpeg binary
+*.gif binary
+*.webp binary
+*.ico binary
+*.pdf binary
+*.woff binary
+*.woff2 binary
+*.ttf binary
+*.eot binary
+*.wasm binary
+*.zip binary
diff --git a/.planning/codebase/ARCHITECTURE.md b/.planning/codebase/ARCHITECTURE.md
@@ -0,0 +1,153 @@
+# Architecture
+
+**Analysis Date:** 2026-03-28
+
+## Pattern Overview
+
+**Overall:** Modular skill-based architecture with headless browser automation core
+
+**Key Characteristics:**
+- Skills as independent, self-contained modules with defined interfaces
+- Persistent headless browser server for automation tasks
+- CLI wrappers that communicate with persistent services
+- Template-driven skill generation system
+- Centralized configuration and state management
+
+## Layers
+
+**Presentation Layer (CLI):**
+- Purpose: Command-line interfaces for user interaction with skills
+- Location: `*/src/cli.ts` files across modules (browse, design, etc.)
+- Contains: Argument parsing, command routing, user output formatting
+- Depends on: Service layers, configuration systems
+- Used by: End users, scripts, other CLIs
+
+**Service Layer:**
+- Purpose: Core functionality implementation for each skill domain
+- Location: `*/src/` directories (e.g., `browse/src/`, `design/src/`)
+- Contains: Business logic, API integrations, core algorithms
+- Depends on: Infrastructure layer, external SDKs
+- Used by: Presentation layer, other services
+
+**Infrastructure Layer:**
+- Purpose: Shared utilities, configuration, state management, server infrastructure
+- Location: `browse/src/config.ts`, `browse/src/server.ts`, `scripts/` utilities
+- Contains: Persistent browser server, state persistence, config resolution, helper functions
+- Depends on: External packages (playwright, puppeteer-core, etc.)
+- Used by: All service and presentation layers
+
+**Template/Generation Layer:**
+- Purpose: Skill template system and documentation generation
+- Location: `scripts/gen-skill-docs.ts`, `.tmpl` template files
+- Contains: Template processing, skill scaffolding, documentation generation
+- Depends on: File system, template engines
+- Used by: Development workflows, skill creation process
+
+## Data Flow
+
+**Skill Execution Flow:**
+
+1. **Command Invocation:** User runs `gstack <skill> <command>` or direct binary (`browse`, `design`)
+2. **CLI Parsing:** Argument parsing and command routing in `*/src/cli.ts`
+3. **Server Management:** For browse skill, ensures persistent server is running via `ensureServer()`
+4. **Service Call:** Routes to appropriate service function based on command
+5. **Execution:** Service performs core functionality (browser automation, design generation, etc.)
+6. **Result Return:** Output formatted and returned to user via stdout/stderr
+
+**Browser Automation Flow (Browse Skill):**
+1. CLI reads state file (`.gstack/browse.json`) for server connection info
+2. If missing/stale, starts persistent Bun server running `browse/src/server.ts`
+3. CLI sends HTTP commands to server on localhost:port with bearer token auth
+4. Server executes Playwright/Puppeteer commands against Chromium
+5. Results returned via HTTP to CLI, then to user
+
+**Skill Documentation Flow:**
+1. `gen-skill-docs.ts` reads skill YAML/YAML-like configuration
+2. Processes SKILL.md.tmpl templates with skill-specific data
+3. Generates final SKILL.md files in each skill directory
+4. Also generates codex-specific variants when requested
+
+## Key Abstractions
+
+**Skill Abstraction:**
+- Purpose: Standardized interface for all gstack skills
+- Examples: `agents/openai.yaml`, `*/SKILL.md`, `*/SKILL.md.tmpl`
+- Pattern: Each skill has CLI, optional server, templates, tests, and documentation
+
+**Persistent Server Abstraction:**
+- Purpose: Long-running browser automation service
+- Examples: `browse/src/server.ts`, `browse/src/cli.ts` management functions
+- Pattern: State file (.gstack/browse.json) tracks PID, port, token; health checks via HTTP
+
+**Configuration Abstraction:**
+- Purpose: Centralized config resolution with defaults and environment overrides
+- Examples: `browse/src/config.ts`, `resolveConfig()` function
+- Pattern: Hierarchical resolution (defaults → file → env → args) with validation
+
+**Template Abstraction:**
+- Purpose: Reusable skill scaffolding and documentation generation
+- Examples: `SKILL.md.tmpl` files, `gen-skill-docs.ts` script
+- Pattern: Handlebars-style templating with skill metadata injection
+
+## Entry Points
+
+**Binary Entry Points:**
+- `browse/dist/browse`: Compiled browse CLI (primary user entry point)
+- `design/dist/design`: Compiled design CLI
+- `bin/gstack-global-discover`: Global skill discovery utility
+- `browse/dist/find-browse`: Helper for finding browse instances
+
+**Script Entry Points:**
+- `scripts/gen-skill-docs.ts`: Skill documentation generation
+- `scripts/skill-check.ts`: Skill health validation
+- `scripts/dev-skill.ts`: Skill development helper
+- `scripts/discover-skills.ts`: Skill discovery and listing
+
+**Direct Source Entry Points (Dev):**
+- `browse/src/cli.ts`: Development browse CLI
+- `design/src/cli.ts`: Development design CLI
+- `*/src/cli.ts`: Development CLIs for other skills
+
+## Error Handling
+
+**Strategy:** Defensive programming with clear error messages and graceful degradation
+
+**Patterns:**
+- **Server Lifecycle:** Automatic restart on version mismatch or failure, with startup error capture
+- **CLI Validation:** Early argument validation with clear usage instructions
+- **Network Resilience:** Retry mechanisms for transient connection failures in HTTP communication
+- **Process Management:** Cross-platform process detection and cleanup (Windows/Linux/macOS differences)
+- **Authentication:** Token validation with automatic refresh on mismatch
+- **File Operations:** Existence checks before read/write, graceful handling of missing files
+
+**Specific Implementations:**
+- Browse skill uses HTTP health checks (`isServerHealthy()`) instead of PID checks for cross-platform reliability
+- Windows uses Node.js child_process with detached:true for proper process detachment
+- Legacy state cleanup prevents conflicts between different installation methods
+- Server startup includes timeout handling and error log capture for diagnostics
+
+## Cross-Cutting Concerns
+
+**Logging:** 
+- Primary: Console output with color-coded prefixes (`[browse]`, `[design]`, etc.)
+- Levels: Error (console.error), info/status (console.log), debug (conditional)
+- Pattern: Consistent prefixed output for easy filtering
+
+**Validation:**
+- Input: Early validation of CLI arguments and configuration values
+- API: Response validation from external services (OpenAI, browser automation)
+- State: Version checking to detect mismatches between CLI and server binaries
+
+**State Management:**
+- Location: `.gstack/` directory in user home or project root
+- Persistence: JSON state files for server connection info, tokens, version hashes
+- Coordination: File-based locking to prevent race conditions during server startup
+- Cleanup: Automatic cleanup of stale state and legacy /tmp files
+
+**Authentication:**
+- Mechanism: Bearer token auth for CLI-to-server communication
+- Storage: Encrypted file storage (`~/.gstack/openai.json` for design, `.gstack/browse.json` for browse)
+- Resolution: Multiple sources (file → env → interactive prompt) with guided setup
+
+---
+*Architecture analysis: 2026-03-28*
diff --git a/.planning/codebase/CONCERNS.md b/.planning/codebase/CONCERNS.md
@@ -0,0 +1,140 @@
+# Codebase Concerns
+
+**Analysis Date:** 2026-03-28
+
+## Tech Debt
+
+**[Documentation gaps]:**
+- Issue: Many skill directories lack clear documentation on purpose and usage
+- Files: `agents/skill-*/README.md` (missing or minimal in many skills)
+- Impact: Difficult for new contributors to understand skill boundaries and responsibilities
+- Fix approach: Add standardized README templates for each skill with purpose, inputs, outputs, and examples
+
+**[Configuration complexity]:**
+- Issue: Multiple configuration mechanisms across skills (JSON, env vars, hardcoded values)
+- Files: `agents/*/config.ts`, `agents/*/*.json`, `.env.example`
+- Impact: Inconsistent configuration patterns increase cognitive load
+- Fix approach: Establish unified configuration pattern using `resolveConfig()` utility
+
+**[Build script fragmentation]:**
+- Issue: Build and setup scripts scattered across multiple directories
+- Files: `setup*`, `bin/*`, `scripts/*`, `package.json` scripts
+- Impact: Difficult to discover and maintain development workflows
+- Fix approach: Consolidate scripts into standardized locations with clear documentation
+
+## Known Bugs
+
+**[Skill registration failures]:**
+- Symptoms: Skills fail to load due to missing adapter configurations
+- Files: `agents/skill-parser.test.ts`, `agents/*/adapter.ts`
+- Trigger: When skill adapter doesn't implement required interfaces correctly
+- Workaround: Manual verification of adapter implementation
+- Fix approach: Add runtime validation during skill loading with clear error messages
+
+**[Browser cookie import race conditions]:**
+- Symptoms: Cookie import fails intermittently when browser is busy
+- Files: `browse/src/cookie-import-browser.ts`, `setup-browser-cookies/*`
+- Trigger: Concurrent access to browser profile during import
+- Workaround: Retry mechanism with exponential backoff
+- Fix approach: Implement proper file locking or use browser automation APIs
+
+## Security Considerations
+
+**[Environment variable exposure]:**
+- Risk: Accidental committing of `.env` files containing secrets
+- Files: `.env.example` (template), potential `.env*` files in developer environments
+- Current mitigation: `.gitignore` excludes `.env*` files
+- Recommendations: Add pre-commit hook to scan for accidental secrets, use secrets scanning in CI
+
+**[Insecure random number generation]:**
+- Risk: Use of non-cryptographically random values for security-sensitive operations
+- Files: `browse/src/sidebar-agent.ts` (generation of session IDs)
+- Current mitigation: None identified
+- Recommendations: Replace `Math.random()` with `crypto.randomUUID()` or similar secure alternatives
+
+## Performance Bottlenecks
+
+**[Skill loading latency]:**
+- Problem: Initial skill discovery and loading takes significant time
+- Files: `agents/skill-parser.ts`, `agents/skill-validation.ts`
+- Cause: Sequential file system operations for each skill directory
+- Improvement path: Implement skill caching with filesystem watchers for invalidation
+
+**[Browser instance multiplication]:**
+- Problem: Multiple browser instances spawned unnecessarily
+- Files: `browse/src/browser-manager.ts`, `browse/src/server.ts`
+- Cause: Lack of proper singleton pattern for browser manager
+- Improvement path: enforce singleton pattern and reuse browser contexts where safe
+
+## Fragile Areas
+
+**[Agent communication protocol]:**
+- Files: `agents/paperclip/*.ts`, `agents/*/adapter.ts`
+- Why fragile: Loose coupling via JSON messages without schema validation
+- Safe modification: Add JSON schema validation for all inter-agent messages
+- Test coverage: Gaps in error handling for malformed messages
+
+**[File system operation safety]:**
+- Files: `scripts/*` (file manipulation scripts), `setup*` scripts
+- Why fragile: Synchronous file operations without proper error handling
+- Safe modification: Wrap FS operations in try/catch with meaningful error messages
+- Test coverage: Limited unit testing for edge cases (permissions, missing directories)
+
+## Scaling Limits
+
+**[Concurrent skill execution]:**
+- Current capacity: Sequential skill execution in workflows
+- Limit: Long-running skills block entire workflow
+- Scaling path: Implement proper async/await patterns and worker queues for parallel execution
+
+**[Memory usage in browser operations]:**
+- Current capacity: Single browser session per user
+- Limit: Memory leaks in long-running browser sessions
+- Scaling path: Implement periodic browser restart and memory monitoring
+
+## Dependencies at Risk
+
+**[PUPPETEER_EXECUTABLE_PATH]:**
+- Risk: Hardcoded paths to browser executables in `browse/src/platform.ts`
+- Impact: Breaks when browser updates or when running in different environments
+- Migration plan: Use environment-configurable paths with auto-detection fallback
+
+**[Node.js version compatibility]:**
+- Risk: Use of modern Node.js features (`import.meta`, top-level await) that may break on older versions
+- Impact: Limits deployment environments
+- Migration plan: Either require specific Node.js version or transpile for broader compatibility
+
+## Missing Critical Features
+
+**[Skill versioning]:**
+- Problem: No mechanism to track skill versions or dependencies
+- Blocks: Safe updates and rollback of skills
+- Proposed solution: Add version metadata to skill configuration and implement compatibility checking
+
+**[Skill sandboxing]:**
+- Problem: Skills run with full process access
+- Blocks: Safe execution of third-party or untrusted skills
+- Proposed solution: Implement skill execution in restricted environments (e.g., VMs, containers)
+
+## Test Coverage Gaps
+
+**[Error handling in adapters]:**
+- What's not tested: Adapter failure scenarios and recovery
+- Files: `agents/*/adapter.ts`
+- Risk: Silent failures in skill execution
+- Priority: High
+
+**[Edge cases in file system operations]:**
+- What's not tested: Permission errors, disk full, concurrent access
+- Files: `scripts/*`, `setup*` scripts
+- Risk: Unhandled exceptions causing partial setup states
+- Priority: Medium
+
+**[Browser automation failure modes]:**
+- What's not tested: Network failures, browser crashes, element not found
+- Files: `browse/src/*`
+- Risk: Unreliable UI automation
+- Priority: High
+
+---
+*Concerns audit: 2026-03-28*