-
Notifications
You must be signed in to change notification settings - Fork 13
Open
Description
Hey — I've been using Familiar daily on Windows and built out a full Windows port on a fork. Wanted to gauge interest before opening PRs.
What's included:
- Windows OCR pipeline using the built-in
Windows.Media.OcrAPI (WinRT via PowerShell 5.1) — mirrors the Apple Vision OCR pattern with layout region inference (HEADER/SIDEBAR/CONTENT from word bounding boxes) - Foreground window metadata capture — captures active window title, app name, PID, and URL extraction for browsers at screenshot time. This addresses Capture ground-truth app/window/URL metadata per frame #11 and complements the on-screen app detector added recently (which is macOS-only)
- NSIS installer packaging —
npm run dist:winbuilds a Windows.exeinstaller with PowerShell scripts properly unpacked viaextraResources - Cross-platform Launch at Login — replaces the macOS-only
openAtLoginwith a cross-platform setting that works on both platforms - Platform-aware defaults — capture interval, UI text ("System Tray" vs "Dock"), extractor type auto-detection, settings normalisation for
windows_ocrtype
The OCR quality is solid for screen text — I've been running it daily for a few weeks. All output follows the familiar-layout-v0 markdown format with real frontmatter metadata (app, window title, URL).
Happy to break this into focused PRs if you're interested. Thinking something like:
- Core Windows OCR + packaging
- Cross-platform Launch at Login + capture interval
- README updates
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels