Sullybase Local LLM Chat

v2.4.1

A lightweight desktop app for chatting with local LLMs via Ollama. All conversations stay on your device — no external data transmission.

Architecture

The app is built on a Flask + pywebview stack:

server.py — Flask backend: Ollama API proxy, chat/settings persistence, context file loading, file browser
app.py — Desktop launcher: starts Flask in a daemon thread, waits for it to be ready, then opens a pywebview window
index.html / app.js / style.css — Frontend served by Flask as static files

Chats and settings are stored as JSON files in the app support directory:

macOS: ~/Library/Application Support/Sullybase-LLM-Chat/

Features

Sidebar

Model selector with refresh button — lists all locally available Ollama models
Chat history — browse, search (with snippet preview), and switch between past conversations
Stats panel — live GPU/CPU device badge, VRAM usage bar, context window usage bar, tokens/sec

Chat Interface

Streaming responses with blinking cursor and stop button
Markdown rendering — headings, code blocks with syntax highlighting and copy button, tables, blockquotes, lists
Thinking block support — collapsible <think>...</think> sections for reasoning models
AI-generated chat titles with word-sliced fallback; retries on subsequent messages if the first attempt fails
Context files — attach local files or folders (up to 2 MB per file) to inject into the system prompt; re-read from disk on each message
Regenerate & edit — re-roll the last reply, or edit a prior user message to resend from that point (the following replies are discarded and regenerated)
Draft persistence — unsent text is kept per-chat across switches and reloads
Keyboard shortcuts — Cmd/Ctrl+N new chat, Cmd/Ctrl+K search, Esc closes panels
Scroll-to-bottom button — appears when you've scrolled up in a long chat
Message timestamps — each assistant reply shows when it was sent (time same-day, date otherwise)

Performance info bar

Shows prompt tokens ↑, completion tokens ↓, tokens/sec, first-token latency, and total generation time after each response.

Requirements

Python 3.14+
Dependencies: see requirements.txt (flask, pywebview, requests)
Ollama installed and running. Install by: • Clicking the link above, or • Running this in terminal
```
 curl -fsSL https://ollama.com/install.sh | sh
```

Setup

Download the Code folder from this repository
Install dependencies:
```
pip install -r requirements.txt
```
Run:
```
python app.py
```
Connect Ollama — open Ollama, then click ↻ in the sidebar to load models

Optional: macOS Automator Shortcut

You can create a double-clickable launcher using Automator:

Open Automator → New Document → Application
Add a Run Shell Script action
Set the script to:
```
cd /path/to/repo
python app.py
```
Save the Automator app anywhere (e.g. Applications or your Desktop)

Notes

Privacy: All data stays local — no external network calls except to localhost:11434 (Ollama)
Logging: Rotating logs written to the app support directory (logs/sullybase.log)
Thinking models: Thinking is parsed and shown as collapsible sections
macOS file browser: Uses osascript (AppleScript) to avoid thread-safety issues with tkinter

Model Support

Developed and tested on an Apple M2 Air (8 GB RAM) with qwen2.5-coder:3b and 'qwen3.5:4b-mlx'. Any Ollama-compatible model should work given sufficient RAM and compute.

Contributing

Open an issue to discuss the change first
Fork, implement, and submit a pull request

Use GitHub Issues for bug reports and feature requests.

Possible Updates

Agentic mode

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
Code		Code
.DS_Store		.DS_Store
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sullybase Local LLM Chat

Architecture

Features

Sidebar

Chat Interface

Performance info bar

Requirements

Setup

Optional: macOS Automator Shortcut

Notes

Model Support

Contributing

Possible Updates

About

Uh oh!

Releases 4

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sullybase Local LLM Chat

Architecture

Features

Sidebar

Chat Interface

Performance info bar

Requirements

Setup

Optional: macOS Automator Shortcut

Notes

Model Support

Contributing

Possible Updates

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages