GitHub - kasimmj/local-ai-stack: 🐳 One-command self-hosted AI stack — Ollama + Open WebUI + Qdrant + n8n. Your private ChatGPT, fully offline.

Self-host an entire production-grade AI stack with ONE command. LLMs • Vector DB • Web UI • Workflow Automation • RAG • Voice — all local, all yours.

Quick Start • What's Inside • Architecture • Use Cases

🎯 Why Local AI Stack?

You don't have to choose between convenience and privacy. You don't have to pay a hosted provider for every token your chatbot processes. You don't have to leak company data through an API.

Run your own GPT — at home, on-prem, or in your private cloud.

git clone https://github.com/kasimmj/local-ai-stack
cd local-ai-stack
./start.sh

That's it. You now have a ChatGPT clone at http://localhost:3000, a vector database at http://localhost:6333, a workflow editor at http://localhost:5678, and a model API at http://localhost:11434.

📦 What's Inside

Service	Purpose	Port
🦙 Ollama	Run LLMs locally (Llama, Mistral, Qwen, DeepSeek...)	11434
💬 Open WebUI	ChatGPT-style UI, RAG, voice, image, multi-user	3000
🔍 Qdrant	Production vector database for embeddings/RAG	6333
🔄 n8n	Visual workflow automation (1000+ integrations)	5678
🗄️ Postgres	Persistent storage for n8n and your apps	5432
⚡ Redis	Fast cache + job queue	6379
🌐 Caddy	Auto-HTTPS reverse proxy (optional)	80/443

🚀 Quick Start

Prerequisites

Docker 24+ and Docker Compose v2
16GB RAM minimum (32GB recommended for larger models)
~50GB free disk space
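
The prerequisites above can be checked before the first run. The following is a minimal sketch, not part of the repository: the function names are hypothetical, the RAM check reads /proc/meminfo and therefore assumes Linux, and it only confirms that Docker and Compose v2 are present without comparing the Docker version against 24.

```shell
#!/bin/sh
# Preflight sketch: compares this machine against the README's stated minimums.
# All function names here are illustrative, not part of local-ai-stack.

MIN_RAM_GB=16    # from the prerequisites list
MIN_DISK_GB=50   # from the prerequisites list

# Succeeds when the measured value ($1) is at least the required value ($2).
meets_minimum() {
    [ "$1" -ge "$2" ]
}

# Total RAM rounded to the nearest GB, read from /proc/meminfo (Linux only).
total_ram_gb() {
    awk '/^MemTotal:/ { printf "%d\n", ($2 / 1024 / 1024) + 0.5 }' /proc/meminfo
}

# Free space in whole GB on the filesystem that holds directory $1.
free_disk_gb() {
    df -Pk "$1" | awk 'NR == 2 { printf "%d\n", $4 / 1024 / 1024 }'
}

# Fails if Docker or Compose v2 is missing; prints warnings for low RAM or disk.
preflight() {
    docker --version >/dev/null 2>&1 || { echo "docker not found"; return 1; }
    docker compose version >/dev/null 2>&1 || { echo "docker compose v2 not found"; return 1; }
    meets_minimum "$(total_ram_gb)" "$MIN_RAM_GB" || echo "warning: less than ${MIN_RAM_GB}GB RAM"
    meets_minimum "$(free_disk_gb .)" "$MIN_DISK_GB" || echo "warning: less than ${MIN_DISK_GB}GB free disk"
}
```

Source the file and run `preflight` from the directory where the stack will live, so the disk check measures the right filesystem.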

Installation

git clone https://github.com/kasimmj/local-ai-stack
cd local-ai-stack
cp .env.example .env
./start.sh

Open http://localhost:3000 → create your admin account → start chatting.
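
If the page does not load, it can help to see which services answer at all. This sketch probes the four default URLs named in this README with curl and treats any HTTP response as "up". The function names are hypothetical, and it assumes every port is reachable from the host, which may not hold for your configuration.

```shell
#!/bin/sh
# Reachability sketch for the default endpoints listed in this README.
# Function names are illustrative, not part of local-ai-stack.

# Prints "name url" pairs, one per line.
stack_endpoints() {
    cat <<'EOF'
open-webui http://localhost:3000
qdrant http://localhost:6333
n8n http://localhost:5678
ollama http://localhost:11434
EOF
}

# Prints the HTTP status code for URL $1, or 000 when nothing answers.
http_status() {
    curl -s -o /dev/null -w '%{http_code}' --max-time 5 "$1"
}

# Reports each endpoint as up (any HTTP response) or down (no response).
check_stack() {
    stack_endpoints | while read -r name url; do
        code=$(http_status "$url")
        if [ "$code" != "000" ]; then
            echo "up    $name ($url, HTTP $code)"
        else
            echo "down  $name ($url)"
        fi
    done
}
```

Run `check_stack` a minute or so after `./start.sh`, since containers may still be initializing. A service reported as down may simply not publish its port on the host.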

Pull your first model

docker exec -it ollama ollama pull llama3.2:3b      # Fast & small (2GB)
docker exec -it ollama ollama pull qwen2.5:7b       # Great quality (4GB)
docker exec -it ollama ollama pull deepseek-r1:8b   # Reasoning (5GB)
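
Once a model is pulled, it can also be queried without the web UI through Ollama's HTTP API. The sketch below posts a single prompt to `/api/generate` with streaming turned off. The helper names are hypothetical, it assumes port 11434 is reachable from the host, and it does no JSON escaping, so keep quotes, backslashes, and newlines out of the prompt.

```shell
#!/bin/sh
# Sketch: send one prompt to a pulled model through Ollama's HTTP API.
# Helper names are illustrative, not part of local-ai-stack.

OLLAMA_URL="${OLLAMA_URL:-http://localhost:11434}"

# Builds the JSON body for POST /api/generate.
# Assumes $1 (model) and $2 (prompt) contain no quotes, backslashes, or
# newlines, because this sketch does no JSON escaping.
generate_payload() {
    printf '{"model":"%s","prompt":"%s","stream":false}' "$1" "$2"
}

# Sends the prompt and prints the raw JSON reply; the generated text is in
# the reply's "response" field.
ask_model() {
    generate_payload "$1" "$2" |
        curl -s "$OLLAMA_URL/api/generate" -H 'Content-Type: application/json' -d @-
}
```

For example, `ask_model llama3.2:3b "Say hello in one sentence."` prints one JSON object. Pipe it through a JSON tool such as `jq -r .response` if one is installed.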

Stop / Reset

./stop.sh                  # Graceful shutdown
./reset.sh                 # Nuke everything (delete data)

🏗️ Architecture

                ┌──────────────────────────────────────────┐
                │              Caddy (Reverse Proxy)        │
                └────┬────────────────┬────────────┬───────┘
                     │                │            │
              ┌──────▼──────┐  ┌──────▼─────┐  ┌──▼─────┐
              │ Open WebUI  │  │     n8n    │  │ Qdrant │
              │   :3000     │  │   :5678    │  │  :6333 │
              └──────┬──────┘  └──────┬─────┘  └────────┘
                     │                │
                     │  ┌─────────────┴──┐
                     │  │   Postgres     │
                     │  │     :5432      │
                     │  └────────────────┘
              ┌──────▼──────┐
              │   Ollama    │
              │   :11434    │
              └─────────────┘

All services share a private Docker network. Only Open WebUI, n8n, and Qdrant are exposed by default — everything else stays internal.
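
To confirm which services are actually published on your host, the port column of `docker ps` can be inspected. This is a sketch with hypothetical function names: it relies on Docker printing host mappings as `host:port->container_port/proto`, and it does not distinguish a port bound to 127.0.0.1 from one bound to all interfaces.

```shell
#!/bin/sh
# Sketch: list which running containers publish ports on the host.
# Function names are illustrative, not part of local-ai-stack.

# Succeeds when a `docker ps` Ports string ($1) contains a host mapping,
# which Docker prints as "host:port->container_port/proto".
is_published() {
    case "$1" in
        *"->"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Prints one line per running container: "published" or "internal",
# followed by the container name and its raw Ports string.
list_exposure() {
    docker ps --format '{{.Names}}|{{.Ports}}' | while IFS='|' read -r name ports; do
        if is_published "$ports"; then
            echo "published  $name  $ports"
        else
            echo "internal   $name  $ports"
        fi
    done
}
```

Running `list_exposure` after startup gives a quick view of what is reachable from outside the Docker network.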

💡 Use Cases

🏢 Private Company Assistant

Replace ChatGPT Team with an internal assistant that knows your docs and never leaks data.

🎓 University RAG Research

Index thousands of papers and chat with them — no API costs, full reproducibility.

🤖 Customer Support Bot

Index your knowledge base for retrieval, then deploy via n8n webhooks to WhatsApp/Telegram/Slack.

🛡️ Privacy-First Personal AI

Your conversations, your data, your model. No telemetry.

🌐 Edge AI for Disconnected Regions

Bring AI to areas with limited internet: the stack runs fully offline once the images and models have been downloaded.

⚙️ Configuration

Choose your model

Edit .env:

DEFAULT_MODEL=qwen2.5:7b
EMBEDDING_MODEL=nomic-embed-text
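
The models named in .env still have to be present in Ollama. Whether `start.sh` pulls them automatically is not stated here, so the following sketch does it by hand. The function names are hypothetical, and it assumes plain `KEY=value` lines and a container named `ollama`, as in the pull commands earlier in this README.

```shell
#!/bin/sh
# Sketch: read model names from .env and pull them into Ollama.
# Function names are illustrative, not part of local-ai-stack.

# Prints the value of KEY ($1) from env file ($2); the last assignment wins.
# Assumes plain KEY=value lines without quotes or inline comments.
env_value() {
    grep "^$1=" "$2" | tail -n 1 | cut -d= -f2-
}

# Pulls the chat and embedding models named in the env file ($1, default .env).
# Keys that are missing or empty are skipped.
pull_configured_models() {
    env_file="${1:-.env}"
    for key in DEFAULT_MODEL EMBEDDING_MODEL; do
        model=$(env_value "$key" "$env_file")
        if [ -n "$model" ]; then
            docker exec ollama ollama pull "$model"
        fi
    done
}
```

Run `pull_configured_models` from the repository root after editing .env.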

Add Arabic / RTL support

Open WebUI already supports RTL out of the box. Just pick العربية in Settings → Interface.

Enable HTTPS (production)

DOMAIN=ai.yourcompany.com ./start.sh --with-caddy

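Caddy will auto-provision a Let's Encrypt certificate, provided the domain's DNS record points at this host and ports 80 and 443 are reachable from the internet.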

Add a custom model

docker exec -it ollama ollama pull <model-name>
# Then refresh Open WebUI — it appears in the model picker.

📊 Resource Requirements

Model size	RAM	Disk	Speed (RTX 4090)
3B	4GB	2GB	80 tok/s
7B	8GB	4GB	50 tok/s
13B	16GB	8GB	28 tok/s
34B	32GB	20GB	12 tok/s
70B	64GB	40GB	6 tok/s

CPU-only mode is supported (slower).
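
The table can be turned into a quick lookup. This sketch maps an amount of RAM in whole GB to the largest model tier whose RAM figure fits. The function name is hypothetical, and the thresholds are copied from the RAM column only, so leave headroom for the operating system and the other services in the stack.

```shell
#!/bin/sh
# Sketch: map available RAM (whole GB) to the largest model tier in the table.
# The function name is illustrative, not part of local-ai-stack.

# Prints the largest model size whose RAM requirement is <= $1, or "none".
largest_model_for_ram() {
    ram_gb="$1"
    if   [ "$ram_gb" -ge 64 ]; then echo "70B"
    elif [ "$ram_gb" -ge 32 ]; then echo "34B"
    elif [ "$ram_gb" -ge 16 ]; then echo "13B"
    elif [ "$ram_gb" -ge 8 ];  then echo "7B"
    elif [ "$ram_gb" -ge 4 ];  then echo "3B"
    else echo "none"
    fi
}
```

For example, `largest_model_for_ram 16` prints `13B`, and `largest_model_for_ram 3` prints `none`.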

🧩 Extensions

Drop YAML files into extensions/ to add more services:

voice/ — Whisper STT + Piper TTS for voice chat
vision/ — Stable Diffusion for image generation
search/ — SearXNG for web-grounded answers
monitoring/ — Grafana + Loki for observability

./start.sh --enable voice,vision,search

🛡️ Security Notes

⚠️ Default credentials are random per-install (stored in .env)
⚠️ Never expose ports directly to the public internet without Caddy + auth
✅ All inter-service traffic is on a private Docker network
✅ No telemetry, no analytics, no external calls (unless you add them)
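
Because the generated credentials live in .env, it is worth checking that only the owner can read that file. The sketch below is an assumption-laden helper, not something the repository ships: the function names are hypothetical, and it reads the permission string from `ls -ld` rather than relying on platform-specific `stat` flags.

```shell
#!/bin/sh
# Sketch: confirm the credentials file is accessible by its owner only.
# Function names are illustrative, not part of local-ai-stack.

# Succeeds when file $1 grants no permissions to group or others.
# Characters 5-10 of the `ls -ld` mode string are the group and other bits.
is_private_file() {
    perms=$(ls -ld "$1" | cut -c5-10)
    [ "$perms" = "------" ]
}

# Tightens .env (or the file given as $1) to owner read/write if needed.
lock_env_file() {
    env_file="${1:-.env}"
    if is_private_file "$env_file"; then
        echo "$env_file is already private"
    else
        chmod 600 "$env_file" && echo "$env_file set to mode 600"
    fi
}
```

Run `lock_env_file` from the repository root after the first `./start.sh`.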

🤝 Contributing

PRs welcome! Especially:

Additional Ollama model preset bundles
n8n workflow templates
Extensions for new services (TTS, vision, scrapers)
Translations for Open WebUI

📜 License

See the LICENSE file in the repository root.

Star ⭐ if you'd rather own your AI than rent it.
