# Running Large Language Models (LLMs) locally for Retrieval-Augmented-Generation (RAG) Systems with full privacy – Hans Dembinski’s blog [https://hdembinski.github.io/posts/llama_index_rag.html](https://hdembinski.github.io/posts/llama_index_rag.html)