QueryFox 🦊

Advanced RAG System with Intelligent Agentic Routing

A production-ready document intelligence system that combines Retrieval Augmented Generation (RAG) with intelligent query routing. Built with FastAPI and Streamlit, featuring automatic decision-making between local document search and real-time web search.

✨ Features

🤖 Intelligent Query Routing - Automatically routes between document RAG and web search using LangGraph
📄 Multi-Format Support - PDF, DOCX, TXT, CSV document processing
🌐 Smart Web Scraping - Structured content extraction (headings, paragraphs, lists) with recursive crawling
🔍 Semantic Search - FAISS vector database with K=3 retrieval optimization
⚡ Fast Inference - Groq API (Llama 3.3 70B) for <2s response times
💬 Interactive Chat - Real-time Streamlit interface with source attribution
🐳 Docker Ready - Containerized deployment with Docker Compose

🏗️ Architecture

┌──────────────┐      ┌─────────────────┐      ┌──────────────┐
│   Streamlit  │ ───► │  FastAPI + RAG  │ ───► │  Groq LLM    │
│   Frontend   │      │  + LangGraph    │      │  (Llama 3.3) │
└──────────────┘      └─────────────────┘      └──────────────┘
                              │
                    ┌─────────┴─────────┐
                    ▼                   ▼
              ┌──────────┐        ┌──────────┐
              │  FAISS   │        │  Tavily  │
              │  Vector  │        │  Web     │
              │  Search  │        │  Search  │
              └──────────┘        └──────────┘

🛠️ Tech Stack

Backend:

FastAPI - High-performance async API
LangChain & LangGraph - AI orchestration & agentic routing
FAISS - Vector similarity search
Groq API - Fast LLM inference
Tavily API - Real-time web search

Frontend:

Streamlit - Interactive web UI

AI/ML:

HuggingFace Transformers (all-MiniLM-L6-v2)
PyPDF, python-docx - Document processing
BeautifulSoup4 - Web scraping

🚀 Quick Start

Prerequisites

Python 3.11+
Docker & Docker Compose (optional)
Groq API Key (Get here)
Tavily API Key (Get here)

Local Setup (Recommended for Development)

# 1. Clone repository
git clone https://github.com/prasanna-nagarale/QueryFoxAi.git
cd QueryFox

# 2. Create environment file
cp .env.example .env
# Add your API keys to .env

# 3. Backend setup
cd backend
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt

# 4. Start backend (Terminal 1)
uvicorn app.main:app --host 0.0.0.0 --port 8000 --reload

# 5. Frontend setup (Terminal 2)
cd ../frontend
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

# 6. Start frontend
streamlit run app.py

Access: http://localhost:8501

Docker Setup (Recommended for Production)

# 1. Clone and configure
git clone https://github.com/prasanna-nagarale/QueryFoxAi.git
cd QueryFox
cp .env.example .env
# Add your API keys to .env

# 2. Start with Docker
docker-compose up --build

# Access:
# Frontend: http://localhost:8501
# Backend API: http://localhost:8000/docs

📝 Environment Variables

Create a .env file in the root directory:

GROQ_API_KEY=your_groq_api_key_here
TAVILY_API_KEY=your_tavily_api_key_here

💻 Usage

1. Upload Document

Click "Upload Document"
Select PDF, DOCX, TXT, or CSV file
System processes and creates embeddings

2. Scrape Website

Enter URL in "Scrape Website"
Set max pages (1-5)
System extracts structured content

3. Ask Questions

Type your question in chat
System automatically routes to RAG or web search
Receive answer with source attribution

📊 Performance

Response Time: <2 seconds (RAG mode)
Document Processing: 3-7 seconds (average)
Retrieval Accuracy: K=3 optimized chunks
Supported File Size: Up to 10MB
Web Scraping: 1-5 pages with structured extraction

🎯 How It Works

Query Analysis - LangGraph analyzes if query needs current info or can use documents
Intelligent Routing:
- RAG Path: Semantic search in FAISS → Retrieve K=3 chunks → Generate answer
- Web Path: Tavily search → Fetch articles → Synthesize answer
Response Generation - Groq LLM creates contextual answer with sources

📂 Project Structure

QueryFox/
├── backend/
│   ├── app/
│   │   ├── main.py              # FastAPI application
│   │   ├── config.py            # Configuration
│   │   ├── core/
│   │   │   ├── rag_engine.py    # RAG logic
│   │   │   ├── langgraph_agent.py  # Agentic routing
│   │   │   ├── document_processor.py
│   │   │   ├── web_scraper.py
│   │   │   └── embeddings.py
│   │   └── models/
│   │       └── schemas.py       # Pydantic models
│   ├── requirements.txt
│   └── Dockerfile
├── frontend/
│   ├── app.py                   # Streamlit UI
│   ├── requirements.txt
│   └── Dockerfile
├── docker-compose.yml
├── .env.example
└── README.md

🔧 API Endpoints

Endpoint	Method	Description
`/`	GET	Health check
`/upload`	POST	Upload and process document
`/scrape`	POST	Scrape and process website
`/query`	POST	Query with intelligent routing
`/docs`	GET	Interactive API documentation

Full API Docs: http://localhost:8000/docs (when running)

🤝 Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create feature branch (git checkout -b feature/AmazingFeature)
Commit changes (git commit -m 'Add AmazingFeature')
Push to branch (git push origin feature/AmazingFeature)
Open Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

LangChain - AI orchestration framework
Groq - Fast LLM inference
Tavily - Web search API
FAISS - Vector search
Streamlit - UI framework

📧 Contact

Prasanna Nagarale

Email: nagaraleprasanna@gmail.com
LinkedIn: linkedin.com/in/prasanna-ai
GitHub: github.com/prasanna-nagarale
Portfolio: prasanna-nagarale.github.io/prasanna-portfolio

🌟 Star History

If you find this project useful, please consider giving it a star ⭐

Built with ❤️ using FastAPI, LangGraph, and Groq

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

QueryFox 🦊

✨ Features

🏗️ Architecture

🛠️ Tech Stack

🚀 Quick Start

Prerequisites

Local Setup (Recommended for Development)

Docker Setup (Recommended for Production)

📝 Environment Variables

💻 Usage

1. Upload Document

2. Scrape Website

3. Ask Questions

📊 Performance

🎯 How It Works

📂 Project Structure

🔧 API Endpoints

🤝 Contributing

📄 License

🙏 Acknowledgments

📧 Contact

🌟 Star History

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
backend		backend
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Folders and files

Latest commit

History

Repository files navigation

QueryFox 🦊

✨ Features

🏗️ Architecture

🛠️ Tech Stack

🚀 Quick Start

Prerequisites

Local Setup (Recommended for Development)

Docker Setup (Recommended for Production)

📝 Environment Variables

💻 Usage

1. Upload Document

2. Scrape Website

3. Ask Questions

📊 Performance

🎯 How It Works

📂 Project Structure

🔧 API Endpoints

🤝 Contributing

📄 License

🙏 Acknowledgments

📧 Contact

🌟 Star History

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages