Contract Intelligence Analyst

An agentic RAG system that lets you chat with legal contracts. Upload any commercial agreement and ask questions in plain English. The agent finds relevant clauses, flags risks, and compares terms across documents.

This project is built as a portfolio piece to demonstrate end-to-end agentic RAG, local LLM deployment, and production-ready ML tooling.

What it does

Chat with contracts Ask questions like "What are the termination clauses?" and get grounded answers with citations.
Risk flagging Scans contracts for high-risk language such as unlimited liability, auto-renewal, and broad IP assignment. Outputs severity levels: HIGH, MEDIUM, LOW.
Clause extraction Extract specific clause types such as payment, liability, or non-compete across all contracts.
Cross-contract comparison Compare how the same clause differs across agreements side by side.
Source transparency Every answer includes references to the exact contract and chunk it came from.

Architecture

contract-analyst/
├── ingestion/        # PDF parsing, chunking, embedding, FAISS index
│   ├── ingest.py
│   ├── faiss.index
│   └── chunks.pkl
├── agent/            # LangGraph agent with tools
│   └── agent.py
├── app/              # Streamlit UI
│   └── app.py
└── eval/             # RAGAS evaluation suite, 20-question golden dataset

Ingestion pipeline

Contracts are split into 512-character chunks with 64-character overlap using RecursiveCharacterTextSplitter
Each chunk is embedded using all-MiniLM-L6-v2 (384 dimensions) from sentence-transformers
Embeddings are stored in a FAISS IndexFlatIP index for cosine similarity search

Agent layer

The LangGraph StateGraph implements a ReAct loop. The LLM decides which tool to call, executes it, reasons over the result, and either calls another tool or returns a final answer.

Tools

Tool	Purpose
`vector_search`	General semantic search across contracts
`clause_extractor`	Extract specific clauses with summaries
`risk_flagger`	Generate structured risk reports
`contract_comparator`	Compare clauses across contracts

Stack

Layer	Technology
LLM	Mistral 7B via Ollama (local, no API keys)
Agent framework	LangGraph
Embeddings	sentence-transformers (all-MiniLM-L6-v2)
Vector store	FAISS
UI	Streamlit
Chunking	LangChain RecursiveCharacterTextSplitter

Setup

Prerequisites

Python 3.10 or higher
Ollama installed

1. Clone and create a virtual environment

git clone https://github.com/Eklavya20/contract-analyst.git
cd contract-analyst
python -m venv venv

# Windows
.\venv\Scripts\activate

# Mac/Linux
source venv/bin/activate

2. Install dependencies

pip install -r requirements.txt

3. Pull the model

ollama pull mistral

4. Build the index

python ingestion/ingest.py

This step chunks and embeds contracts, then saves the FAISS index in the ingestion/ directory.

5. Run the app

streamlit run app/app.py

Or run the agent directly:

python agent/agent.py

Requirements

langchain
langchain-community
langchain-ollama
langgraph
faiss-cpu
sentence-transformers
pdfplumber
numpy
streamlit
pandas

Example queries

Flag all risks in SaaS_Agreement.txt
Compare liability clauses across contracts
What are the payment terms in Distribution_Agreement.txt?
Which contracts include auto-renewal clauses?
Show termination clauses across all contracts

What I learned building this

Some of the most important challenges were not where I expected them.

Chunking matters more than the model If a clause is split across chunks, it becomes much harder to retrieve. Tuning chunk size (512 characters) and overlap (64 characters) improved retrieval quality more than prompt changes.
Agent routing becomes valuable with scale A single pipeline works for small datasets, but different query types need different tools. Semantic search, structured risk reports, and comparisons require different handling.
Local LLMs are viable for focused use cases Mistral 7B running on Ollama can reliably handle tool usage and structured outputs. Latency is higher than cloud APIs, but the tradeoff is strong: no cost, offline capability, and better data privacy.

Roadmap

RAGAS evaluation suite — faithfulness 0.842, context recall 0.950
Enable PDF uploads via the Streamlit UI with dynamic re-indexing
Add concept drift detection for clause distributions
Support the CUAD dataset (500+ annotated contracts)

Evaluation results (RAGAS)

Evaluated on a 20-question golden dataset derived from the contract corpus, using Mistral 7B as both the generator and judge.

Metric	Score	What it measures
Faithfulness	0.842	Answers grounded in retrieved context (no hallucination)
Answer relevancy	0.767	Answer actually addresses the question
Context precision	0.901	Retrieved chunks are relevant to the question
Context recall	0.950	Relevant chunks are successfully retrieved

Notable failure cases: questions about auto-renewal and governing law scored lowest on faithfulness, traced to retrieval misses where the relevant clause fell outside the top-5 retrieved chunks. Increasing TOP_K from 5 to 7 is the next iteration.

Acknowledgements

CUAD Dataset (Contract Understanding Atticus Dataset)
LangGraph for agent orchestration
Ollama for local model serving

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contract Intelligence Analyst

What it does

Architecture

Ingestion pipeline

Agent layer

Tools

Stack

Setup

Prerequisites

1. Clone and create a virtual environment

2. Install dependencies

3. Pull the model

4. Build the index

5. Run the app

Requirements

Example queries

What I learned building this

Roadmap

Evaluation results (RAGAS)

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
agent		agent
app		app
eval		eval
ingestion		ingestion
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Contract Intelligence Analyst

What it does

Architecture

Ingestion pipeline

Agent layer

Tools

Stack

Setup

Prerequisites

1. Clone and create a virtual environment

2. Install dependencies

3. Pull the model

4. Build the index

5. Run the app

Requirements

Example queries

What I learned building this

Roadmap

Evaluation results (RAGAS)

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages