508 AI Evaluation Platform

Interactive web application for evaluating and testing the Section 508 Compliance Analysis pipeline. Upload solicitation PDFs and watch the full AI analysis run in real time with complete LLM prompt/response transparency.

Purpose

This is the iterative improvement workbench for the 508 AI pipeline. Use it to:

Upload solicitation documents and run the full analysis pipeline
See every LLM prompt and response in real time (SSE streaming)
Compare results across different AI providers (LM Studio, OpenAI, AWS Bedrock)
Tune temperature, max tokens, and other parameters via the Settings panel
Download full analysis results as JSON

Quick Start

cd 508_eval_platform

# Create virtual environment (first time only)
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

# Copy and configure environment
cp .env.example .env
# Edit .env with your provider settings

# Start the server
./run.sh

Open http://localhost:8080 in your browser.

AI Providers

Provider	Config	Notes
LM Studio	`USE_LMSTUDIO=true`	Local inference, port 1236
OpenAI	`USE_OPENAI=true` + API key	GPT-4o default
AWS Bedrock	Default (neither flag set)	Requires AWS credentials

Project Structure

508_eval_platform/
├── eval_ui/              # FastAPI web app
│   ├── api.py            # API endpoints + SSE streaming
│   ├── runner.py         # Pipeline orchestrator
│   ├── instrumentation.py # LLM call capture
│   ├── models.py         # Pydantic data models
│   └── static/           # Frontend (HTML + JS)
├── src/
│   ├── config_loader.py  # Environment config
│   └── core/
│       ├── processors.py      # Main analysis pipeline
│       ├── vector_matching.py # FAISS vector search
│       ├── file_utils.py      # PDF/text extraction
│       ├── bedrock_client.py  # AWS Bedrock client
│       ├── lmstudio_client.py # LM Studio client
│       └── openai_client.py   # OpenAI client
├── data/standards/       # 508 standards text
├── section_508_db/       # FAISS vector index
├── requirements.txt
├── run.sh
├── .env.example
└── README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

508 AI Evaluation Platform

Purpose

Quick Start

AI Providers

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
eval_ui		eval_ui
section_508_db		section_508_db
src		src
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
run.sh		run.sh

Folders and files

Latest commit

History

Repository files navigation

508 AI Evaluation Platform

Purpose

Quick Start

AI Providers

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages