Medical Data Agent Backend

A FastAPI-based backend that uses an AI agent (powered by LangChain and OpenAI) to interpret natural language queries from doctors and retrieve patient data from a mock Epic FHIR service.

Features

- Natural Language Understanding: Uses OpenAI's GPT-4 (via LangChain) to understand queries like "Show me the last 3 A1c results".
- Mock Epic FHIR Client: Simulates an EHR system returning realistic "FHIR-Lite" JSON data for:
  - Patient Demographics
  - Lab Results (Observation)
  - Medications (MedicationRequest)
- Tool Calling: The agent intelligently calls specific Python functions (get_patient, get_labs, get_medications) based on the user's intent.
- Transparent Data Sources: The API returns both the natural language summary and the raw data used to generate the answer.
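
The tool-calling and transparent-data-source features can be pictured with a plain-Python sketch. This is not the project's LangChain code: the tool bodies are hypothetical stubs, and the `{"name": ..., "args": ...}` call format and the `run_tool_calls` helper are assumptions made for illustration. It only shows the idea of dispatching a requested tool by name and keeping the raw result next to the answer.

```python
from typing import Any, Callable, Dict, List


# Hypothetical stubs standing in for the real tools in tools.py.
def get_patient(patient_id: str) -> Dict[str, Any]:
    return {"resourceType": "Patient", "id": patient_id}


def get_labs(patient_id: str) -> List[Dict[str, Any]]:
    return [{"resourceType": "Observation", "subject": patient_id}]


def get_medications(patient_id: str) -> List[Dict[str, Any]]:
    return [{"resourceType": "MedicationRequest", "subject": patient_id}]


# Registry mapping the tool name chosen by the model to a Python function.
TOOLS: Dict[str, Callable[..., Any]] = {
    "get_patient": get_patient,
    "get_labs": get_labs,
    "get_medications": get_medications,
}


def run_tool_calls(tool_calls: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Execute each requested tool and keep the raw results keyed by tool name."""
    raw: Dict[str, Any] = {}
    for call in tool_calls:
        name = call["name"]
        if name not in TOOLS:
            raise ValueError(f"Unknown tool: {name}")
        raw[name] = TOOLS[name](**call["args"])
    return raw


# Assumed shape of a tool call produced for "Show me the last 3 A1c results".
calls = [{"name": "get_labs", "args": {"patient_id": "PT123"}}]
data_sources = run_tool_calls(calls)
```

Returning `data_sources` alongside the generated summary is one way to let a reviewer check the answer against the records it was built from.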

Prerequisites

Python 3.9+
An OpenAI API Key

Installation

Clone the repository.
Install dependencies:
```
pip install -r requirements.txt
```
Set up your environment variables:
```
cp .env.example .env
```
Edit .env and add your OPENAI_API_KEY.

Usage

Start the server:
```
uvicorn main:app --reload
```
The server will start at http://127.0.0.1:8000.

Test the API:

Endpoint: POST /ask

Example Request:

```
{
  "doctor_query": "What is the patient's A1c trend?",
  "patient_id": "PT123"
}
```

Example Curl:

```
curl -X POST "http://127.0.0.1:8000/ask" \
     -H "Content-Type: application/json" \
     -d '{"doctor_query": "Show me active medications", "patient_id": "PT123"}'
```
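
Example Python client:

The same request can be sent from Python using only the standard library. This is a minimal sketch that assumes the server is running at the address above; the helper names `build_ask_request` and `ask` are hypothetical. It makes no assumption about the response fields and simply returns the parsed JSON.

```python
import json
import urllib.request
from typing import Any

ASK_URL = "http://127.0.0.1:8000/ask"


def build_ask_request(doctor_query: str, patient_id: str, url: str = ASK_URL) -> urllib.request.Request:
    """Build the POST request with the JSON body shown in the example above."""
    body = json.dumps({"doctor_query": doctor_query, "patient_id": patient_id}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(doctor_query: str, patient_id: str) -> Any:
    """Send the request and return the decoded JSON response (server must be running)."""
    request = build_ask_request(doctor_query, patient_id)
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server started, `ask("Show me active medications", "PT123")` mirrors the curl call.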

Mock Data

The system is pre-loaded with mock data for testing:

- Patient ID: PT123 (John Doe)
  - Has A1c, Glucose, and Hemoglobin labs.
  - Has Metformin and Lisinopril medications.
- Patient ID: PT456 (Jane Smith)
  - Basic demographics only.

Any other Patient ID will result in empty data or a "Patient not found" error.
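
The lookup behavior described above can be sketched as follows. The dictionary layout and the function names `find_patient` and `find_labs` are hypothetical and are not taken from epic_service.py; only the patient IDs, names, and lab and medication names come from this section.

```python
from typing import Any, Dict, List

# Hypothetical in-memory layout; the real mock service may store FHIR-Lite JSON instead.
MOCK_PATIENTS: Dict[str, Dict[str, Any]] = {
    "PT123": {
        "name": "John Doe",
        "labs": ["A1c", "Glucose", "Hemoglobin"],
        "medications": ["Metformin", "Lisinopril"],
    },
    "PT456": {"name": "Jane Smith", "labs": [], "medications": []},
}


def find_patient(patient_id: str) -> Dict[str, Any]:
    """Return basic demographics, or an error payload for unknown IDs."""
    record = MOCK_PATIENTS.get(patient_id)
    if record is None:
        return {"error": "Patient not found"}
    return {"id": patient_id, "name": record["name"]}


def find_labs(patient_id: str) -> List[str]:
    """Return the lab names on file; unknown or lab-free patients yield an empty list."""
    return list(MOCK_PATIENTS.get(patient_id, {}).get("labs", []))
```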

Project Structure

main.py: FastAPI application and API endpoint definition.
agent.py: LangChain agent logic, prompt engineering, and tool execution loop.
tools.py: Definitions of the tools the AI can use (decorated with @tool).
epic_service.py: The mock client simulating Epic's FHIR API.

Local LLM Clinical Extraction Pipeline (REDCap / Epic FHIR)

Overview

This repository contains a HIPAA-compliant, zero-cloud NLP pipeline that converts complex, unstructured clinical text into structured integer codes for the REDCap Epilepsy database.

Because the pipeline processes unredacted clinical notes and must satisfy IRB requirements, the architecture is designed to run 100% locally on lab hardware (using Ollama with Llama/Qwen models). No patient data is ever transmitted to OpenAI, Anthropic, or external cloud providers.

Core Features

Zero-Cloud Local Inference: Utilizes qwen2.5:14b via Ollama for deterministic, secure clinical abstraction.
Strict Schema Adherence: Leverages LangChain and Pydantic to enforce REDCap data dictionary rules, including mutually exclusive integers and multi-select arrays.
Batch Processing: Automatically processes arrays of clinical progress notes and exports a redcap_import_ready.csv formatted exactly for REDCap bulk upload.
Epic FHIR Integration: Includes backend API clients secured via RSA/JWT for querying structured patient demographics and medication data directly from Epic's FHIR endpoints.
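
To illustrate what strict schema adherence means for single-choice and multi-select fields, here is a plain-Python stand-in for the Pydantic model. It is a sketch, not the repository's schema: the field names `seizure_type` and `medications` and both code sets are hypothetical, and the real values would come from the REDCap data dictionary.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical code sets; the real ones are defined by the REDCap data dictionary.
SEIZURE_TYPE_CODES = {1, 2, 3}
MEDICATION_CODES = {1, 2, 3, 4}


@dataclass
class ExtractionRecord:
    record_id: str
    seizure_type: int  # mutually exclusive: exactly one integer code
    medications: List[int] = field(default_factory=list)  # multi-select: zero or more codes

    def __post_init__(self) -> None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(self.seizure_type, bool) or not isinstance(self.seizure_type, int):
            raise ValueError("seizure_type must be a single integer code")
        if self.seizure_type not in SEIZURE_TYPE_CODES:
            raise ValueError(f"seizure_type {self.seizure_type} is not a known code")
        unknown = set(self.medications) - MEDICATION_CODES
        if unknown:
            raise ValueError(f"unknown medication codes: {sorted(unknown)}")
        if len(set(self.medications)) != len(self.medications):
            raise ValueError("medications must not contain duplicate codes")


record = ExtractionRecord(record_id="1", seizure_type=2, medications=[1, 3])
```

Rejecting out-of-range or duplicated codes at construction time keeps malformed model output from reaching the export step.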

Architecture

Input: Unstructured clinical paragraphs (currently synthetic data for development).
LLM Engine: LangChain orchestration binds the REDCap Pydantic schema to the local 14B parameter model. Temperature is set to 0 for deterministic, repeatable extraction.
Output: A strict Python Dictionary/JSON object mapped to REDCap integer codes.
Export: Pandas aggregates the processed records and generates a batch CSV.
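
The export step can be sketched with the standard library's csv module as a stand-in for Pandas. The field names and the medication code list are hypothetical. The `field___code` column naming with 0/1 values follows REDCap's convention for checkbox (multi-select) fields in import files; check the column names against your project's import template before uploading.

```python
import csv
import io
from typing import Any, Dict, Iterable, TextIO

# Hypothetical multi-select codes; the real list comes from the REDCap data dictionary.
MEDICATION_CODES = [1, 2, 3]


def to_redcap_row(record: Dict[str, Any]) -> Dict[str, Any]:
    """Flatten one extracted record, expanding the multi-select array into 0/1 columns."""
    row: Dict[str, Any] = {
        "record_id": record["record_id"],
        "seizure_type": record["seizure_type"],
    }
    selected = set(record["medications"])
    for code in MEDICATION_CODES:
        row[f"medications___{code}"] = 1 if code in selected else 0
    return row


def write_redcap_csv(records: Iterable[Dict[str, Any]], out: TextIO) -> None:
    """Write all records to an open text file or buffer in a fixed column order."""
    fieldnames = ["record_id", "seizure_type"] + [f"medications___{c}" for c in MEDICATION_CODES]
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(to_redcap_row(r) for r in records)


buffer = io.StringIO()
write_redcap_csv([{"record_id": "1", "seizure_type": 2, "medications": [1, 3]}], buffer)
csv_text = buffer.getvalue()
```

To produce redcap_import_ready.csv, pass a file opened with `open("redcap_import_ready.csv", "w", newline="")` instead of the in-memory buffer.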

Prerequisites

To run this pipeline locally, you will need:

Hardware: A dedicated GPU with at least 8GB of VRAM (e.g., RTX 4070) is recommended to run the 14B model effectively.
Software: Python 3.12+ and Ollama installed and running in the background.
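
Before running the pipeline, it can help to confirm that Ollama is reachable and that the model has been pulled. The sketch below assumes Ollama is listening on its default local port (11434) and uses its `/api/tags` endpoint, which lists locally available models; the helper names are hypothetical.

```python
import json
import urllib.request
from typing import Any, Dict

OLLAMA_TAGS_URL = "http://localhost:11434/api/tags"  # default local Ollama address


def model_is_available(tags: Dict[str, Any], model: str) -> bool:
    """Check a parsed /api/tags response for an exact model name match."""
    return any(entry.get("name") == model for entry in tags.get("models", []))


def fetch_tags(url: str = OLLAMA_TAGS_URL) -> Dict[str, Any]:
    """Query the local Ollama server for its installed models (server must be running)."""
    with urllib.request.urlopen(url, timeout=5) as response:
        return json.loads(response.read().decode("utf-8"))
```

With Ollama running, `model_is_available(fetch_tags(), "qwen2.5:14b")` returns False if the model has not been downloaded yet, in which case `ollama pull qwen2.5:14b` fetches it.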

Setup Instructions

1. Clone the repository and activate the virtual environment:

```
git clone <your-repo-url>
cd Epic-api
.\venv\Scripts\Activate.ps1
```
