Voice Transcription Proxy

OpenAI-compatible transcription service powered by LFM2.5-Audio-1.5B (Liquid AI).

Uses the official liquid-audio Python package with PyTorch for native GPU inference, so llama.cpp is not required.

Endpoints

Endpoint	Protocol	Description
`POST /v1/audio/transcriptions`	HTTP	Whisper-compatible file upload
`POST /v1/audio/transcriptions` + `stream=true`	HTTP SSE	Streaming transcription
`ws://<host>/v1/realtime?intent=transcription`	WebSocket	OpenAI Realtime API with server VAD
`GET /health`	HTTP	Health + GPU status
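
As a rough illustration of the two HTTP rows above, the sketch below builds a multipart upload for `POST /v1/audio/transcriptions` using only the Python standard library, and extracts `data:` payloads from an SSE response. The `file` and `model` form fields follow the OpenAI transcription API. The model name `lfm2.5-audio`, passing `stream` as a form field, and the `[DONE]` end marker are assumptions, not confirmed behaviour of this proxy. The helper names are hypothetical.

```python
import urllib.request
import uuid


def build_transcription_request(base_url, audio_bytes, filename="audio.wav",
                                model="lfm2.5-audio", stream=False):
    """Build (but do not send) a multipart POST for /v1/audio/transcriptions."""
    boundary = uuid.uuid4().hex
    fields = {"model": model}  # placeholder model name (assumption)
    if stream:
        fields["stream"] = "true"  # assumed to be read as a form field

    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\nContent-Disposition: form-data; name="{name}"'
            f"\r\n\r\n{value}\r\n".encode()
        )
    # The audio file part carries the raw bytes unchanged.
    parts.append(
        (f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
         f'filename="{filename}"\r\n'
         "Content-Type: application/octet-stream\r\n\r\n").encode()
        + audio_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())

    return urllib.request.Request(
        f"{base_url}/v1/audio/transcriptions",
        data=b"".join(parts),
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


def iter_sse_data(lines):
    """Yield the payload of each `data:` line, stopping at an assumed [DONE] marker."""
    for raw in lines:
        line = raw.decode() if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and SSE comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        yield payload
```

To send the request, pass it to `urllib.request.urlopen(req)`. For a streaming call, the response object can be iterated line by line and fed to `iter_sse_data`. The shape of each payload depends on the server and is not specified here.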

Run locally

uv sync
uv run python transcription_proxy.py --port 8091
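
After starting the server, a script may need to wait until it is ready, since the model is loaded at startup. The sketch below polls `GET /health` until it answers. It assumes the endpoint returns HTTP 200 once the service is usable and does not inspect the response body, whose schema is not documented here. The function name and default timings are hypothetical.

```python
import time
import urllib.request


def wait_until_healthy(url="http://localhost:8091/health", timeout=300.0, interval=2.0):
    """Poll `url` until it returns HTTP 200 or `timeout` seconds elapse."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=5) as resp:
                if resp.status == 200:
                    return True
        except OSError:
            # Connection refused, socket timeouts, and HTTP error statuses
            # (urllib's HTTPError is an OSError subclass) all mean "not ready yet".
            pass
        time.sleep(interval)
    return False
```

A generous timeout is a reasonable default for a first run, when the model weights still have to be downloaded.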

Docker

docker build -t voice .
docker run --gpus all -p 8091:8091 voice

# With persistent model cache
docker run --gpus all -p 8091:8091 -v voice-cache:/cache voice

Kubernetes

containers:
  - name: voice
    image: ghcr.io/anthaathi/voice-transcription-proxy:latest
    ports:
      - containerPort: 8091
    resources:
      limits:
        nvidia.com/gpu: 1
    volumeMounts:
      - name: cache
        mountPath: /cache

Architecture

┌─────────────────────────────────────────┐
│ Single process (FastAPI + liquid-audio)  │
│                                          │
│  Model loaded on GPU at startup          │
│  ├─ POST /v1/audio/transcriptions        │
│  ├─ WebSocket /v1/realtime (webrtcvad)   │
│  └─ ASR via LFM2AudioModel              │
└─────────────────────────────────────────┘

The model is downloaded automatically from Hugging Face on first start. Mount a volume at /cache to persist it across restarts.
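
For the WebSocket path in the diagram, a client streams audio as JSON messages. In the OpenAI Realtime API, audio is sent as `input_audio_buffer.append` events carrying base64-encoded audio, and webrtcvad works on 10, 20, or 30 ms frames of 16-bit mono PCM. The sketch below splits PCM into fixed-duration chunks and wraps each one as such an event. That this proxy accepts exactly this event, and the 16 kHz sample rate and 20 ms chunk size, are assumptions. The function name is hypothetical.

```python
import base64
import json


def pcm16_to_append_events(pcm_bytes, sample_rate=16000, chunk_ms=20):
    """Split mono 16-bit PCM into fixed-duration chunks, each wrapped as an
    `input_audio_buffer.append` JSON message (the last chunk may be shorter)."""
    # 2 bytes per 16-bit sample; sample_rate and chunk_ms are assumed values.
    bytes_per_chunk = sample_rate * chunk_ms // 1000 * 2
    events = []
    for start in range(0, len(pcm_bytes), bytes_per_chunk):
        chunk = pcm_bytes[start:start + bytes_per_chunk]
        events.append(json.dumps({
            "type": "input_audio_buffer.append",
            "audio": base64.b64encode(chunk).decode("ascii"),
        }))
    return events
```

Each returned string would be sent as one text frame over the WebSocket connection. Doing so requires a WebSocket client library, which is not shown here.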
