Instructions for setting up, running, and testing the Vexa system locally using Docker Compose and Make.
-
For CPU (Tiny Model, Slower Performance - Good for local tests/development): this will use 'whisper tiny' model, which can run on CPU.
make all
This command (among other things) uses
env-example.cpudefaults for.envif not present. -
For GPU (Medium Model, Faster Performance - Requires NVIDIA GPU & Toolkit): this will use 'whisper medium' model, which is good enough to run on GPU.
make all TARGET=gpu
This uses
env-example.gpudefaults for.envif not present.
make testWhat to expect during testing:
- Test user and its token are created
- You will be asked for a meeting ID
- Provide the
xxx-xxxx-xxxfrom your running meeting (https://meet.google.com/xxx-xxxx-xxx) - Bot is sent to the meeting you provided
- Wait about 10 sec for the bot to join the meeting
- Let the bot into the conference
- Start speaking
- Wait for the transcripts to appear.
The transcription latency can is higher and quality might be lower when running locally in CPU mode, since you don't have a device to run bigger model quickly. But this is usually enough for development and testing
API docs (Swagger/OpenAPI) are available at (ports are configurable in .env):
Main API docs: http://localhost:8056/docs
Admin API docs: http://localhost:8057/docs
Managing Services:
make ps: Show container status.make logs: Tail logs (ormake logs SERVICE=<service_name>).make down: Stop all services.make clean: Stop services and remove volumes.