A modern web application that enables real-time voice conversations using OpenAI's Realtime API for speech-to-text and text-to-speech capabilities.
- Real-time voice communication
- Text chat functionality
- Modern and responsive UI
- WebSocket-based communication
- Low-latency audio processing
- Node.js (v14 or higher)
- OpenAI API key
- Modern web browser with microphone support
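
The first two prerequisites above can be verified when the server starts. The following is a minimal sketch, not code from this project: the function name `checkPrerequisites` and its parameters are hypothetical, and it assumes the API key has already been loaded into `process.env` by whatever mechanism the project uses to read its environment file.

```javascript
// Hypothetical startup check; names here are illustrative, not part of this project.
const MIN_NODE_MAJOR = 14; // "v14 or higher" from the prerequisites list

function checkPrerequisites(nodeVersion = process.versions.node, env = process.env) {
  const problems = [];

  // process.versions.node looks like "18.17.0"; compare the major component only.
  const major = Number.parseInt(String(nodeVersion).split(".")[0], 10);
  if (Number.isNaN(major) || major < MIN_NODE_MAJOR) {
    problems.push(
      `Node.js v${MIN_NODE_MAJOR} or higher is required (found ${nodeVersion})`
    );
  }

  // Treat a missing or blank key as a problem.
  const key = env.OPENAI_API_KEY;
  if (typeof key !== "string" || key.trim() === "") {
    problems.push("OPENAI_API_KEY is not set");
  }

  // An empty array means both checks passed.
  return problems;
}
```

Calling `checkPrerequisites()` with no arguments inspects the running process. Passing explicit values, as in `checkPrerequisites("12.22.0", {})`, makes the check easy to exercise without changing the environment.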
- Clone the repository
- Install dependencies:
npm install
- Create a .env file in the root directory and add your OpenAI API key:
OPENAI_API_KEY=your_api_key_here
- Start the development server:
npm run dev
- Open your browser and navigate to http://localhost:3000
- Click the microphone button to start recording
- Speak your message
- Click the microphone button again to stop recording and send
- Alternatively, type your message in the text input and click send
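
The click-to-record, click-again-to-send flow described above can be modeled as a small toggle. This is a sketch under stated assumptions rather than the application's actual code: `createRecordingToggle`, `addChunk`, and the `send` callback are hypothetical names, and `send` stands in for whatever transport the app uses to deliver audio to the server.

```javascript
// Hypothetical model of the microphone button; not taken from this project's source.
function createRecordingToggle(send) {
  let recording = false;
  let chunks = [];

  return {
    isRecording: () => recording,

    // Called for each piece of captured audio; ignored unless recording.
    addChunk(chunk) {
      if (recording) chunks.push(chunk);
    },

    // Called on every microphone button click.
    toggle() {
      if (!recording) {
        // First click: start a fresh recording.
        chunks = [];
        recording = true;
        return "recording";
      }
      // Second click: stop and hand the captured audio to the transport.
      recording = false;
      const captured = chunks;
      chunks = [];
      send(captured);
      return "sent";
    },
  };
}
```

Keeping the state in one place means the button handler only has to call `toggle()`, and audio that arrives while the microphone is idle is dropped instead of leaking into the next message.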
- Node.js
- Express.js
- Socket.IO
- WebSocket
- OpenAI Realtime API
- Web Audio API
MIT