CosyVoice2 API server built with FastAPI, featuring efficient voice caching, queue-based request handling, and multi-request concurrency management.
-
Updated
Feb 26, 2026 - Python
CosyVoice2 API server built with FastAPI, featuring efficient voice caching, queue-based request handling, and multi-request concurrency management.
Text-to-speech CLI tool that uses the Kokoro model for inference. Runs extremely fast locally with or without a GPU. Render smooth speech faster than real-time on most machines. Use Kokoro from CLI or the FastAPI webserver via HTTP requests or directly in the browser. Supports audio playback from the CLI, web interface, or download in many formats.
Create instant text to speech using external API
VanillaJS: an app that combines a Joke API, and a speech-to-text API.
A curated list of AI audio generation APIs, SDKs, and tools including text-to-speech, speech synthesis, music generation, voice cloning, sound design, and generative AI platforms. Covers commercial services, open source models with APIs, and production-ready infrastructure for developers building audio applications.
A lightweight Spring Boot application that streams real-time, human-like speech from text using Spring AI and ElevenLabs.
A lightweight Spring Boot application that uses Spring AI and ElevenLabs to transform text into realistic, human-like voice
Project: The Essential Feature Set for User Engagement. Created at https://spectra.codes, which is owned by @Drix10
Curate and access APIs, SDKs, and tools for AI audio generation, including text-to-speech, music creation, and sound design.
Add a description, image, and links to the text-to-speech-api topic page so that developers can more easily learn about it.
To associate your repository with the text-to-speech-api topic, visit your repo's landing page and select "manage topics."