feat: Full Chatterbox TTS Integration (Backend & UI) #369
jarodise wants to merge 4 commits into Blaizzy:main
- server: Add exaggeration and cfg_weight params
- server: Handle base64 ref_audio decoding
- ui: Add conditional controls for Chatterbox (Ref Audio, Language, Sliders)
- ui: Fix download button and audio blob handling
- ui: Set correct lang_code for Chatterbox

- ui: Show reference filename in audio player instead of default voice name
- ui: Fix download button to correctly save blob as MP3
- ui: Remove confusing 'English-detected' static dropdown
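The commit notes above mention that the server decodes a base64 `ref_audio` payload. As a rough illustration only, here is a minimal sketch of what such a step could look like. The helper name `decode_ref_audio`, the data-URL handling, and the `.wav` suffix are assumptions for this example and are not taken from the PR's actual code.

```python
import base64
import binascii
import tempfile
from pathlib import Path


def decode_ref_audio(ref_audio_b64: str, suffix: str = ".wav") -> Path:
    """Decode a base64 string into a temporary audio file (hypothetical helper).

    Accepts either raw base64 or a browser-style data URL such as
    "data:audio/wav;base64,....". Returns the path of the written file.
    """
    # Browsers often send data URLs; keep only the part after the first comma.
    if ref_audio_b64.startswith("data:") and "," in ref_audio_b64:
        ref_audio_b64 = ref_audio_b64.split(",", 1)[1]

    try:
        # validate=True rejects characters outside the base64 alphabet
        # instead of silently discarding them.
        raw = base64.b64decode(ref_audio_b64, validate=True)
    except (binascii.Error, ValueError) as exc:
        raise ValueError("ref_audio is not valid base64") from exc

    if not raw:
        raise ValueError("ref_audio is empty")

    # delete=False keeps the file on disk so it can be handed to the model;
    # the caller is responsible for removing it afterwards.
    with tempfile.NamedTemporaryFile(suffix=suffix, delete=False) as tmp:
        tmp.write(raw)
    return Path(tmp.name)
```

Writing to a temporary file is one possible design, chosen here on the assumption that the model loads reference audio from a path. If the model accepts in-memory buffers, the file step could be skipped.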
Hey @jarodise, thanks for the awesome contribution!
Let me know what you think! If you agree, we can revert the other changes, keep the server changes for this PR, and open a couple of small PRs addressing the above.

Sure, sounds good! I don't have much experience in coding and this might be my very first code contribution to an open source project. :) If I understood correctly, I should revert the UI and documentation changes on my side?

Amazing, that makes it even more special! I'm here to help you with all your contributions. If it gets complex, you can also open issues detailing the problem or vision, and the community and I routinely pick them up.
Yes :)

I think implementing these changes for Chatterbox is a really good idea.

Hey @jarodise, Qwen3-TTS is out and it does exactly what I recommended. Please add a supported-languages property or method and use it to retrieve the languages for each multilingual model. It's more generalizable. Check: https://github.com/Blaizzy/mlx-audio/blob/main/mlx_audio/tts/models/qwen3_tts/qwen3_tts.py#L194-L196
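To make the suggestion concrete, here is a minimal sketch of the pattern, assuming each multilingual model exposes a method that lists its languages. The method name `get_supported_languages`, both classes, and the language codes are hypothetical placeholders; this does not reproduce the Qwen3-TTS code linked above.

```python
from typing import List


class HypotheticalMultilingualModel:
    """Stand-in for a multilingual TTS model (not a real mlx-audio class)."""

    # Placeholder codes for illustration; a real model would list its own.
    _LANGUAGES = ("en", "fr", "de")

    def get_supported_languages(self) -> List[str]:
        return list(self._LANGUAGES)


class HypotheticalMonolingualModel:
    """Stand-in for a model that exposes no language list."""


def languages_for(model: object) -> List[str]:
    """Return the languages a model advertises, or [] if it advertises none.

    The UI or server can call this for any model instead of hard-coding a
    per-model language table.
    """
    getter = getattr(model, "get_supported_languages", None)
    if callable(getter):
        return list(getter())
    return []


print(languages_for(HypotheticalMultilingualModel()))  # ['en', 'fr', 'de']
print(languages_for(HypotheticalMonolingualModel()))   # []
```

With this shape, a language dropdown could be populated from `languages_for(model)` and hidden when the list is empty, so adding a new multilingual model would not require UI changes.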
This PR integrates full support for the Chatterbox TTS model, including backend enhancements and a complete UI overhaul for voice cloning features.
Changes
1. Server API
2. Web UI
3. Documentation
MLX_AUDIO_GUIDE.md with a comprehensive 'Web UI Usage' section.
This integration allows users to fully utilize Chatterbox's voice cloning and multilingual capabilities directly from the web interface.