Skip to content

feat: add mlx support for IndexTTS2 #512

Draft
0xrushi wants to merge 1 commit intoBlaizzy:mainfrom
0xrushi:feat/indextts2
Draft

feat: add mlx support for IndexTTS2 #512
0xrushi wants to merge 1 commit intoBlaizzy:mainfrom
0xrushi:feat/indextts2

Conversation

@0xrushi
Copy link
Copy Markdown

@0xrushi 0xrushi commented Feb 19, 2026

Context

Description

Implements a full MLX-native IndexTTS2 pipeline (w2v-bert -> semantic codec -> UnifiedVoice -> s2mel -> BigVGAN), adds Qwen-based emotion-from-text support

Additional information

NA

Checklist

  • Tests added/updated
  • Documentation updated
  • Issue referenced (e.g., "Closes #...")

- Introduced `indextts2_inference.py` for quality-focused TTS inference with command-line arguments.
- Added `__init__.py` and `emotion.py` to support emotion processing in the IndexTTS2 model.
- Created model configuration and conversion scripts for various components including BigVGAN, CAMPPlus, and semantic codecs.
- Implemented core model logic in `indextts2.py`, integrating emotion handling and audio processing.
- Added support for W2V-BERT and UnifiedVoice models, enhancing the overall TTS pipeline.
@0xrushi 0xrushi marked this pull request as draft February 19, 2026 12:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant