feat: add mlx support for IndexTTS2 by 0xrushi · Pull Request #512 · Blaizzy/mlx-audio

0xrushi · 2026-02-19T12:58:06Z

Context

Add support for IndexTTS2.

Description

Implements a full MLX-native IndexTTS2 pipeline (w2v-bert -> semantic codec -> UnifiedVoice -> s2mel -> BigVGAN) and adds Qwen-based emotion-from-text support.
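To make the stage ordering concrete, here is a minimal sketch of the pipeline as a chain of callables. It only illustrates the order listed above; the helper names (`make_placeholder_stage`, `run_pipeline`) are hypothetical and not taken from this PR, and each stage is a stand-in that records its name rather than running a model.

```python
from typing import Callable, List, Tuple

# Stage names in the order given in the PR description.
STAGE_ORDER = ["w2v-bert", "semantic codec", "UnifiedVoice", "s2mel", "BigVGAN"]

# A stage is a (name, callable) pair. Hypothetical type, for illustration only.
Stage = Tuple[str, Callable[[list], list]]


def make_placeholder_stage(name: str) -> Stage:
    """Hypothetical stand-in: appends its name to a trace instead of running a model."""

    def run(trace: list) -> list:
        return trace + [name]

    return name, run


def run_pipeline(stages: List[Stage], initial):
    """Feed the output of each stage into the next, in list order."""
    value = initial
    for _name, fn in stages:
        value = fn(value)
    return value


stages = [make_placeholder_stage(n) for n in STAGE_ORDER]
trace = run_pipeline(stages, [])
print(" -> ".join(trace))
```

In the real implementation each placeholder would presumably be replaced by the corresponding MLX module, with the emotion input feeding in alongside the text; the PR description does not specify how that conditioning is wired.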

Additional information

NA

Checklist

Tests added/updated
Documentation updated
Issue referenced (e.g., "Closes #...")

- Introduced `indextts2_inference.py` for quality-focused TTS inference with command-line arguments.
- Added `__init__.py` and `emotion.py` to support emotion processing in the IndexTTS2 model.
- Created model configuration and conversion scripts for various components including BigVGAN, CAMPPlus, and semantic codecs.
- Implemented core model logic in `indextts2.py`, integrating emotion handling and audio processing.
- Added support for W2V-BERT and UnifiedVoice models, enhancing the overall TTS pipeline.

0xrushi marked this pull request as draft February 19, 2026 12:58

0xrushi wants to merge 1 commit into Blaizzy:main from 0xrushi:feat/indextts2
