Switch speak/dub default voices to the requested language's native voice by alexkroman · Pull Request #136 · AssemblyAI/cli

alexkroman · 2026-06-12T23:07:47Z

Each streaming-TTS voice speaks exactly one language, but assembly speak
and assembly dub always defaulted to the English voices (jane / the
English rotation) regardless of the requested language — a German dub came
out in jane's voice.

A new aai_cli/tts/voices.py maps every voice to its language (giovanni=it,
lola=es, juergen=de, rafael=pt, estelle=fr, the rest en). With no explicit
--voice, both commands now rotate through the requested language's native
voices — most languages ship exactly one, so the language alone selects the
voice. English keeps the curated multi-speaker rotation, a language without
a catalog voice falls back to it, and an explicit --voice (bare or
SPEAKER=VOICE) still always wins.

https://claude.ai/code/session_01PPkdXahnabDwMBwCWcSEGA

Each streaming-TTS voice speaks exactly one language, but `assembly speak` and `assembly dub` always defaulted to the English voices (jane / the English rotation) regardless of the requested language — a German dub came out in jane's voice. A new aai_cli/tts/voices.py maps every voice to its language (giovanni=it, lola=es, juergen=de, rafael=pt, estelle=fr, the rest en). With no explicit --voice, both commands now rotate through the requested language's native voices — most languages ship exactly one, so the language alone selects the voice. English keeps the curated multi-speaker rotation, a language without a catalog voice falls back to it, and an explicit --voice (bare or SPEAKER=VOICE) still always wins. https://claude.ai/code/session_01PPkdXahnabDwMBwCWcSEGA

…mediafile refactor Reconciles the shared-scaffolding refactor with three upstream changes: the language-native voice rotation (#136), the dub --video flag and the new caption command (#139), and the exec-module splits (#138). - run_dub keeps upstream's YouTube-download branch and _dub_and_emit split, with this branch's early validation (--voice parse, URL echo, out-path checks) threaded through; the parsed --voice pair rides in a frozen _VoicePlan. - caption_exec now uses the shared mediafile helpers too (it had copied the same scaffolding), which also gives caption the upfront out-path validation, the samefile self-overwrite guard, the transcript status check, and the './-' ffmpeg path hardening. - mediafile grows the caption-shaped pieces: validate_out (hoisted from dub_exec), a general resolve_transcript (diarized variant delegates), a kind= parameter for validate_local_media, and a suggestion override for ffmpeg_failure. - test_dub_pipeline's YouTube-source tests move to test_dub_sources.py to stay under the 500-line file gate. https://claude.ai/code/session_018TuAQTvp9PVy5EdhsDWo2h

alexkroman added this pull request to the merge queue Jun 12, 2026

Merged via the queue into main with commit 8e16afb Jun 12, 2026
15 checks passed

alexkroman deleted the claude/magical-maxwell-5qocqa branch June 12, 2026 23:17

alexkroman mentioned this pull request Jun 13, 2026

Deduplicate clip/dub media scaffolding into shared mediafile module #137

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch speak/dub default voices to the requested language's native voice#136

Switch speak/dub default voices to the requested language's native voice#136
alexkroman merged 1 commit into
mainfrom
claude/magical-maxwell-5qocqa

alexkroman commented Jun 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alexkroman commented Jun 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants