First of all, thank you for open-sourcing such an excellent project. I have a question that I would like to ask:
When performing monologue streaming inference, the audio saved directly to a file sounds great. However, during real-time playback, there is consistently a strong electrical noise, and I haven’t been able to achieve smooth audio playback while performing inference simultaneously. This issue has troubled me for several days.
May I ask if there is any existing example or recommended approach for achieving streaming inference with smooth real-time audio playback?
Thank you very much!