ADHD_Recognition with personal voice
-
Updated
Jun 3, 2025 - Python
ADHD_Recognition with personal voice
The Morse Wave Translator software is an application designed to record audio signals and subsequently decode them into Morse code. The application leverages several Python libraries for this task, including PyQt5 for the creation of the graphical interface, sounddevice for audio recording, soundfile for reading and writing audio files, pygame ...
Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI nodes. Supports voice design (prompt-to-speech), voice cloning (zero-shot), and custom voice synthesis with multiple speakers and languages. Features lazy model loading to optimize memory, multi-model sizes (0.6B and 1.7B), ASR and support for various audio inputs.
ses tanıma, ses analiz ve ses tespit projesidir. Projemizde giriş ifadesi olan sesin okunması, özniteliklerinin çıkarılması ve uygun dijital formata dönüştürülmesi adımları sırasıyla gerçekleştirilmiştir. Verilen ifadelerin yetersiz olması durumunda ses verisinin çoğaltılması ve kaliteli veriyi elde etme işlemleri uygulanmıştır.
✅ completed | Voices assistant for windows managing system applications
Music player "based terminal/no GUI" scripted in python. Allow you to play and download music from youtube.
🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for voice design, cloning, and speech recognition across multiple languages.
Voice Assistant powered by LangChain and OpenAI Whisper. Features real-time speech recognition, multi-LLM support (OpenAI, Google), and computer vision capabilities via OpenCV. Enables natural voice interactions with advanced AI responses.
Python script that samples wav files you supply and then generates a meditation audio file.
Streamliner-AI is a fully automated, asynchronous Python pipeline designed to monitor Kick streamers, detect viral high-energy moments, generate vertical clips optimized for social media, and publish them to TikTok without manual intervention.
Editor audio per segmenti musicali
This case study uses Multimodal Generative AI (text, image, audio, video) to create a complete, professional digital marketing campaign for the small bakery, demonstrating a cost-effective content creation process.
Audio Imager & Editor Python Package
A Python-based application for real-time emotion detection from audio recordings, built with Streamlit and machine learning libraries.
A web-based tool for analyzing how Convolutional Neural Networks classify sound. Upload audio files to interactively explore layer activations and see how the model decodes complex waveforms into distinct categories.
A multi-modal AI system that detects whether images, audio, or text are real or AI-generated. Built using CNNs, NLP, and audio feature extraction with a unified Streamlit interface for real-time authenticity verification.
An application that aims to produce the transcripts of a meeting. Apart from the transcripts, it incorporates an in-built Voice Classification Software, capable of identifying and distinguishing between each participant, thus personalizing the transcripts with respect to each participant
Add a description, image, and links to the soundfile topic page so that developers can more easily learn about it.
To associate your repository with the soundfile topic, visit your repo's landing page and select "manage topics."