Build software better, together

Phoenix-dev11 / ADHD_recognition

Star

ADHD_Recognition with personal voice

python flask numpy scikit-learn pandas seaborn matplotlib opensmile librosa html-css-javascript soundfile

Updated Jun 3, 2025
Python

yeyupiaoling / YeAudio

Star

Python的音频工具

audio ffmpeg asr soundfile

Updated Dec 5, 2025
Python

qasax / whisperLive-SystemAudio

Star

soundcard whisper pywebview soundfile whisperlive

Updated Oct 30, 2024
Python

bocaletto-luca / MorseWaveTranslator

Star

The Morse Wave Translator software is an application designed to record audio signals and subsequently decode them into Morse code. The application leverages several Python libraries for this task, including PyQt5 for the creation of the graphical interface, sounddevice for audio recording, soundfile for reading and writing audio files, pygame ...

python open-source pyqt5 morse-code pygame digital-communication audio-processing soundfile sound-device signal-decoding

Updated Jul 6, 2025
Python

PRITHIVSAKTHIUR / Qwen3-TTS-Daggr-UI

Star

Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI nodes. Supports voice design (prompt-to-speech), voice cloning (zero-shot), and custom voice synthesis with multiple speakers and languages. Features lazy model loading to optimize memory, multi-model sizes (0.6B and 1.7B), ASR and support for various audio inputs.

python text-to-speech numpy torch pytorch speech-synthesis voice-control gradio librosa audio-processing asr torchaudio voice-cloning huggingface-transformers speech-to-speech daggr soundfile qwen-tts

Updated Feb 12, 2026
Python

meryemozlem / yapay_zeka_ses_tanima

Star

ses tanıma, ses analiz ve ses tespit projesidir. Projemizde giriş ifadesi olan sesin okunması, özniteliklerinin çıkarılması ve uygun dijital formata dönüştürülmesi adımları sırasıyla gerçekleştirilmiştir. Verilen ifadelerin yetersiz olması durumunda ses verisinin çoğaltılması ve kaliteli veriyi elde etme işlemleri uygulanmıştır.

speech speech-recognition soundfile ses-tespit konusma-analizi

Updated Apr 18, 2023
Jupyter Notebook

RustamovAkrom / Voices-Assistant-For-Windows

Sponsor

Star

✅ completed | Voices assistant for windows managing system applications

numpy torch gtts speechrecognition uv pyttsx3 pyyaml playsound sounddevice langdetect vosk soundfile rapidfuzz

Updated Oct 31, 2025
Python

cromega08 / music_player

Star

Music player "based terminal/no GUI" scripted in python. Allow you to play and download music from youtube.

python music-player os python-script python3 requests shutil pydub requests-module pytube sounddevice shutil-python s2t soundfile

Updated Mar 16, 2022
Python

talin190 / Qwen3-TTS-Daggr-UI

Star

🎤 Create dynamic voice experiences with Qwen3-TTS-Daggr-UI, a Gradio app for voice design, cloning, and speech recognition across multiple languages.

python text-to-speech torch speech-synthesis voice-control gradio librosa audio-processing asr torchaudio voice-cloning huggingface-transformers speech-to-speech daggr soundfile qwen-tts

Updated Mar 9, 2026
Python

LohiyaH / Voice-Assistant

Star

Voice Assistant powered by LangChain and OpenAI Whisper. Features real-time speech recognition, multi-LLM support (OpenAI, Google), and computer vision capabilities via OpenCV. Enables natural voice interactions with advanced AI responses.

pyaudio speech-recognition opencv-python python-dotenv soundfile langchain langchain-community

Updated Jan 8, 2025
Python

lhoestq / pandas-audio-methods

Star

Audio methods for pandas dataframes using soundfile

audio python extension pandas dataset parquet dask dataframe audio-processing huggingface soundfile

Updated Jan 8, 2025
Python

grumpystrongman / pythonmeditationmusic

Star

Python script that samples wav files you supply and then generates a meditation audio file.

python music jupyter numpy jupyter-notebook sampling meditation frequ soundfile

Updated Jun 19, 2023
Jupyter Notebook

oeilsimple / Streamliner-AI

Star

Streamliner-AI is a fully automated, asynchronous Python pipeline designed to monitor Kick streamers, detect viral high-energy moments, generate vertical clips optimized for social media, and publish them to TikTok without manual intervention.

python docker streamlink ffmpeg pytorch pytest click asyncio ruff pyyaml github-actions python-dotenv tiktok-api httpx loguru soundfile kick-api faster-whisper

Updated Feb 6, 2026
Python

edsoftitaly / SongStructurer

Star

Editor audio per segmenti musicali

audio python music editor open-source gui waveform pygame tkinter matplotlib librosa csv-export beat-detection soundfile segmentazione

Updated Sep 29, 2025
Python

Sanjayh1 / Multimodal-GenAI-Bakery-Creatives

Star

This case study uses Multimodal Generative AI (text, image, audio, video) to create a complete, professional digital marketing campaign for the small bakery, demonstrating a cost-effective content creation process.

soundfile audiofile diffusers stable-diffusion-xl videofileclip google-genai veo3 parler-tts

Updated Oct 13, 2025
Jupyter Notebook

KinMaynard / soundscope

Star

Audio Imager & Editor Python Package

audio visualization python dsp sound matplotlib imaging soundfile

Updated Feb 9, 2022
Python

sabin74 / audio_emotion_recognition

Star

A Python-based application for real-time emotion detection from audio recordings, built with Streamlit and machine learning libraries.

machine-learning scikit-learn xgboost classification librosa speech-emotion-recognition streamlit soundfile streamlit-audio-recorder

Updated Jul 30, 2025
Jupyter Notebook

Sushitrashhhh / AudioCNN

Star

A web-based tool for analyzing how Convolutional Neural Networks classify sound. Upload audio files to interactively explore layer activations and see how the model decodes complex waveforms into distinct categories.

react typescript deep-learning neural-network modal audio-visualizer python3 pytorch audio-classification librosa cnn-classification tailwindcss torchaudio fastapi soundfile radix-ui lucide-react nextjs15

Updated Feb 8, 2026
TypeScript

SaiSushma2004 / Multi-Modal-AI-Content-Autenticity-Detection-System

Star

A multi-modal AI system that detects whether images, audio, or text are real or AI-generated. Built using CNNs, NLP, and audio feature extraction with a unified Streamlit interface for real-time authenticity verification.

python nlp machine-learning tensorflow numpy scikit-learn pillow pandas nltk matplotlib deeplearning librosa picklers opencv-python computervision joblib streamlit soundfile generative-ai

Updated Feb 3, 2026
Python

Rohan-Agrawal029 / LyreBird

Star

An application that aims to produce the transcripts of a meeting. Apart from the transcripts, it incorporates an in-built Voice Classification Software, capable of identifying and distinguishing between each participant, thus personalizing the transcripts with respect to each participant

python machine-learning pyaudio tensorflow aws-s3 numpy keras jupyter-notebook pandas librosa audio-processing pydub soundfile

Updated Feb 21, 2023
Jupyter Notebook

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

soundfile

Here are 26 public repositories matching this topic...

Phoenix-dev11 / ADHD_recognition

yeyupiaoling / YeAudio

qasax / whisperLive-SystemAudio

bocaletto-luca / MorseWaveTranslator

PRITHIVSAKTHIUR / Qwen3-TTS-Daggr-UI

meryemozlem / yapay_zeka_ses_tanima

RustamovAkrom / Voices-Assistant-For-Windows

cromega08 / music_player

talin190 / Qwen3-TTS-Daggr-UI

LohiyaH / Voice-Assistant

lhoestq / pandas-audio-methods

grumpystrongman / pythonmeditationmusic

oeilsimple / Streamliner-AI

edsoftitaly / SongStructurer

Sanjayh1 / Multimodal-GenAI-Bakery-Creatives

KinMaynard / soundscope

sabin74 / audio_emotion_recognition

Sushitrashhhh / AudioCNN

SaiSushma2004 / Multi-Modal-AI-Content-Autenticity-Detection-System

Rohan-Agrawal029 / LyreBird

Improve this page

Add this topic to your repo