Timestamper is a Python tool for working with audio transcription and subtitle files. It enables you to generate timestamped templates for text, transcribe audio into subtitles with word-level timestamps, and convert .srt subtitle files into .docx documents.
- Add Timestamps to Text Files: Automatically generate
.srt-style timestamp templates for.txtfiles. - Audio Transcription: Transcribe audio files (
.mp3,.wav,.m4a,.aac,.webm) into.srtsubtitle files using thefaster-whisperlibrary. - Word-Level Timestamps: Generate subtitles with precise word-level timestamps.
- SRT to DOCX Conversion: Convert
.srtsubtitle files into.docxdocuments with timestamps and text. - Custom Whisper Model Selection: Choose from various Whisper model sizes (
tiny,base,small,medium,large) for transcription. - Multi-Language Support: Specify the language for audio transcription.
- Clone the repository:
git clone https://github.com/leopalladium/timestamper.git cd timestamper - Install dependencies:
pip install -r requirements.txt
- Run
main.py. - Select the root directory containing your audio files.
- Choose the Whisper model size.
- Enter the language code (e.g.,
en,ru). - Select which audio files to process or type
allto process all found files. - The script generates
.srtfiles with word-level timestamps in the same directory as the audio files.
- Place your
.txtfile in the script directory. - Use the
add_timestamps_to_sentencesfunction to generate a timestamped template.
- Use the
convert_srt_to_docxfunction to convert an.srtfile to a.docxdocument.
- Add speaker diarization
- Optimize and debug code
- Develop as a subtitle editing tool
- API for server deployment
- Add audio track from video text recognition
- Make executable more lightweight
- Add signature for executable
This project is licensed under the MIT License. See the LICENSE file for details.