Timestamper

Timestamper is a Python tool for working with audio transcription and subtitle files. It enables you to generate timestamped templates for text, transcribe audio into subtitles with word-level timestamps, and convert .srt subtitle files into .docx documents.

Features

Add Timestamps to Text Files: Automatically generate .srt-style timestamp templates for .txt files.
Audio Transcription: Transcribe audio files (.mp3, .wav, .m4a, .aac, .webm) into .srt subtitle files using the faster-whisper library.
Word-Level Timestamps: Generate subtitles with precise word-level timestamps.
SRT to DOCX Conversion: Convert .srt subtitle files into .docx documents with timestamps and text.
Custom Whisper Model Selection: Choose from various Whisper model sizes (tiny, base, small, medium, large) for transcription.
Multi-Language Support: Specify the language for audio transcription.

Installation

Clone the repository:

git clone https://github.com/leopalladium/timestamper.git
cd timestamper

Install dependencies:
```
pip install -r requirements.txt
```

Usage

Transcribing Audio Files

Run main.py.
Select the root directory containing your audio files.
Choose the Whisper model size.
Enter the language code (e.g., en, ru).
Select which audio files to process or type all to process all found files.
The script generates .srt files with word-level timestamps in the same directory as the audio files.

Adding Timestamp Templates to Text Files

Place your .txt file in the script directory.
Use the add_timestamps_to_sentences function to generate a timestamped template.

Converting SRT to DOCX

Use the convert_srt_to_docx function to convert an .srt file to a .docx document.

Roadmap

Add speaker diarization
Optimize and debug code
Develop as a subtitle editing tool
API for server deployment
Add audio track from video text recognition
Make executable more lightweight
Add signature for executable

License

This project is licensed under the MIT License. See the LICENSE file for details.

Author

Klimentsi Katsko (@leopalladium)

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.idea		.idea
LICENSE		LICENSE
README.md		README.md
TimestampTool_1.0.0.0.spec		TimestampTool_1.0.0.0.spec
icon.ico		icon.ico
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Timestamper

Features

Installation

Usage

Transcribing Audio Files

Adding Timestamp Templates to Text Files

Converting SRT to DOCX

Roadmap

License

Author

About

Uh oh!

Releases 2

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Timestamper

Features

Installation

Usage

Transcribing Audio Files

Adding Timestamp Templates to Text Files

Converting SRT to DOCX

Roadmap

License

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Uh oh!

Contributors

Uh oh!

Languages