DeepSeek-OCR Client

A real-time Electron-based desktop GUI for DeepSeek-OCR

Unaffiliated with DeepSeek

Features

Drag-and-drop image upload
Real-time OCR processing
Click regions to copy
Export results as ZIP with markdown images
GPU acceleration (CUDA, MPS) or CPU fallback
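
To illustrate the ZIP export feature, here is a minimal sketch that bundles a Markdown file and the images it references into one archive using Python's standard `zipfile` module. The function name `export_results`, the `result.md` file name, and the `images/` folder layout are assumptions made for illustration and may not match the client's actual export format.

```python
import io
import zipfile
from typing import Dict


def export_results(markdown_text: str, images: Dict[str, bytes]) -> bytes:
    """Return ZIP bytes containing result.md plus the images it references.

    `images` maps a file name (for example "region_0.png") to raw image bytes.
    The archive layout used here is hypothetical.
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        # The Markdown file sits at the archive root.
        archive.writestr("result.md", markdown_text)
        # Images go in a subfolder so relative links in the Markdown resolve.
        for name, data in images.items():
            archive.writestr(f"images/{name}", data)
    return buffer.getvalue()
```

Building the archive in memory keeps the sketch free of temporary files; the returned bytes can be written to disk wherever the user chooses.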

Requirements

Windows 10/11 (other operating systems are experimental)
Node.js 18+
Python 3.12+
NVIDIA GPU with CUDA, Apple Silicon (MPS), or CPU
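
The version requirements above can be checked before launching. The following is a minimal sketch, not part of the client itself; the helper names `parse_node_major` and `check_prerequisites` are hypothetical. It assumes `node` is on the `PATH` and prints its version in the usual `v18.17.0` form.

```python
import re
import shutil
import subprocess
import sys
from typing import List, Optional


def parse_node_major(version_output: str) -> Optional[int]:
    """Extract the major version from `node --version` output such as 'v18.17.0'."""
    match = re.match(r"v?(\d+)\.", version_output.strip())
    return int(match.group(1)) if match else None


def check_prerequisites() -> List[str]:
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info < (3, 12):
        problems.append(f"Python 3.12+ required, found {sys.version.split()[0]}")
    node = shutil.which("node")
    if node is None:
        problems.append("Node.js not found on PATH")
    else:
        output = subprocess.run(
            [node, "--version"], capture_output=True, text=True
        ).stdout
        major = parse_node_major(output)
        if major is None or major < 18:
            problems.append(f"Node.js 18+ required, found {output.strip() or 'unknown'}")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print(problem)
```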

Note: MPS and CPU backends use @Dogacel's modified model instead of the base model.
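
As a rough illustration of the backend choice described above, the sketch below prefers CUDA, then MPS, then CPU, and picks the model variant to match. It assumes the backend is built on PyTorch, which this README does not state explicitly; the function names and the `"base"` / `"modified"` labels are placeholders, not the client's real identifiers or model repository names.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Prefer CUDA, then Apple's MPS, then fall back to CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def pick_model_variant(device: str) -> str:
    """Per the note above, only CUDA uses the base model (labels are placeholders)."""
    return "base" if device == "cuda" else "modified"


def detect_device() -> str:
    """Query PyTorch if it is installed; otherwise assume CPU."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # Older PyTorch builds have no MPS backend module, so guard the lookup.
    mps = getattr(torch.backends, "mps", None)
    mps_available = bool(mps is not None and mps.is_available())
    return pick_device(torch.cuda.is_available(), mps_available)
```

Keeping `pick_device` free of any PyTorch import makes the preference order easy to test on machines without a GPU.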

Quick Start (Windows)

1. Extract the ZIP file.
2. Run start-client.bat.
   - The first run automatically installs dependencies.
   - Subsequent runs start faster.
3. Load Model: click the "Load Model" button in the app to download the model, or to load it if it has already been downloaded.
   - On the first run, this may take some time.
4. Drop an image onto the drop zone, or click the drop zone to select one.
5. Run OCR: click "Run OCR" to process the image.

Note: if images fail to process even though the model loads properly, close and re-open the app, then try again with the default resolution for "base" and "size". This is a known issue; help fixing it would be appreciated!

Linux/macOS

Follow the Windows instructions, but run start-client.sh instead of start-client.bat.
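
For scripts that need to pick the right launcher automatically, a small sketch along these lines may help. The function name `launcher_script` is hypothetical; the two script names are the ones given in this README.

```python
import platform


def launcher_script(system: str) -> str:
    """Map a platform.system() value to the launcher script named in this README."""
    # platform.system() returns "Windows", "Linux", or "Darwin" (macOS).
    return "start-client.bat" if system == "Windows" else "start-client.sh"


if __name__ == "__main__":
    print(launcher_script(platform.system()))
```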

Links

Future goals (PRs welcome!)

License

MIT

