Skip to content

WebHare/tika-server

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tika server

This package wraps the Tika server and Tessaract OCR, and embeds a quick test whether the OCR is actually extracing images from PDF

You should expose this server behind a proxy with middleware to take care of any authentication, but be careful if you put this server behind a HTTP/2 proxy. The tika server is case sensitive when processing headers such as X-Tika-OCRLanguage and X-Tika-PDFOcrStrategy but http/2 lowercases headers.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors