Questions on OCR evaluation #5

@Soongja

Hi, I have a few questions on OCR evaluation.

  1. When evaluating OCR performance on the DIR300 dataset (or the DocUNet benchmark), the predicted image and the GT image have different sizes. I suppose you resized one of the two in advance. Which size did you resize the images to (the predicted size or the GT size)?

  2. Which tessdata (traineddata) did you use for Tesseract (tessdata_fast, tessdata_best, or tessdata)?
    reference: https://tesseract-ocr.github.io/tessdoc/Data-Files.html
