Hey, thanks for the awesome doc toolkit.
2024-11-02 17:47:58,569 - rapid_layout - INFO: pp_layout_cdla contains ['text', 'title', 'figure', 'figure_caption', 'table', 'table_caption', 'header', 'footer', 'reference', 'equation']
0%| | 0/1 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "/Users/jakit/simonas/open-source/RapidDoc/demo.py", line 13, in <module>
    result = pdf_parser(pdf_path)
             ^^^^^^^^^^^^^^^^^^^^
  File "/Users/jakit/simonas/open-source/RapidDoc/rapid_doc/main.py", line 74, in __call__
    txt_boxes, txts = self.run_direct_extract(i, img_width)
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/jakit/simonas/open-source/RapidDoc/rapid_doc/main.py", line 105, in run_direct_extract
    txt_boxes, txts = self.pdf_extracter.extract_page_text(page_num, img_width)
    ^^^^^^^^^^^^^^^
ValueError: too many values to unpack (expected 2)
I tried to run demo.py with pdf_path = "tests/test_files/direct_extract/single_column.pdf" and got the error above.
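For reference, the ValueError at the end of the traceback is the generic error Python raises when a call returns more values than the caller unpacks. Here is a minimal sketch of that mismatch. The function below is a hypothetical stand-in that only borrows the name extract_page_text; I have not checked what the real method returns, so the three-value return is an assumption.

```python
def extract_page_text(page_num, img_width):
    # Hypothetical stand-in, NOT RapidDoc's actual implementation.
    # Assumption: the callee returns three values instead of two.
    boxes = [[0, 0, 10, 10]]
    texts = ["hello"]
    scores = [0.99]
    return boxes, texts, scores


# Unpacking three returned values into two names raises ValueError.
try:
    txt_boxes, txts = extract_page_text(0, 800)
except ValueError as exc:
    error_message = str(exc)
    print(error_message)

# A starred target absorbs any extra return values, so the caller
# keeps working even when the callee returns more than two items.
txt_boxes, txts, *extra = extract_page_text(0, 800)
print(txts, len(extra))
```

If the real cause is a mismatch like this, the fix would be to align the number of return values with the unpacking in run_direct_extract, but that is a guess from the traceback alone.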