From 7d7cced46b5353362a6713beb9fb4bf31b24f096 Mon Sep 17 00:00:00 2001 From: Reise Reise <81468643+Reise-Reise@users.noreply.github.com> Date: Thu, 3 Mar 2022 15:25:25 +0100 Subject: [PATCH 1/2] fix link to archived demoweek.prototypefund.de --- content/de/home/overview.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/de/home/overview.md b/content/de/home/overview.md index 73a2f38..214c742 100644 --- a/content/de/home/overview.md +++ b/content/de/home/overview.md @@ -11,7 +11,7 @@ subtitle = '' columns = "1" +++ -*Eine längere Einführung zu pd3f gibt es in einem [Blog-Post auf der Seite des Prototype Fund](https://demoweek.prototypefund.de/projects/06-ddd.html).* +*Eine längere Einführung zu pd3f gibt es in einem [Blog-Post auf der Seite des Prototype Fund](https://archive.demoweek.prototypefund.de/runde7/projects/06-ddd.html).* pd3f erkennt automatisiert Text auf gescannte PDFs mit [OCRmyPDF](https://github.com/jbarlow83/OCRmyPDF) (Tesseract) und extrahiert Tabellen mit [Camelot](https://github.com/camelot-dev/camelot) und [Tabula](https://github.com/tabulapdf/tabula). Es baut auf der Ausgabe von [Parsr](https://github.com/axa-group/Parsr) auf. From ab54642ed8babd3c0bcbe44a78bb6da192926852 Mon Sep 17 00:00:00 2001 From: Reise Reise <81468643+Reise-Reise@users.noreply.github.com> Date: Thu, 3 Mar 2022 15:25:39 +0100 Subject: [PATCH 2/2] fix link to archived demoweek.prototypefund.de --- content/en/home/overview.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/en/home/overview.md b/content/en/home/overview.md index e436244..ab30402 100644 --- a/content/en/home/overview.md +++ b/content/en/home/overview.md @@ -11,7 +11,7 @@ subtitle = '' columns = "1" +++ -*A longer introduction to pd3f in German [on the site of the Prototype Fund](https://demoweek.prototypefund.de/projects/06-ddd.html).* +*A longer introduction to pd3f in German [on the site of the Prototype Fund](https://archive.demoweek.prototypefund.de/runde7/projects/06-ddd.html).* pd3f can OCR scanned PDFs with [OCRmyPDF](https://github.com/jbarlow83/OCRmyPDF) (Tesseract) and extracts tables with [Camelot](https://github.com/camelot-dev/camelot) and [Tabula](https://github.com/tabulapdf/tabula). It's built upon the output of [Parsr](https://github.com/axa-group/Parsr).