From 8a4f9b58ef669445952901d8185bca9ca47b8d05 Mon Sep 17 00:00:00 2001 From: Ralph Soika Date: Sun, 21 Apr 2019 15:43:06 +0200 Subject: [PATCH] implementation issue #35 --- {ocr => tika}/Dockerfile | 0 {ocr => tika}/README.md | 28 +++++++++++++++------ {ocr => tika}/install-tika.sh | 0 {ocr => tika}/test/IMG_20190421_132434.jpg | Bin {ocr => tika}/test/IMG_20190421_133732.jpg | Bin {ocr => tika}/test/imixs-workflow.pdf | Bin {ocr => tika}/test/zugferd_invoice.pdf | Bin 7 files changed, 21 insertions(+), 7 deletions(-) rename {ocr => tika}/Dockerfile (100%) rename {ocr => tika}/README.md (60%) rename {ocr => tika}/install-tika.sh (100%) rename {ocr => tika}/test/IMG_20190421_132434.jpg (100%) rename {ocr => tika}/test/IMG_20190421_133732.jpg (100%) rename {ocr => tika}/test/imixs-workflow.pdf (100%) rename {ocr => tika}/test/zugferd_invoice.pdf (100%) diff --git a/ocr/Dockerfile b/tika/Dockerfile similarity index 100% rename from ocr/Dockerfile rename to tika/Dockerfile diff --git a/ocr/README.md b/tika/README.md similarity index 60% rename from ocr/README.md rename to tika/README.md index 371fbe0..75e9ed4 100644 --- a/ocr/README.md +++ b/tika/README.md @@ -1,12 +1,12 @@ -# imixs/ocr +# imixs/tika -The Docker Image 'imixs/ocr' provides a Tika / Tesseract installation +The Docker Image 'imixs/tika' provides a Tika Server. This server can be used for OCR via a Rest API provided by the [Apache Tika Project](https://tika.apache.org/) ## Features * inherit form official openJDK * runs tika and tesseract with OpenJDK 10 -* suported languages: de, en +* supported languages: de, en ## The Rest API @@ -30,13 +30,21 @@ Examples: -# 2. Running and stopping a container +# Running and stopping a container You can start an instance of the postgres service with the Docker run command: - docker run --name="ocr" \ + docker run --name="tika" \ -p 9998:9998 \ - imixs/ocr + imixs/tika + +## Docker Swarm + +The imixs/tiker image can perfectly be used in a docker swarm environment. So you have a single service providing OCR functionallity via a Rest API. + +## Imixs-Archive + +The [Imixs-Archive Project](https://github.com/imixs/imixs-archive/tree/master/imixs-archive-documents) provides a Imixs-Workflow plugin to be used for OCR. # Contribute @@ -49,8 +57,14 @@ Checkout the sources from GitHub: To build the image from the Dockerfile run: - docker build --tag=imixs/ocr ./ocr + $ docker build --tag=imixs/tika ./tika + +## Push manually to Docker repo (Docker-Hub) + +To push the image to a docker repo: + $ docker build -t imixs/tika:X.X.X . + $ docker push imixs/tika:X.X.X diff --git a/ocr/install-tika.sh b/tika/install-tika.sh similarity index 100% rename from ocr/install-tika.sh rename to tika/install-tika.sh diff --git a/ocr/test/IMG_20190421_132434.jpg b/tika/test/IMG_20190421_132434.jpg similarity index 100% rename from ocr/test/IMG_20190421_132434.jpg rename to tika/test/IMG_20190421_132434.jpg diff --git a/ocr/test/IMG_20190421_133732.jpg b/tika/test/IMG_20190421_133732.jpg similarity index 100% rename from ocr/test/IMG_20190421_133732.jpg rename to tika/test/IMG_20190421_133732.jpg diff --git a/ocr/test/imixs-workflow.pdf b/tika/test/imixs-workflow.pdf similarity index 100% rename from ocr/test/imixs-workflow.pdf rename to tika/test/imixs-workflow.pdf diff --git a/ocr/test/zugferd_invoice.pdf b/tika/test/zugferd_invoice.pdf similarity index 100% rename from ocr/test/zugferd_invoice.pdf rename to tika/test/zugferd_invoice.pdf