This page describes how to set up an independent OCR appliance for use with the Extended DMS environment. The DMS accesses the OCR appliance and issues commands to it so that OCR is performed there. The advantage is that the DMS appliance's resources (CPU, RAM) are not consumed by OCR processing.
OCRKit Professional is a low-cost, multi-language OCR utility with good output quality. Thanks to its multi-core architecture, it also performs well. While OCRKit is available for macOS and Windows, this guide only covers a setup using the macOS version, because the current Windows version does not include CLI access. Get OCRKit Professional from http://www.ocrkit.com.
/Applications/
("OCRKit appliance") /<storagepath>/nuxeo/data/ and /<storagepath>/nuxeo/tmp/
NFS-exports /var/lib/nuxeo/data/ and /opt/nuxeo/server/tmp/ respectively, and make sure these mounts will be auto mounted (kept alive)
/<storagepath>/nuxeo/scripts/ and make executable (chmod +x)
/<storagepath>/nuxeo/ssh/
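#!/bin/bash
# $1 is passed to --output; $2 is the remaining argument (the file to process).
echo "variables $1 $2"
# Run OCRKit Pro on the OCRKit appliance over SSH.
ssh admin@<<OCRKit.appliance.name>> "/Applications/OCRKit\ Pro.app/Contents/MacOS/OCRKit\ Pro --format text --output $1 $2"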
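The wrapper script above passes $1 and $2 to the remote shell unquoted, so a path containing spaces would be split into several arguments. The following is a minimal sketch of a more defensive variant, not the script used in the actual deployment. The names OCR_HOST, OCR_USER, OCR_DRY_RUN, build_remote_cmd and run_ocr, as well as the host name ocrkit.example.internal, are hypothetical and introduced only for illustration. It assumes bash on the DMS side and a bash- or zsh-compatible login shell on the OCRKit appliance.

```shell
#!/bin/bash
# Sketch of a more defensive ocrKit wrapper. All variable and function
# names below are illustrative assumptions, not part of the original setup.

OCR_HOST="${OCR_HOST:-ocrkit.example.internal}"  # stands in for <<OCRKit.appliance.name>>
OCR_USER="${OCR_USER:-admin}"
OCR_BIN='/Applications/OCRKit Pro.app/Contents/MacOS/OCRKit Pro'

# Build the command line that will run on the OCRKit appliance.
# The bash builtin printf %q escapes spaces and shell metacharacters,
# so each path reaches the remote shell as a single argument.
build_remote_cmd() {
    local output="$1" input="$2"
    printf '%q --format text --output %q %q' "$OCR_BIN" "$output" "$input"
}

run_ocr() {
    if [ "$#" -ne 2 ]; then
        echo "usage: ocrKit <output> <input>" >&2
        return 2
    fi
    local cmd
    cmd="$(build_remote_cmd "$1" "$2")"
    if [ -n "${OCR_DRY_RUN:-}" ]; then
        # Dry run: print the command instead of contacting the appliance.
        echo "ssh ${OCR_USER}@${OCR_HOST} ${cmd}"
        return 0
    fi
    ssh "${OCR_USER}@${OCR_HOST}" "$cmd"
}
```

To use this as a drop-in script, end the file with a call to run_ocr "$@". Setting OCR_DRY_RUN=1 prints the resulting ssh command line, which is a convenient way to inspect the quoting before running OCR for real.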
Make sure that, in the "commands.conf" file of the client-specific auto-deploy repository, the following commands are added to the nuxeo container definition:
add_volume "/var/data/nuxeo/ssh" "/home/nuxeo/.ssh"
add_volume "/var/data/nuxeo/scripts/ocrKit" "/usr/bin/ocrKit"
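The two add_volume lines above make the SSH directory and the wrapper script available inside the nuxeo container at /home/nuxeo/.ssh and /usr/bin/ocrKit. A quick sanity check run inside the container could look like the sketch below. The function name check_ocr_setup is hypothetical, and the check only covers what the page states: the script must be executable and the SSH directory must exist.

```shell
#!/bin/bash
# Sketch: verify the two mounted paths from inside the nuxeo container.
# check_ocr_setup is an illustrative name, not an existing tool.

check_ocr_setup() {
    local script="${1:-/usr/bin/ocrKit}"     # target of the second add_volume
    local ssh_dir="${2:-/home/nuxeo/.ssh}"   # target of the first add_volume
    local status=0

    if [ ! -x "$script" ]; then
        echo "not executable or missing: $script" >&2
        status=1
    fi
    if [ ! -d "$ssh_dir" ]; then
        echo "missing directory: $ssh_dir" >&2
        status=1
    fi
    return "$status"
}
```

Calling check_ocr_setup without arguments checks the default container paths; a non-zero return value means at least one of the two mounts is not in place.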