This page describes the setup of an independent OCR appliance for use with the Extended DMS environment. DMS will access the OCR appliance and issue commands so that OCR is performed. The advantages are that the DMS appliance's resources (CPU, RAM) are thus not consumed by the OCR processing.
OCRKit Professional is a low cost multi language OCR utility with good output quality. Given it's multi core architecture, it has also good performance. While OCRKit is available for MacOS and Windows, the current guide only explains a setup using the MacOS version given the current Windows version does not include CLI access. Get OCRKit Professional from here: http://www.ocrkit.com.
/Applications/
("OCRKit appliance")/<storagepath>/nuxeo/data/
and /<storagepath>/nuxeo/tmp/
NFS-exports /var/lib/nuxeo/data/
and /opt/nuxeo/server/tmp/
respectively and make suer these mounts will be auto mounted (kept alive)/<storagepath>/nuxeo/scripts/
and make executable (chmod +x)/Users/<<adminuser>>/.ssh/authorized_keys
file of OCR appliance. Alternatively, you can use ssh-copy-id as per the steps outlined here: https://help.dreamhost.com/hc/en-us/articles/216499537-How-to-configure-passwordless-login-in-Mac-OS-X-and-Linux~/.ssh/
to /<storagepath>/nuxeo/ssh/.
chown this folder to root:root. Alternatively, you can "docker exec" into the nuxeo container, create the ssh keys directly inside the container and transfer them directly over to the OCR appliance using ssh-copy-id as per the above link.~/deploy/config/
)
#!/bin/bash ssh <<adminuser>>@<<OCRKit.appliance.name>> "nice -n 10 /Applications/OCRKit\ Pro.app/Contents/MacOS/OCRKit\ Pro --format text --output $1 $2" |
"elif [ ${1} = "NUXEO" ] then"
) so as to be added the container run command.:
add_volume "/<storage_path>/nuxeo/ssh" "/home/nuxeo/.ssh" add_volume "/<storage_path>/nuxeo/scripts/ocrKit" "/usr/bin/ocrKit" |
To simplify setting up the persistent NFS mounts on the mac (and indeed work around some bug that had been in the NFS stack for a while), the following scripts can be used – edited using ScriptEditor to fill in the missing parameters. Define this scripts as startup documents in your mac; it will check if an NFS volume is mounted and if not, mount it if it's available.
The below scripts are provided without prejudice, liability, warranty or support. If you do not agree, please do not use this script.
When using the script, please make sure you have the /var/lib/nuxeo/data/
and /opt/nuxeo/server/tmp/
paths created.
Note: You need to run both versions of this script, one for the /tmp
mountpoint and one for the /data
mountpoint.