...
- Auto-save incoming paper correspondence to a case, by receiving the pages from the scanning machine
- Enable all scanned documents to be searched using keywords that the document contains
...
The OCR functionality is enabled by default, using the tesseract OCR engine (maintained by google).
...
You can disable the OCR functionality as follows:
curl -H 'Content-Type:application/json+nxrequest' -X POST -d '{"params":{"enabled":false},"context":{}}' -u
Administrator:Administrator http://yournuxeoserver.com:8080/nuxeo/site/automation/SetOcrEnabledOperation
You can re-enable the OCR functionality by calling this REST command:
curl -H 'Content-Type:application/json+nxrequest' -X POST -d '{"params":{"enabled":true},"context":{}}' -u
Administrator:Administrator http://yournuxeoserver.com:8080/nuxeo/site/automation/SetOcrEnabledOperation
You can check whether OCR is currently enabled by calling this REST command:
curl -H 'Content-Type:application/json+nxrequest' -X POST -d '{"params":{},"context":{}}' -u
...
Tesseract 3 is included in the nuxeo container and will be used for OCRing by default. This is only recommended for low document numbers and reasonably low image-pdf document throughput.
Anchor3 3
Commercial OCR Applications
3 | |
3 |
...