OCR engines
An OCR Engine is used in the Digitization component, to identify text in a file, when native content is not available.
note
The images that need to be processed should have a resolution range of:
- min: 50 x 50 pixels
- max: 9000 x 9000 pixels
Here is a selection of OCR Engines that you can choose from, according to your needs, throughout the Document UnderstandingTM Framework.
| OCR Engine | Activity Pack | Debug Logs Format in Logs Folder | Reports Confidence |
|---|---|---|---|
| UiPath Extended Languages OCR | UiPath.OCR.Activities | ${date:format=yyyy-MM-dd} | ✅ |
| UiPath Document OCR | UiPath.OCR.Activities | ${date:format=yyyy-MM-dd} | ✅ |
| OCR for Chinese, Japanese and Korean | UiPath.Core.Activities.CjkOCR | ${date:format=yyyy-MM-dd} | ✅ |
| OmniPage OCR | UiPath.OmniPage.Activities | ${date:format=yyyy-MM-dd} | ✅ |
| Google Cloud Vision OCR | UiPath.UIAutomation.Activities | ${date:format=yyyy-MM-dd} | ❌ if DetectionMode is set to TextDetection (default) ✅ if DetectionMode is set to DocumentTextDetection |
| Microsoft Azure Computer Vision OCR | UiPath.UIAutomation.Activities | ${date:format=yyyy-MM-dd} | ❌ if UseReadAPI is not selected (default) ✅ if UseReadAPI is selected |
| Microsoft OCR | UiPath.UIAutomation.Activities | ${date:format=yyyy-MM-dd} | ❌ |
| Tesseract OCR | UiPath.UIAutomation.Activities | ${date:format=yyyy-MM-dd} | ✅ |
note
When debugging errors, you can always visit the logs folder and check the relevant OCR log files. Read more about logging here.