OCR engines

An OCR engine is used in the Digitization component to identify text in a file when native content is not available.

note

The images that need to be processed should have a resolution range of:

The following OCR engines are available throughout the Document Understanding™ Framework. Choose the one that best fits your needs.

OCR Engine	Activity Pack	Debug Logs Format in Logs Folder	Reports Confidence
UiPath Extended Languages OCR	`UiPath.OCR.Activities`	`${date:format=yyyy-MM-dd}`	✅
UiPath Document OCR	`UiPath.OCR.Activities`	`${date:format=yyyy-MM-dd}`	✅
OCR for Chinese, Japanese and Korean	`UiPath.Core.Activities.CjkOCR`	`${date:format=yyyy-MM-dd}`	✅
OmniPage OCR	`UiPath.OmniPage.Activities`	`${date:format=yyyy-MM-dd}`	✅
Google Cloud Vision OCR	`UiPath.UIAutomation.Activities`	`${date:format=yyyy-MM-dd}`	❌ if DetectionMode is set to TextDetection (default); ✅ if DetectionMode is set to DocumentTextDetection
Microsoft Azure Computer Vision OCR	`UiPath.UIAutomation.Activities`	`${date:format=yyyy-MM-dd}`	❌ if UseReadAPI is not selected (default); ✅ if UseReadAPI is selected
Microsoft OCR	`UiPath.UIAutomation.Activities`	`${date:format=yyyy-MM-dd}`	❌
Tesseract OCR	`UiPath.UIAutomation.Activities`	`${date:format=yyyy-MM-dd}`	✅
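
The "Reports Confidence" column above can be read as a small decision rule. The following Python sketch only restates that column as a lookup. The function name `reports_confidence` and its parameters are hypothetical and are not part of any UiPath activity pack. The defaults mirror the ones stated in the table.

```python
# Illustrative lookup that restates the "Reports Confidence" column.
# All names below are hypothetical helpers, not UiPath APIs.

ALWAYS_REPORTS = {
    "UiPath Extended Languages OCR",
    "UiPath Document OCR",
    "OCR for Chinese, Japanese and Korean",
    "OmniPage OCR",
    "Tesseract OCR",
}
NEVER_REPORTS = {"Microsoft OCR"}


def reports_confidence(
    engine: str,
    detection_mode: str = "TextDetection",  # default for Google Cloud Vision OCR
    use_read_api: bool = False,             # default for Microsoft Azure Computer Vision OCR
) -> bool:
    """Return True if the engine reports confidence with the given settings."""
    if engine in ALWAYS_REPORTS:
        return True
    if engine in NEVER_REPORTS:
        return False
    if engine == "Google Cloud Vision OCR":
        # Confidence is reported only in DocumentTextDetection mode.
        return detection_mode == "DocumentTextDetection"
    if engine == "Microsoft Azure Computer Vision OCR":
        # Confidence is reported only when UseReadAPI is selected.
        return use_read_api
    raise KeyError(f"Unknown OCR engine: {engine}")
```

For example, `reports_confidence("Google Cloud Vision OCR")` evaluates to `False` with the default mode, while passing `detection_mode="DocumentTextDetection"` makes it `True`.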

note

When debugging errors, check the relevant OCR log files in the logs folder. Read more about logging here.
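
As a rough aid for locating those files, the sketch below lists the files in a logs folder whose names contain a given date rendered as `yyyy-MM-dd`. It assumes that the date pattern shown in the table ends up in the log file name. The folder location, the complete file name, and the helper name `find_logs_for_date` are assumptions, so adjust them to your installation.

```python
from datetime import date
from pathlib import Path
from typing import List, Optional, Union


def find_logs_for_date(
    logs_dir: Union[str, Path],
    day: Optional[date] = None,
) -> List[Path]:
    """List files in logs_dir whose names contain the date as yyyy-MM-dd.

    Assumption: the rendered date is part of the log file name.
    """
    day = day or date.today()
    stamp = day.strftime("%Y-%m-%d")  # same shape as yyyy-MM-dd
    folder = Path(logs_dir)
    if not folder.is_dir():
        # Missing folder: nothing to report rather than an error.
        return []
    return sorted(
        p for p in folder.iterdir() if p.is_file() and stamp in p.name
    )
```

Calling `find_logs_for_date(path_to_logs_folder)` with no date returns the matching files for the current day, which is usually the first place to look after a failed run.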