OCR Agent

To get the most out of images and PDFs in Aiimi Insight Engine, an OCR agent converts them to text and image-over-text PDF files. This means you can:

  • Search the contents of a PDF or image file.

  • See hit highlighting on a PDF or image file.

  • Easily find entities and metadata for PDF or image files within preview.

View our guide to configuring OCR.

OCR Engines

The OCR agent ships with a built-in engine called IronOCR. It can also run other OCR engines, such as ABBYY FineReader.

The enrichment pipeline invokes the OCR agent through the OcrRest enrichment step.

OCR Requirements

For light OCR loads, the agent can share a server with other agents, such as enrichment and source agents.

If you are running OCR on a large volume of content, run this agent on a dedicated server. The initial load into Aiimi Insight Engine may require increased computing power, which can be scaled back once you are only processing deltas.
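
As a rough illustration of why the initial load and the ongoing deltas can call for different amounts of computing power, the sketch below estimates how many parallel OCR workers a job might need. It is not part of Aiimi Insight Engine: the function name `ocr_workers_needed`, the throughput figure, and the page counts are all hypothetical, so substitute numbers measured in your own environment.

```python
import math


def ocr_workers_needed(pages: int, pages_per_worker_hour: float, deadline_hours: float) -> int:
    """Estimate the parallel OCR workers needed to process `pages` within `deadline_hours`.

    Hypothetical helper for capacity planning; not an Aiimi Insight Engine API.
    """
    if pages_per_worker_hour <= 0 or deadline_hours <= 0:
        raise ValueError("throughput and deadline must be positive")
    # Pages a single worker can process before the deadline.
    capacity_per_worker = pages_per_worker_hour * deadline_hours
    # Round up, and always keep at least one worker running.
    return max(1, math.ceil(pages / capacity_per_worker))


# Assumed figures for illustration only.
initial_load = ocr_workers_needed(pages=2_000_000, pages_per_worker_hour=1_000, deadline_hours=240)
daily_delta = ocr_workers_needed(pages=5_000, pages_per_worker_hour=1_000, deadline_hours=8)

print(f"Initial load: {initial_load} workers")
print(f"Daily delta: {daily_delta} workers")
```

With these assumed figures, the initial load needs several workers while the daily delta fits on one, which is the pattern described above: provision more capacity for the first crawl, then scale back.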
