Tika Text Extraction

Tika Text Extraction converts native documents to plain text, this is what is indexed in Aiimi Insight Engine.

  • Endpoint - The endpoint for the Tika service.

    • There is usually an instance running on each enrichment server.

  • Timeout - How long to let a text conversion run before it is cancelled.

  • Text Content Types - Enter file extensions that are already text, and can be skipped.

  • Email Content Types - Enter the file types for emails.

    • This can be left as default in most instances. If you're not sure, please reach out to your contact at Aiimi.