This step extracts links from PPTX, DOCX, and XLSX files and stores them in the metadata field externalLinks.
The externalLinks metadata field must exist as a keyword in the entities section of control hub before this step can be run.
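For readers who want to see what link extraction involves, the following is a minimal sketch and not this product's actual implementation. It relies on the fact that PPTX, DOCX and XLSX files are ZIP packages whose relationship parts (files ending in .rels) mark outside targets with TargetMode="External". The function name extract_external_links is hypothetical, and the sketch collects every external relationship, which may be broader or narrower than what this step stores in externalLinks.

```python
import zipfile
import xml.etree.ElementTree as ET

# XML namespace used by relationship parts (*.rels) in OOXML packages.
REL_NS = "{http://schemas.openxmlformats.org/package/2006/relationships}"


def extract_external_links(source):
    """Return the sorted, de-duplicated external targets of an OOXML package.

    `source` is a path or a binary file-like object for a PPTX, DOCX
    or XLSX file. (Hypothetical helper, for illustration only.)
    """
    links = set()
    with zipfile.ZipFile(source) as package:
        for name in package.namelist():
            # Relationship parts live in _rels folders and end in .rels.
            if not name.endswith(".rels"):
                continue
            root = ET.fromstring(package.read(name))
            for rel in root.iter(REL_NS + "Relationship"):
                target = rel.get("Target")
                # Only relationships flagged as external point outside the file.
                if rel.get("TargetMode") == "External" and target:
                    links.add(target)
    return sorted(links)
```

Calling extract_external_links("report.docx") on a local file (the file name is only an example) would return a list of target strings, which corresponds to the kind of value stored in externalLinks.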
Select Show Advanced Options.
In Bounded Capacity, define the maximum number of items to process concurrently.
Define the maximum number of items that can be queued.
Lowering either limit reduces memory use but increases the time taken.
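The trade-off between these two limits can be illustrated with a generic sketch. It is not how this product is implemented, and the names process_all, max_concurrency and max_queued are hypothetical stand-ins for the two settings. A fixed number of workers caps how many items are processed at once, and a bounded queue caps how many items wait in memory, so the producer pauses whenever the queue is full.

```python
import queue
import threading

_DONE = object()  # sentinel telling a worker that no more items will arrive


def process_all(items, handler, max_concurrency=4, max_queued=16):
    """Apply handler to every item using bounded workers and a bounded queue.

    Results are returned in completion order, not input order.
    This sketch assumes handler does not raise.
    """
    pending = queue.Queue(maxsize=max_queued)  # put() blocks while the queue is full
    results = []
    lock = threading.Lock()

    def worker():
        while True:
            item = pending.get()
            if item is _DONE:
                return
            out = handler(item)
            with lock:
                results.append(out)

    workers = [threading.Thread(target=worker) for _ in range(max_concurrency)]
    for w in workers:
        w.start()
    for item in items:
        pending.put(item)  # waits here when max_queued items are already waiting
    for _ in workers:
        pending.put(_DONE)  # one sentinel per worker
    for w in workers:
        w.join()
    return results
```

With smaller values, fewer items are held in memory at any moment, but the producer spends more time waiting and fewer items run in parallel, so the whole run takes longer.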