I have noticed a slow down of the Strings to Document node after manipulation of the text to include in the document. In the example workflow attached, the node executes in 2 mins over a PDF file of 1600 pages when no manipulation is performed, but in 18 mins when the regex filter is applied. Without getting into the practical value of the task, I would like to understand the reason for this dramatic slow down that, from a user’s perspective, is totally unexpected - if anything, the node should actually run faster as the regex node removes some terms. I have noticed the same issue when a string manipulation is performed after the Document Data Extractor node and before the conversion back to a document.
test_time_difference.knwf (33.7 KB)