I have a use-case where product data is pulled from the database and for every product one chart is produced and outputted as image. Currently I pull all data and iterate with a group loop, but that is rather slow as the dataset is huge.
I was wondering if there are better ways to do this: e.g. pull one product at a time within a for loop and produce the images while the next product is being pulled from the db. I am just not sure if this can work in a streaming approach.
What kind of charts are you producing in each loop iteration?
How many unique products do you have in your dataset?
A streaming approach may help in decreasing execution time for some of the nodes in your workflow. I would check to see how many streaming-enabled nodes you have in your workflow and combine those into a component for streaming. If you’re interested, here’s more information on streaming with KNIME (Streaming data in KNIME | KNIME) and an example workflow (Streaming - Text Processing – KNIME Hub).
Based on your comments I believe that integrating streaming into your workflow would be a great way to speed it up. I would also recommend trying that in conjunction with Columnar Table Backend.