I have a workflow (a few, actually) that reads a few million JSONs and transforms them to save them into different DBs. When I try to save the workflows, it takes over an hour. I assume it's due to the size of the data in them. However, there is zero disk activity, and no hardware limits are being hit. The storage is a RAID array with roughly 7k MB/s read and write speed. How can I solve this?
@nxfxcom The first question would be how large these files are. Every node will be saved together with its data unless you tell the workflow not to save certain parts.
Then you could opt for the columnar storage (https://www.knime.com/blog/improved-performance-with-new-table-backend), though it might have less of an effect on JSON files.
The lack of disk activity might be because the CPU is busy processing the data rather than writing it. You might also want to check the RAM settings in knime.ini to make sure they are sufficient for the size of your data.
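For reference, the relevant setting is the `-Xmx` line in knime.ini, which caps the Java heap KNIME can use. A minimal excerpt, assuming a machine with plenty of free RAM (the 16g value below is just an illustrative example; size it to your hardware, leaving room for the OS):

```
-vmargs
-Xmx16g
```

Note that knime.ini is a JVM arguments file, so each option goes on its own line and comments are not supported; edit the existing `-Xmx` line (often `-Xmx2048m` by default) rather than adding a second one.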