Hello,
I found the description of the chunk size parameter for the streaming execution a bit confusing when it states that “Larger chunk values yield better runtime”.
I understood that the data is streamed between nodes by chunk of rows according to this parameter.
In my case, I am using the streaming with the image processing nodes, so I achieve the best execution time by setting the chunck to 1 so that each row is directly streamed individually.
Would that not be the case with a classical table of string and digits for instance ?
Not really related to the previous question but in the job manager of individual nodes there is also now the “Test for streaming and distributed Processing” option. Any information about that ?
Thanks !