Random Loop Ideas

KnimeUser3003 · April 3, 2023, 2:55pm

Hi All,

This is not an issue but rather some guidance requested. I have tried a few things but this must be too advanced for my current understanding.

Say we have a flat file with a 100 columns and millions of rows. How does one create a workflow that, with use of a random seed, selects 1 value from each column, iteratively, for a stated amount of iterations. The number of rows in the final output will be equal to the stated amount of iterations. Each one of them labelled with an iteration number.
And for each iteration, the random seed used must be populated in its own column. So there will be 102 columns (100 columns + iteration number + random seed number) with x rows equal to the amount of iterations.

Thanks.

gonhaddock · April 3, 2023, 3:51pm

Hello @KnimeUser3003 and welcome to the KNIME community

I’m not sure if I captured the full scope of the description. There are probably a few ways to achieve this.

As for starting test, have a look to this workflow:

20230403_random_loop.knwf (29.4 KB)

BR

Daniel_Weikert · April 3, 2023, 3:51pm

I think with partioning node within a loop you could select with random seed. However if I understand correctly you also want to have a random row for each column so you would probably need to wrap this in a column list loop start too
Other option might be code snippets
br

gonhaddock · April 3, 2023, 3:59pm

@Daniel_Weikert
I have a doubt in this point. If you use a seed for each column in loop, and you want to save the output seeds; then you need a seed matrix with a value for each cell.

However the description request one seed per row.

To achieve a seed per column as well, the idea would be a nested Column List Loop Start and a loop end with two ports. (values, seeds).

BR

Daniel_Weikert · April 4, 2023, 4:22pm

Good catch @gonhaddock

KnimeUser3003 · May 17, 2023, 9:43am

Hi @gonhaddock, I think this is exactly what I needed, thank you!

Kind Regards,
KnimeUser3003

system · May 24, 2023, 9:43am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.