Partitioning node: Stratified sampling by integer

Hi Knimers

I'm following a MOOC about ML in R where they do 2 things that I don't know how to do in KNIME yet.

1 - Stratified sampling using an integer variable (the Partitioning node only shows strings);

2 - Set a seed to 101 (?)

Any help on this topic?

Tks

For 1: You need to convert the int into a number beforehand.

For 2: We have various nodes in which you can define a seed.

Hi Iris

For my first task: splitting the data with "stratified sampling" on a number column is not possible with the Partitioning node. It only offers string columns as an option for "stratified sampling".

Can you take a look, please?

As for the seed question, I think I get it. If I set a fixed random seed in the Partitioning node, it will reproduce the same partitioning on every execution.
 

Tks for your help

Sorry, there was a typo above. I meant you need to convert the integer into a string with the Number To String node. You may also need the Domain Calculator after the conversion. Then the Partitioning node will recognize the column.

 

Best, Iris
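
For anyone comparing this with the R course, here is a minimal Python sketch of what the two settings amount to: a split that samples the same fraction from each value of an integer column, driven by a fixed seed of 101. It is only an illustration and not how the Partitioning node is implemented; the function name `stratified_split`, the column names, and the toy data are all made up for the example.

```python
import random
from collections import defaultdict


def stratified_split(rows, key, fraction, seed):
    """Split rows in two, taking `fraction` of every stratum for the first part.

    Hypothetical helper for illustration only, not KNIME code.
    """
    rng = random.Random(seed)  # fixed seed: the same split on every run
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)  # one group per distinct integer value

    first, second = [], []
    for value in sorted(groups):  # fixed group order keeps the result stable
        members = list(groups[value])
        rng.shuffle(members)
        cut = round(len(members) * fraction)
        first.extend(members[:cut])
        second.extend(members[cut:])
    return first, second


# Toy table: 30 rows, integer column "label" with values 0, 1, 2 (10 rows each).
rows = [{"id": i, "label": i % 3} for i in range(30)]

train, test = stratified_split(rows, "label", 0.7, seed=101)
train_again, _ = stratified_split(rows, "label", 0.7, seed=101)

print(len(train), len(test))
print(train == train_again)  # same seed, same partition
```

Because the sampling is done per group, each value of the integer column keeps the same share in both partitions, and reusing the seed gives the identical split again.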
