When do I use stratified sample with the partitioning node?

LilianaGarcia · July 27, 2020, 4:58pm

Hi!

Reviewing the knime example, I saw that in the “example for learning a decision tree”, it is used the option stratified sampling but I don’t know why is that?

If somebody know when is recommend to use this option, it would be really great!

elsamuel · July 27, 2020, 5:14pm

This is a statistics question, not a KNIME-specific question.

You’d use stratified sampling if your original dataset can be divided into subpopulations and you want each subpopulation to be appropriately represented in your final partitioned dataset.

LilianaGarcia · July 27, 2020, 9:09pm

Thank you so much for your help.

ipazin · July 28, 2020, 11:46am

Hello!

True but considering this is related to KNIME workflow example I find this question ok. Also we solved and addressed bunch of DB and other non-KNIME related problems and this is far more related to KNIME purpose IMHO

Br,
Ivan

system · January 26, 2021, 11:46pm

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.