How to split data into training/validation/testing

Nafeeza86 · December 30, 2019, 3:37am

Hi,

Does anyone know how to partition the dataset into 3 sets: training, validation and testing in Knime?.
In many of the Knime tutorials, I see that they partition the dataset into training and testing, but I still cannot figure how to split it into 3.

I tried dividing the data into 3 sets by using two partitioning nodes in succession but it didn’t work.

Any help would be appreciated. Thanks.

mlauber71 · December 30, 2019, 11:10am

This should be quite easy. Please check out this example

Nafeeza86 · December 30, 2019, 4:55pm

Thanks for your response.

My question is how do you split the data into training, testing and validation before you feed it into the neural network. Can you provide me an example.

AlexanderFillbrunn · December 30, 2019, 5:01pm

Hi @Nafeeza86,
that is what the Partitioning node in @mlauber71’s example does.
Kind regards
Alexander

deicide_bg · January 2, 2020, 12:22pm

As described in the picture, you divide your dataset into three sets and label them accordingly in several steps.
Now, you may want to use one dataset only for train+test, then attach new, fresh data as validation set. You would have two sources which would need to go through the same processing and differ by only applying a model (cooked algorithm) to the last two sets - test and validation.

Nafeeza86 · January 2, 2020, 4:47pm

Hi,

Thanks for your explanation. I am clear now on how the data is partitioned and labeled. My question is how would you connect each of the partitioning nodes to the neural network learner and the predictor nodes. If you look at the example below, the data is first partitioned into training and test set, where the training set is fed into the learner node and the test set into the predictor. How can I accommodate the workflow from this forum into the one below (see attached). Can you give me a hint of how to connect the nodes. Thanks.

Kind regards,
Bibi

mlauber71 · January 4, 2020, 3:56pm

That would look something like this:

My suggestion would be to take a deep look at some learning resources for KNINME like

Or you might want to use the (free) Udemy KNIME bootcamp

system · July 5, 2020, 3:56am

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.