Hi,

I’m sorry if this questions seems quite basic but I am new to Knime and I am a little confused on some aspects of clustering.

I have a dataset with categorical variables (10) and I would like to perform clustering on this data. I have used to k-means algorithm to try to cluster the data. I tried to hotencode all of my categorical variables using the one-to-many node. I have set the number of clusters to 5 and have a result. I guess my question is, is this the right way of approaching this problem or should I be using k-mode clustering instead of k-means?

Example of data:

Sex, Location, Language, Process Stage

Any help would be greatly appreciated!

Eric