K-means Cluster Analysis Homework Help

NathanV · October 6, 2023, 2:33am

This is the prompt for the homework we were given. I’m not sure how to filter the clusters, or what it means when it says clusters 2, 3, 4, 5. I was thinking maybe each cluster could be a car type, like race car, family car, luxury car, etc. Not sure if I’m on the right track.

This is the data file we were given to construct these k-means clusters from

BIT-445-RS-Automobiles.xlsx (22.8 KB)

Any help would be extreeeeeeeeeemely appreciated, and I’d forever be in your debt, thank you

Daniel_Weikert · October 6, 2023, 2:38pm

K-Means clustering normally needs numerical data to calculate the distance measure it uses, so you probably need to encode your data first. Then you can try different numbers of clusters, e.g. k = 2, 3, 4, 5, and see which one gives the best metric.
br
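To make the "try k = 2, 3, 4, 5 and compare a metric" idea concrete outside KNIME, here is a minimal plain-Python sketch of Lloyd's k-means algorithm. It is only an illustration: the `points` are made-up two-column numeric values, not taken from the homework file, and `kmeans` and `squared_distance` are hypothetical helper names. The metric compared here is the within-cluster sum of squares (inertia), which is one common choice among several.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length numeric tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iterations=100, seed=0):
    """Plain Lloyd's algorithm. Returns (labels, centroids, inertia)."""
    rng = random.Random(seed)  # fixed seed so repeated runs match
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    # Inertia: total squared distance from each point to its centroid.
    inertia = sum(squared_distance(p, centroids[lab]) for p, lab in zip(points, labels))
    return labels, centroids, inertia


# Made-up numeric data (already encoded and scaled), purely for illustration.
points = [
    (1.0, 1.1), (1.2, 0.9), (0.8, 1.0),
    (5.0, 5.2), (5.1, 4.9), (4.9, 5.0),
    (9.0, 1.0), (9.2, 1.1), (8.9, 0.8),
]

# Run the clustering once per candidate number of clusters.
results = {k: kmeans(points, k)[2] for k in range(2, 6)}
for k, inertia in results.items():
    print(f"k={k}: inertia={inertia:.3f}")
```

Inertia tends to keep shrinking as k grows, so people usually look for the k where the improvement levels off rather than simply picking the smallest value.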

NathanV · October 8, 2023, 6:20pm

how would I encode my data?

Daniel_Weikert · October 9, 2023, 4:28pm

There are various ways. One would be one-hot encoding, which can be done with the One to Many node in KNIME. You might also explore other options.
br
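For anyone wondering what one-hot encoding actually produces, here is a small plain-Python sketch. It is not what the KNIME node runs internally, just the same idea: each distinct category becomes its own 0/1 column. The `body_styles` column and the `one_hot_encode` helper are hypothetical and not taken from the homework file.

```python
def one_hot_encode(values):
    """Return (categories, rows); each row has a single 1 in its category's column."""
    categories = sorted(set(values))
    index = {cat: i for i, cat in enumerate(categories)}
    rows = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1
        rows.append(row)
    return categories, rows


# Hypothetical categorical column, for illustration only.
body_styles = ["sedan", "coupe", "suv", "sedan"]
categories, encoded = one_hot_encode(body_styles)

print(categories)
for style, row in zip(body_styles, encoded):
    print(style, row)
```

After this step every row is purely numeric, so a distance between two rows can be computed.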

mlauber71 · October 10, 2023, 2:36pm

@NathanV a few resources about clustering and K-Means that might help you:

If you know a class you can see what would be a good number of clusters:

Optimizing a “silhouette coefficient”:
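As a rough illustration of what a silhouette coefficient measures, here is a plain-Python sketch. For each point it compares `a`, the mean distance to the other points in its own cluster, with `b`, the mean distance to the points of the nearest other cluster, and scores the point as (b - a) / max(a, b). The `silhouette_score` helper, the toy `points`, and both label lists are made up for this example.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette over all points; values lie between -1 and 1."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # common convention for single-point clusters
            continue
        # a: mean distance to the other members (distance to itself is 0).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for key, other in clusters.items()
            if key != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Two obvious groups, labelled once sensibly and once badly.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
good = silhouette_score(points, [0, 0, 1, 1])
bad = silhouette_score(points, [0, 1, 0, 1])
print(f"sensible labels: {good:.3f}, poor labels: {bad:.3f}")
```

A higher average silhouette suggests tighter, better separated clusters, so one option is to compute it for each candidate k and prefer the k with the highest value.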
