I have a task to do for a school project, but I am a newbie with Knime.
Main task is to prove or disprove the following claim:
Customers can be clustered based on where they shop the most.
- Decide on which columns to perform clustering to best analyze the question.
- Decide on algorithm/library to use.
- Provide graphically the clustering results to demonstrate if the claim is true.
I removed the empty fields with Row Filter. I also noteced that there is a incorrect values in the dataset like “max_distance_to_shops” column, but I can’t find a way to filter them. I am thinking about using IF construction in Row filter. Also I am unable to decide which column/s should I use for clustering the data.
I will be very thankful if someone could help me out.
There is a dropbox link for the dataset below: