A table in which sold items are listed per row. Each item has an order number. If several items were purchased in one order, they are then displayed twice.
Order number 1 cell phone
Order number 1 charging cable
Order number 1 cell phone cover
Now I want to try to evaluate with the Apriori. Unfortunately, it always tells me that it cannot process strings. Even if I group the order numbers into a list
Order number 1 cell phone cover, cell phone, charging cable
Sadly not, because if i use “Group By” then it converts me all items as a string in a list. When i try to use the “Cell Splitter” it says "No column in spec compatible to “StringValue”.
I use “GroupBy” and select the order number at first and after that i say that every item should be put in a list, because there is no feature that lets me put it in a string.
Someone said “I believe that this is the problem. I think that the data should be formatted like this:” but didint show how i can get there. The biggest problem ist that my table looks like following:
I first need to group everything by the order number, because in the following picture you see, that there are multiple order numbers with different items:
This runs with the “Split Collection Column” node. I have absolutely 0 experience setting the Apriori node parameters. Maybe someone with expertise can weigh in. I added an Association Rule Learner node which works directly with lists. Its underlying basis is the Apriori algorithm.