I’m trying to perform X-Y-Z analysis, using “All Space Missions from 1957” dataset downloaded from Kaggle. To do it i need to have information about Launch Vehicle, which is inside of “Detail” column. The problem is that i tried to put different separator, and the data are splitted, but not in all cases, i tried to implement several Cell Splitter Nodes, using sometimes as a separator “,” sometimes “|”, “white space | white space”, but as I found out there are a lot of different variations of characters used inside “Detail” column… Does it mean that this is too difficult task to implement all options of separators to split the data in my case? Or Knime has any life-huck to implement all options?
I did a quick test with “,” and “|” as separators and with a simple prompt describing what I want it came very close… based on the formula it is then possible to either extend it to clean out the separators and spaces or you can do that in additional nodes e.g. string manipulation.
Given that you seem to be keen to explore and learn to solve these challenges I won’t built a full solution for now - let me know if you are entirely stuck and I’ll have a crack
@MartinDDDD Martin, thanks a lot!!! This approach works!!
I used as you said Expression Node, then I put the following prompt into K-AI Assistant: “Column Detail is a string with a separators. Split the string and show the left part excluding the separator” and the Asistant provided me the following “substring($[“Detail”], 1, find($[“Detail”], “|”) - 1)” in the Expression editor. And then everything is implemented as I want.
For information for users, who touch the similar problem the pic with the solutiob is attached.