Cell Splitter in terms of Launch-Vehicle

Felis90 · October 10, 2024, 8:43pm

Hello collegues,

I’m trying to perform X-Y-Z analysis, using “All Space Missions from 1957” dataset downloaded from Kaggle. To do it i need to have information about Launch Vehicle, which is inside of “Detail” column. The problem is that i tried to put different separator, and the data are splitted, but not in all cases, i tried to implement several Cell Splitter Nodes, using sometimes as a separator “,” sometimes “|”, “white space | white space”, but as I found out there are a lot of different variations of characters used inside “Detail” column… Does it mean that this is too difficult task to implement all options of separators to split the data in my case? Or Knime has any life-huck to implement all options?

Best regards,
Ekaterina

MartinDDDD · October 11, 2024, 6:35am

Hey there,

for cases like this I’m a big fan of the new Expressions Node combined with K-AI Assistant:

I did a quick test with “,” and “|” as separators and with a simple prompt describing what I want it came very close… based on the formula it is then possible to either extend it to clean out the separators and spaces or you can do that in additional nodes e.g. string manipulation.

Given that you seem to be keen to explore and learn to solve these challenges I won’t built a full solution for now - let me know if you are entirely stuck and I’ll have a crack

Felis90 · October 11, 2024, 1:15pm

@MartinDDDD Martin, thanks a lot!!! This approach works!!

I used as you said Expression Node, then I put the following prompt into K-AI Assistant: “Column Detail is a string with a separators. Split the string and show the left part excluding the separator” and the Asistant provided me the following “substring($[“Detail”], 1, find($[“Detail”], “|”) - 1)” in the Expression editor. And then everything is implemented as I want.
For information for users, who touch the similar problem the pic with the solutiob is attached.

system · October 18, 2024, 1:15pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.