Hey all, our system generates mixed data and downloads it in a file that looks like this:
Basically it is a pairwise dataset with some identical columns, some missing columns and some “new” columns. My job is to sort this pairwise data set and aggregate the same data into the appropriate columns. So the output should look like this:
I am afraid that will result in a complicated workflow with loops and splitters, but your help is highly appreciated, because I am quite new to KNIME:-)
You always have 2 rows?
You could first get rid of the original header row. You split your first 2 entries from the rest with row splitter. You use Row to column names node to make your first row as the header. For the second split you now use chunk loop with size of 2 and use row to column names to generate the names. Then you use concatenate (with intersection of columns) to add the loop iteration to the split you created first when splitting your data.
There might be better options but that’s sth I could think of to give you some guidance.
br