Hello everyone,
I have sequences of data stored in different columns. Since my network diagrams can only take sequences with max length of 2 I need to split longer sequences.
In my Input table you see my first sequence: AGFA (splitted in multiple columns) that I need to manipulate in a way that the max length is only allowed to be 2 (see first output table)
After archiving the first step some entries occure two times. e.g. F originally occurs one time in the sequence but with this manipulation it occurs two times (in GF and FB). Therefore I need a way to double all the entries, that are not doubled yet so I can divide the sum by 2 in my diagram later. (see next Step (double entries) table)
In the last step I need to remove all entries in the column “Sequence” except for one time PER Topic (Same sequences can occur multiple times in different topics). So I can just count the times of occurence per whole Sequence.
I know its a lot but I hope someone can help me with this
Thank you in advance!