Split tables depending on value differences in columns

AWood · September 24, 2024, 9:01am

Dear Knime Community,

I need to create subsets of a table depending on values in a column. The table is shown below. The column of interest is the “Value” one. I need subset tables of 5 rows with values increasing by at least 5. For example, a subset table would be: ID 1, 10, 15, 14, 5, because ID 10 (198) is at least 5 higher than ID 1 (190), ID 15 (205) is at least 5 higher than ID 10 (198), etc.
Another subset would be 2, 12, 8, 11, 3.

Do you know a way to do that?

Many thanks in advance !

ID	Value	Code
1	190,1588	124863
2	190,2019	548685
3	234,001	524856
4	190,6738	526458
5	225,2875	526358
6	242,0726	586744
7	199,0081	985478
8	209,245	539874
9	201,1485	542868
10	198,2239	568425
11	222,4592	987568
12	198,6527	635241
13	272,7346	635245
14	217,2901	542865
15	205,2132	658965

mwiegand · September 24, 2024, 9:44am

Hi @AWood,

I can’t quite follow your logic …

Can you rephrase it please?

Best
Mike

AWood · September 24, 2024, 10:01am

Thanks for your reply !
In other words, subset 1 would be:
1, 10, 15, 14, 5, because 190 < 198 < 205 < 217 < 225, with a difference of at least 5 between consecutive increasing values.
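
For readers who want to check the rule mechanically, here is a minimal Python sketch. It is not part of the original thread or of any KNIME workflow. The `VALUES` dictionary is transcribed from the table in the first post (decimal commas converted to points), and the names `is_valid_chain` and `min_gap` are hypothetical, chosen only for illustration.

```python
# Values from the table in the first post, decimal commas converted to points.
VALUES = {
    1: 190.1588, 2: 190.2019, 3: 234.001, 4: 190.6738, 5: 225.2875,
    6: 242.0726, 7: 199.0081, 8: 209.245, 9: 201.1485, 10: 198.2239,
    11: 222.4592, 12: 198.6527, 13: 272.7346, 14: 217.2901, 15: 205.2132,
}


def is_valid_chain(ids, values=VALUES, min_gap=5.0):
    """Return True if each value exceeds the previous one by at least min_gap."""
    chain = [values[i] for i in ids]
    return all(b - a >= min_gap for a, b in zip(chain, chain[1:]))


print(is_valid_chain([1, 10, 15, 14, 5]))  # first subset from the post -> True
print(is_valid_chain([2, 12, 8, 11, 3]))   # second subset from the post -> True
print(is_valid_chain([1, 2, 4, 10, 12]))   # IDs 1, 2 and 4 are too close -> False
```

Both subsets given in the thread pass the check, which confirms that the IDs are listed in order of increasing value rather than in ID order.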

Best

mwiegand · September 24, 2024, 10:11am

Thanks for the additional details. What is difficult to comprehend is the logic behind 1,10,15,14,5, which relates to the ID I suppose, and the other sub-set 2, 12, 8, 11, 3.

Could it be that you want to accomplish the following:

1. Take the first row
2. Identify the next row whose value difference to the first selected is >5
3. Skip / reprocess the rows which do not match in the next iteration
4. Continue until the end of the table
5. Re-iterate over skipped rows

Best
Mike

AWood · September 24, 2024, 11:37am

Thanks for your feedback.
This relates to the ID indeed.
Points 1 and 2 you mention are right. Then I need it to repeat until it has found 5 values, each at least 5 higher than the previous one. The first subset would then be ID 1->5->10->14->15. Sorry, I used the numeric order in my previous explanation. It would then make a first subset table of these 5 rows.
Next, iterate with the remainder table (without the 5 rows of the first extracted subset) and create new subsets. The second subset would be ID 2->3->8->11->12.
Thanks again for your interest and support !
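
The procedure described in this post can be sketched in Python as follows. This is only one possible reading, not the workflow shared in the thread: it assumes the rows are first sorted by value and that each pass greedily takes the first row that is at least 5 above the last one picked. The names `split_into_chains`, `min_gap` and `size` are hypothetical.

```python
# Values from the table in the first post, decimal commas converted to points.
VALUES = {
    1: 190.1588, 2: 190.2019, 3: 234.001, 4: 190.6738, 5: 225.2875,
    6: 242.0726, 7: 199.0081, 8: 209.245, 9: 201.1485, 10: 198.2239,
    11: 222.4592, 12: 198.6527, 13: 272.7346, 14: 217.2901, 15: 205.2132,
}


def split_into_chains(values, min_gap=5.0, size=5):
    """Greedily peel off chains of `size` rows whose values rise by >= min_gap.

    Returns (complete_chains, leftover_ids).
    """
    # Assumption: rows are processed in ascending order of value.
    remaining = sorted(values, key=values.get)
    chains = []
    while remaining:
        chain, skipped = [], []
        for row_id in remaining:
            fits = not chain or values[row_id] - values[chain[-1]] >= min_gap
            if len(chain) < size and fits:
                chain.append(row_id)
            else:
                skipped.append(row_id)  # retried in the next pass
        if len(chain) < size:
            # Not enough rows left to complete another subset: stop here.
            return chains, remaining
        chains.append(chain)
        remaining = skipped
    return chains, remaining


chains, leftover = split_into_chains(VALUES)
print(chains)    # [[1, 10, 15, 14, 11], [2, 12, 8, 5, 3]]
print(leftover)  # [4, 7, 9, 6, 13]
```

Under these assumptions the greedy pass picks ID 11 (222.4592) after ID 14 (217.2901), because it is the first value at least 5 higher in sorted order. The hand-picked subsets in the post use ID 5 there instead. Both satisfy the rule, so the rule as stated does not determine a unique split.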

JPollet · September 24, 2024, 4:21pm

Hello,
@AWood can you explain why you want 1->5->10 etc and not 1->3->5-> etc
ID 3 is higher than ID 1 by 5. Did you suppose that the values have to be in increasing order ?

Best,
Joel

Edit: OK, I now understand the “a difference of at least 5 between increasing values” part. My question is beside the point!

HansS · September 24, 2024, 6:54pm

Hi @AWood

Welcome to KNIME Forum.

See if this workflow, split tables depending on value differences.knwf (108.9 KB), works for you. It may be a little complex with a nested Recursive Loop, but step through it and see for yourself. The “step loop execution” option in the End Loop node can be helpful here to understand what is happening.

gr. Hans

mwiegand · September 24, 2024, 8:29pm

@AWood here is a starting point.

Intriguingly, it does not come to the same conclusion as you, since the next closest ID after ID 14 (value 217,2901) is ID 11 (value 222,4592), not ID 5. Furthermore, ID 5 is getting removed / skipped.

What I haven’t fully comprehended is the result around ID 2 and 4. Mostly because it’s late, and I also don’t want to take away all the fun of learning something new.

Happy Kniming
Mike

AWood · September 25, 2024, 8:07am

Dear Community, dear Mike, dear Hans,
Thank you very much! Yesterday was my second day on KNIME, I was still looking for the best channels to learn about it, and BOOM: two of you come with different solutions. I need some time to test them and learn the different nodes to get inspired for future workflows. I’ll get back to you asap and hope to become proficient enough to take my turn helping the community as you do so well!
Thanks again and see you later!

mwiegand · September 25, 2024, 8:19am

You are most welcome!