Lag Node

AL1986 · March 1, 2017, 1:59pm

Hi,

I'm trying to create a new column which will hold the LAG -1 value (the value from the previous row), but it needs to restart from the beginning for every customer.

For example:

Cust, Value, Lag_Value
1, 10, ?
1, 20, 10
1, 30, 20
2, 100, ?
2, 200, 100
2, 300, 200

How do I restart the Lag function every time the customer changes?
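To make the problem concrete, here is a minimal Python sketch (plain Python, not KNIME; `plain_lag` is a hypothetical helper made up for illustration) of what an ordinary lag does on the table above: it carries the last value of one customer into the first row of the next.

```python
def plain_lag(values, lag=1):
    """Shift a list down by `lag` positions, padding the start with None."""
    return ([None] * lag + values)[:len(values)]


cust = [1, 1, 1, 2, 2, 2]
value = [10, 20, 30, 100, 200, 300]

lagged = plain_lag(value)    # lag applied to the whole table at once
prev_cust = plain_lag(cust)  # which customer each lagged value came from

# Rows whose lagged value was taken from a different customer.
leaks = [
    (c, v, l)
    for c, v, l, p in zip(cust, value, lagged, prev_cust)
    if p is not None and p != c
]
print(leaks)
```

The single leaked row is the first row of customer 2, which picks up 30 from customer 1 instead of a missing value.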

Thanks,

AL

Geo · March 1, 2017, 10:26pm

You should use Lag within a Group Loop construct.

Tyler · March 18, 2020, 3:18pm

Here’s how I handled the ‘lag’. Based on @Geo's response I came up with this little workflow. I have no idea if it’s what “Group Loop construct” means, but it was enough to push me in a direction…

The need: repeat the lag in the next column over and restart it for each group, otherwise the lag is wrong whenever a new “group” starts (if there is a lag on that iteration).

I built two streams; the top stream is a Group By on what I need to “lag by”.

The loop starts in front of the Group By. The Table Row to Variable node lets me grab these columns, per loop iteration, and insert them into the Row Filters, per column.

Then, in order:

lag column (choose your column),
row id (helps remove the overflow),
row splitter = overflow*

More on the RowID in the 3 steps above: when the lag causes an overflow, KNIME accounts for this by making a NEW row of data and adding the text “overflow” to its RowID. The RowID node effectively PUSHES that string into a column and builds a nice, fresh RowID column for you again; then we filter.
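A rough Python sketch of the overflow idea described above, for anyone who finds it easier to read as code. This is an illustration only, not KNIME's implementation: the helpers `lag_with_overflow` and `drop_overflow`, the `row_id` field, and the exact `overflow_0` naming are all assumptions based on the description in this post.

```python
def lag_with_overflow(rows, column, lag=1):
    """Add a lagged copy of `column`, keeping values pushed past the last row.

    Each trailing "overflow" row has None in every original column and a
    row id starting with "overflow" (assumed naming, for illustration only).
    """
    if not rows:
        return []
    columns = list(rows[0].keys())
    n = len(rows)
    out = []
    for i in range(n + lag):
        if i < n:
            new_row = dict(rows[i])  # real input row
        else:
            new_row = {name: None for name in columns}  # artificial row
            new_row["row_id"] = f"overflow_{i - n}"
        src = i - lag  # index of the row the lagged value comes from
        new_row[f"{column}_lag"] = rows[src][column] if src >= 0 else None
        out.append(new_row)
    return out


def drop_overflow(rows):
    """Mimic the 'row splitter = overflow*' step: keep only real rows."""
    return [r for r in rows if not r["row_id"].startswith("overflow")]


rows = [
    {"row_id": "Row0", "Cust": 1, "Value": 10},
    {"row_id": "Row1", "Cust": 1, "Value": 20},
    {"row_id": "Row2", "Cust": 1, "Value": 30},
]
lagged = lag_with_overflow(rows, "Value")
cleaned = drop_overflow(lagged)
print([r["row_id"] for r in lagged])
print([r["Value_lag"] for r in cleaned])
```

The extra row carries the last value (30) with nothing else in it, which is why it gets matched on its row id and filtered out.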

Sometimes there’s NO overflow, which means your lag math cancels out; this stream takes that into account. I needed to do 3 iterations of the lag column, for deaths, recovery, and confirmed coronavirus cases.

Thoughts: I assume that concat()-ing these two columns, and doing the same in the stream, would simplify the ask to 1 red line and one filter column. However, I don’t think optimization is necessary on this stream, and I may be overcomplicating things by even placing this bit of information here… but if you’re like me, you want to optimize this too.

Conclusion: the 3 lag processes feel identical and could be looped with some crafty KNIME’ing. This is my first pass, so I’m keeping it simple today().

best,
t

PS. Feel free to share a better way of doing this process, because I’m a noobie.

ipazin · March 19, 2020, 8:11am

Hi there @Tyler,

Group Loop construct, I assume, means using the Group Loop Start node. For the example from the OP, this should do the trick:

GroupLag

Br,
Ivan

Tyler · March 19, 2020, 8:24pm

Awesome, I’ve been wildcard-searching the tools lately and didn’t see this one. Thanks @ipazin!!

Geo · April 27, 2020, 9:54am

Yes, sorry for the confusing terminology.
By Group Loop construct I did indeed mean: Group Loop Start … Loop End.
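For readers who think in code, the shape of that construct can be sketched in plain Python. This is only an analogy under my own assumptions, not how the KNIME nodes are implemented; `lag_chunk` and `group_loop_lag` are hypothetical names. The loop hands over one customer's rows at a time, the lag is applied to that chunk alone, and the results are collected at the end.

```python
from itertools import groupby
from operator import itemgetter


def lag_chunk(values, lag=1):
    """Lag a single group's values; the first `lag` entries become None."""
    return ([None] * lag + values)[:len(values)]


def group_loop_lag(rows, key="Cust", column="Value"):
    """Lag `column` separately for each value of `key`."""
    # Stable sort: rows of the same group keep their original order.
    ordered = sorted(rows, key=itemgetter(key))
    collected = []  # plays the role of the loop end, gathering all chunks
    for _, chunk in groupby(ordered, key=itemgetter(key)):
        chunk = list(chunk)  # one iteration = one group
        lags = lag_chunk([r[column] for r in chunk])
        for row, lagged in zip(chunk, lags):
            collected.append({**row, "Lag_Value": lagged})
    return collected


rows = [
    {"Cust": 1, "Value": 10},
    {"Cust": 1, "Value": 20},
    {"Cust": 1, "Value": 30},
    {"Cust": 2, "Value": 100},
    {"Cust": 2, "Value": 200},
    {"Cust": 2, "Value": 300},
]
result = group_loop_lag(rows)
print([r["Lag_Value"] for r in result])
```

Because each chunk is lagged on its own, the first row of every customer gets a missing value, which matches the "?" rows in the original question.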

Tyler · April 29, 2020, 10:44pm

Don’t be sorry @Geo - your post came at the perfect time for me to learn how to force the loop tool to do a group loop, and it later led me to optimizing it. I appreciate the help.