Repeat the standard deviation calculation for 3 cells at a time in a column

francescospa · June 1, 2022, 3:28pm

Hello,
I’m a new user in Knime, and I don’t know how to solve this problem.

After several nodes I was able to screen and sort my data, and now I would like to calculate the standard deviation for 3 cells at a time.

I tried to use loop nodes, but maybe I didn’t quite understand how to do it. Or would it be better to use other nodes?

Thank you so much for your support !

eamendola · June 1, 2022, 4:40pm

@francescospa Welcome to the Knime Forum.

From what I understand, you want to calculate the SD of those encircled rows. It should be pretty easy with the GroupBy node if you have some other column that helps you group those three lines.

Something like this:

Are you able to do this ?

Doru · June 1, 2022, 4:40pm

Hi francescospa,

Create a loop that starts with a Chunk Loop Start node and set the “Rows per Chunk” in the Configuration window to 3. At each iteration you will have a set of 3 rows from your input table.
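In case it helps to see the logic outside KNIME, here is a minimal Python sketch of what the chunked loop computes. It is not KNIME code: the function name `stdev_per_chunk` and the sample values are made up for illustration, and it assumes the sample standard deviation (n - 1 in the denominator) is the one wanted.

```python
from statistics import stdev


def stdev_per_chunk(values, rows_per_chunk=3):
    """Return one standard deviation per consecutive chunk of rows.

    A trailing chunk with fewer than 2 values has no sample standard
    deviation, so None is returned for it instead.
    """
    results = []
    for start in range(0, len(values), rows_per_chunk):
        chunk = values[start:start + rows_per_chunk]
        results.append(stdev(chunk) if len(chunk) >= 2 else None)
    return results


# Made-up column values: rows 0-2 form the first chunk, rows 3-5 the second.
measurements = [1.0, 2.0, 3.0, 4.0, 6.0, 8.0]
print(stdev_per_chunk(measurements))
```

Each list entry corresponds to one loop iteration, i.e. one set of 3 rows.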

Hope this helps,
Doru

eamendola · June 1, 2022, 4:46pm

What @Doru suggests looks like this:

I think this is a more intelligent approach since you may not have a grouping column. See that the results are the same.
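If a grouping column is missing, one can also be derived from the row position, which is why both routes should agree. A rough Python sketch of that idea follows; it is not KNIME code, the function name `stdev_by_group` and the data are hypothetical, and it assumes the sample standard deviation is intended.

```python
from collections import defaultdict
from statistics import stdev


def stdev_by_group(values, group_size=3):
    """Group rows by a synthetic key (row index // group_size) and
    return {group_key: sample standard deviation of that group}.

    Groups with fewer than 2 rows get None, since a sample standard
    deviation is undefined for them.
    """
    groups = defaultdict(list)
    for row_index, value in enumerate(values):
        groups[row_index // group_size].append(value)
    return {
        key: (stdev(rows) if len(rows) >= 2 else None)
        for key, rows in groups.items()
    }


# Made-up values: rows 0-2 get key 0, rows 3-5 get key 1.
print(stdev_by_group([1.0, 2.0, 3.0, 4.0, 6.0, 8.0]))
```

Because the synthetic key puts rows 0-2, 3-5, and so on together, the groups contain the same rows as chunks of 3 would.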

francescospa · June 9, 2022, 3:41pm

Finally today I was able to try your advice @eamendola @Doru !!! Great, the loop works perfectly on my workflow. Thank you so much for your support !

francescospa · June 14, 2022, 7:43am

Hello,
I have another question related to the same problem. Is it possible in Knime to perform this type of calculation directly?
I would like to iterate the calculation as shown in the pic.

Right now I am able to iterate over three rows at a time, but how can I do the same for each row together with the 2 rows below it?

Thank you for your time !

Doru · June 14, 2022, 2:45pm

Hi @francescospa,

Sorry, not sure I understand the question.

I assume that, if the rows are R0, R1, R2, R3, you want to calculate the st.dev for rows 0,1,2, after that for rows 1,2,3, and so on?

If so, then your final output is a union/join of:

what you already did (see rows 0, 3, 6, 9 in your picture), together with
a similar loop that skips the first row of the original table. This will create the st.dev for rows 1,2,3 of the original/full table, as row 0 is not available anymore, and finally
another similar loop that skips the first 2 rows of the original table. This will create the st.dev for rows 2,3,4 of your original/full table.

You join the 3 tables and get the desired result.
Check the results of the last 2 calculations, as their st.dev will be calculated on only 2 records and 1 record, which you may not want.
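To make the idea concrete, here is a small Python sketch of the three offset loops and the final union. It is only an illustration, not KNIME code: the names `chunk_stdevs` and `rolling_stdevs` and the sample values are made up, and it assumes the sample standard deviation is the one needed.

```python
from statistics import stdev


def chunk_stdevs(values, size=3, offset=0):
    """Chunk the table after skipping `offset` leading rows.

    Returns {original_row_index_of_chunk_start: st.dev of that chunk}.
    Chunks with fewer than 2 values get None (no sample st.dev exists).
    """
    result = {}
    for start in range(offset, len(values), size):
        chunk = values[start:start + size]
        result[start] = stdev(chunk) if len(chunk) >= 2 else None
    return result


def rolling_stdevs(values, size=3):
    """Union of the `size` offset loops, ordered by starting row."""
    merged = {}
    for offset in range(size):  # offsets 0, 1, 2 when size is 3
        merged.update(chunk_stdevs(values, size, offset))
    return [merged[row] for row in sorted(merged)]


# Made-up values; entry i is the st.dev of rows i, i+1, i+2.
print(rolling_stdevs([1.0, 2.0, 3.0, 5.0, 8.0]))
```

The last two entries come from windows of only 2 rows and 1 row, which is the tail that may need cleaning up.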

Please let me know if my assumption is incorrect.

If my explanation is not clear enough, let me know and I’ll try to make a pic for it.

francescospa · June 15, 2022, 7:29am

Thank you! The assumption is correct and I think it could work in my workspace. The pic is an example: I do this after a “sort”, and depending on the input data the first row can be totally different every time, even in terms of RowID. Therefore, I thought that I could automate the deletion of the first or second row by creating a new “number sequence” column after the “sort”. I could use a loop with “Counter Generator” to assign a sequence of numbers, and then use “Row filter” to delete row 0 in one branch and rows 0 and 1 in another. Then I follow your steps.

I hope I have explained myself clearly. Let me know if there are easier ways.

Thanks again!

Doru · June 15, 2022, 1:42pm

Not sure if it is simpler, but this is the way I would try first:

All loops are based on the same table that has the first row removed in sequence.
The current processing loop can include some cleanup of the last records if you feel that is required.

francescospa · June 16, 2022, 8:00am

Thank you! The workflow works very well.

Doru · June 16, 2022, 2:21pm

Happy to hear that. Good luck with your project!
