Count number of total row values for specific criteria

BG_SST · April 19, 2022, 11:50pm

Hello,

I have a larger dataset consisting of fields such as Column 1 being name (1, 2, 3,etc.), and Column 2 being an ID, and I wanted to perform a function that would count up the number of ID occurrences per name.

Ideally, the result would be the unique name and the total count of each ID per name. Example:
Name 1 = 6
Name 2 = 3
Name 3 = 8

I’ve tried variable loops and Rank/String Manipulation but was not able to obtain the result above.

Any help would be great!

data_test

eamendola · April 20, 2022, 12:51am

Hi @BG_SST, welcome to the Knime forum.

What you are looking for is counting the occurrences by grouping the column1, so a GroupBy node is required.

After you create/read your input data, add the GroupBy node and on the right side, include the column you want to group, this would create one row per value in the colum1:

Then, in the next tab of the node, which is Manual Aggregation you should add the column you want to count, being column2 in this example, and the Count function which will count the no. of rows per each value on the column1:

This should produce the following output

iCFO · April 20, 2022, 1:12am

For more complex conditional sums, you can also start with if statement in a Column Expressions Node and have a positive result =1 in a new column. Then use the GroupBy Node to Sum them. This is also an easy way to do conditional running totals.

BG_SST · April 20, 2022, 4:19pm

@eamendola

Thank you so much for the detailed explanation! This worked out perfectly. Classic case of overcomplicating on my part.

Regards,

Daniel_Weikert · April 20, 2022, 5:03pm

If you have a lot of columns I would have a look at unpivot or transpose node
br

system · April 27, 2022, 5:04pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.