Retrieve data that has been repeated three times or more in specific column

Hi guys,

Let’s say I have a table with >1000 rows, and there is a specific column X where multiple repeated data points can be present.

What I need is that if there is any data in column X that has been repeated three times or more, I would like to obtain the data for the entire row in a new table.

Could you help me with any suggested workflows or nodes? or even a Python script that I can integrate into KNIME?

Hi @ihisawi

Which “entire row” are you referring to? Is that the first, the third, all of them, etc.?

It makes quite a difference for a potential solution. A workable example with (anonymized) input and expected output would help a lot :wink:

3 Likes

Assuming you want to save all rows that have => 3 identical values in the selected column, try this:

You can filter the columns in the join that aren’t from the original table. I left them in this example for QA.

3 Likes

This is the perfect solution for what I wanted.
Thank you so much!

You"re welcome. Glad I could help.

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.