Removing Duplicate rows from a CSV file.

I read somewhere but now I can’t find the Duplicate Variable Node
I am looking to take my CSV file and check for doubles by rows.
This way I can verify that there are no doubles in my csv file.
But I need to check the whole row since I have approximately 10 or 12 columns.
Is there any good example workflows on how to set this up.
Thanks for the help,
Scott

I read somewhere but now I can’t find the Duplicate Variable Node

Are you referring to the Duplicate Row Filter node?

5 Likes

I wrote this article about duplicates. Maybe this and the accompanying workflow can help.

4 Likes

Yes I am referring to the Duplicate Row Filter Node

Hi @sgilmour,

did you find him? It is part of basic installation so should be visible in node repository…

Br,
Ivan

1 Like

Yes I found the duplicate row filter node. I was just looking for some example workflows to give me a starting point on how to get started.

1 Like

Hi @sgilmour,

in that case you can find node on KNIME Hub and there you’ll see a list of workflows were it was used :wink:

Br,
Ivan

1 Like

The example workflow helped me create the workflow. I was able to create the workflow and a sample file.3 Duplicate Rows Workflow.knwf (413.3 KB) . Just need to test it with my real excel sheet.

1 Like

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.