Hello guys, I have a csv file with too many rows, for example 275.000.
In one column appears repeatly the name of the source cyber attack country but the file is not grouped or it’s in disorder, for instance:
row1: …, France,…
I need to find the number of ocurrences by Country, for instance in this case, in the standard output interface of knime I hope to list:
I tried in knime via GroupBy but only it’s counting the number of the total rows of the file. Please, any help will be welcome.
After that I had to build a Pareto table but with the previous totals it’s easy to finish in excel.
J. Ignacio Saavedra V.
Bogotá - Colombia
hi @nachancheno , welcome to the community
replicate the column and group over it.
Hi marzukim, thanks for the wellcome and your answer.
When I replicate the column, ¿does knime automatically group the rows by the column values?
I can’t see clearly the code or graphs that you attached, if it’s not too much abuse, ¿could you attach the code, please?
Thanks and regards.
no problem, happy to help. as you requested, i’ve attached the workflow for you to review.
KNIME_Count01.knwf (86.3 KB)
Hello marzukim again.
Thanks very much! ¡I adapted the code and it worked great!
glad to hear that you were able to resolve the issue.
just to update you, you can take out the rules node (as not use in “groupby”).
i just use it for other execution tests. it still give the same results.
This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.