Counting occurrences of words


I want to count the occurrences of words in a .xls file.

the text is written in GREEK. By following instructions in the forum reached a point. But many terms appear in a different format and they might appear in more than two rows of the table. Whow can i group or filter by using specific search words.  did anyone know about this? attached an example workflow.



Hi John,

you can concatenate the string columns to one column and convert this to documents (Strings to Document). Transform the documents to a bag of words and then apply the TF node to count the words in each document. You can then group by term and sum up the tf value to count the word occurrences in the whole corpus.

Cheers, Kilian


1 Like

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.