Counting occurrences of words

hi

I want to count the occurrences of words in a .xls file.

the text is written in GREEK. By following instructions in the forum reached a point. But many terms appear in a different format and they might appear in more than two rows of the table. Whow can i group or filter by using specific search words.  did anyone know about this? attached an example workflow.

thanks

 

Hi John,

you can concatenate the string columns to one column and convert this to documents (Strings to Document). Transform the documents to a bag of words and then apply the TF node to count the words in each document. You can then group by term and sum up the tf value to count the word occurrences in the whole corpus.

Cheers, Kilian

 

1 Like