identify within the text (long one) several predefine words and the frequency

I have CSV file with one column of txt type  (One row)

I would like to identify within the text (long one) several predefine words and the frequency of each   such as Developer, HTM l, UI etc

So the    output will be

Col1      developer     HTML     UI

Doc1     3                      2           4

 

Can you please help and advice what is the best approach for this type of situation?  

Thanks in advanced

I have the same need. Please help.

Hi,

you can start with the File Reader to read in the csv formatted string data. Than use the Strings to Document->Dictionary Tagger (you need to provide a dictionary with your words that you want to count)->General Tag Set Filter (filter all words taht have _not_ been tagged)->Bag of Words Creator->TF (absolute)->Term to String->Pivot (group by Document column, pivots are terms as strings)

Cheers, Kilian

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.