Simple question about Term Frequency

Hi @RIchardC and welcome to the forum!

Without any sample data to go on, this is a general approach I would try:

  1. Strings To Document node to convert Source to a KNIME document, making sure to configure the node to apply the Category metadata

  2. Bag Of Words Creator to… create a Bag of Words

  3. TF node to calculate the absolute term frequency

After that, you would need some aggregation and joining to compare the results against your original list of terms, but exactly how that is done will depend on the format of your data. If you have a small example dataset, I could try to build a toy workflow for you to check out.
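In the meantime, here is a rough plain-Python sketch of the same idea, in case it helps to see the logic spelled out. This is not how the KNIME nodes are implemented, just an analogue of the steps above: count absolute term frequencies per Category, then look up each term from your list. The sample rows, the term list, and the function names are all made up for illustration, and the tokenization (lowercasing and splitting on non-alphanumeric characters) is a simplifying assumption.

```python
import re
from collections import Counter, defaultdict


def term_frequencies(rows):
    """Count absolute term frequencies per category.

    rows: iterable of (category, source_text) pairs.
    Returns a dict mapping category -> Counter of lowercase tokens.
    """
    counts = defaultdict(Counter)
    for category, text in rows:
        # Naive tokenization (assumption): lowercase, keep letters/digits/apostrophes
        tokens = re.findall(r"[a-z0-9']+", text.lower())
        counts[category].update(tokens)
    return counts


def compare_to_terms(counts, terms):
    """Join the counts against a list of terms of interest.

    Returns (category, term, absolute_count) rows; the count is 0
    when a term never occurs in that category.
    """
    return [
        (category, term, counts[category][term.lower()])
        for category in sorted(counts)
        for term in terms
    ]


# Hypothetical sample data standing in for the Source and Category columns
rows = [
    ("fruit", "Apple pie and apple juice"),
    ("fruit", "Banana bread"),
    ("veg", "Carrot cake, carrot soup"),
]
terms = ["apple", "carrot"]  # hypothetical list of terms to look up

result = compare_to_terms(term_frequencies(rows), terms)
for row in result:
    print(row)
```

In a real workflow the equivalent of `compare_to_terms` would probably be a GroupBy followed by a Joiner, but that part depends on what your term list looks like.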
