Hey all,
Is there a way in KNIME to display the preprocessed documents with only the terms that remain after filtering?
This picture shows the result of the Reference Row Filter node.
As you can see, the “preprocessed documents” column still includes the words that have already been filtered out.
I am having difficulty calculating the relative term frequency in the ‘TF’ node. As you can see in the figure, the word ‘PIN’ appears twice in the record. I would expect the count to be divided by the number of remaining terms in the document, in this case 8. Instead, it is divided by the total number of words, even though some of those words have been filtered out and should no longer be considered.
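To make the expected calculation concrete, here is a small Python sketch. It is not KNIME code. The token list, the set of kept terms, and the `relative_tf` helper are all made up for illustration; only the counts (‘PIN’ twice, 8 remaining terms) come from my example.

```python
from collections import Counter


def relative_tf(tokens, kept_terms=None):
    """Relative term frequency: occurrences of a term / number of tokens counted.

    If kept_terms is given, tokens outside that set are dropped first,
    so the denominator only counts the terms that survived filtering.
    """
    if kept_terms is not None:
        tokens = [t for t in tokens if t in kept_terms]
    total = len(tokens)
    if total == 0:
        return {}
    counts = Counter(tokens)
    return {term: n / total for term, n in counts.items()}


# Hypothetical document: 12 tokens in total, 8 of which remain after filtering.
original_tokens = ["PIN", "card", "the", "PIN", "blocked", "is",
                   "bank", "a", "account", "reset", "of", "online"]
kept = {"PIN", "card", "blocked", "bank", "account", "reset", "online"}

expected = relative_tf(original_tokens, kept)["PIN"]  # 2 / 8
observed = relative_tf(original_tokens)["PIN"]        # 2 / 12
print(expected, observed)
```

The first value is what I would like to get (denominator = remaining terms), and the second corresponds to what I seem to be getting (denominator = all words of the original document).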
I would be very grateful for any help.
Thanks,
Canan