hello together,
I used the bag of word creator to calculate my document frequency. I noticed that the node removes terms that occur more than once in a document. But this would be counterproductive when assigning characteristics to the individual classes, or can one tell me why the node does this?
yes, you are right. The Bag Of Words Creator lists terms only once per document.
However the frequencies are calculated correctly, because it looks up the number of occurrences of the term in the document instead of counting the occurrences in the term column.