I have a question about the Bag Of Words Creator. I’m trying to count the occurrences of specific words in American 10-K files across multiple documents. However, the Bag Of Words Creator takes all words into account.
Could you share your existing workflow? That would make it easier for someone to help. Forgetting KNIME for the moment, what/how would you like to filter?
The data is stored locally on your computer, so it is not included in the workflow you shared. Create a folder named “data” inside your workflow folder and store the data there.
Try this. The component has the following tables:
Term count for a selected term
TF for a selected term
IDF for a selected term
TF for all terms
IDF for all terms
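
For anyone who wants to sanity-check the numbers outside KNIME, here is a rough Python sketch of the same idea: count only a chosen list of terms per document, then derive TF and IDF from those counts. This is not what the component does internally. The function names (tokenize, term_stats), the sample texts, and the formulas are my assumptions: TF is taken as count divided by the number of tokens in the document, and IDF as log(N / df). The KNIME TF and IDF nodes offer other variants, so values may differ.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on anything that is not a letter,
    # digit, or apostrophe. KNIME's tokenizers may split differently.
    return re.findall(r"[a-z0-9']+", text.lower())


def term_stats(docs, selected_terms):
    """Return one row per (document, selected term) with count, TF and IDF."""
    counts = {name: Counter(tokenize(text)) for name, text in docs.items()}
    n_docs = len(docs)
    rows = []
    for term in (t.lower() for t in selected_terms):
        # Document frequency: number of documents containing the term.
        df = sum(1 for c in counts.values() if c[term] > 0)
        # Assumed IDF variant: log(N / df); 0.0 if the term never occurs.
        idf = math.log(n_docs / df) if df else 0.0
        for name, c in counts.items():
            total = sum(c.values())
            # Assumed TF variant: relative frequency within the document.
            tf = c[term] / total if total else 0.0
            rows.append({"document": name, "term": term,
                         "count": c[term], "tf": tf, "idf": idf})
    return rows


# Hypothetical sample texts standing in for the 10-K documents.
docs = {
    "filing_a": "Risk factors: litigation risk and market risk.",
    "filing_b": "Revenue grew while litigation costs fell.",
}
rows = term_stats(docs, ["risk", "litigation"])
for row in rows:
    print(row)
```

Only the terms in the list passed to term_stats end up in the result, which is the filtering the original question asks for. The document totals still include every token, so TF stays relative to the full document length.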