Hello KNIME Community!
I have a problem with the Dictionary Tagger node and I hope you can help me.
Here is my approach: In a first workflow, I searched for the most frequently mentioned terms in some PDF documents and sorted them by term frequency. I then chose the terms that are relevant for my project and created an Excel list containing them (terms converted to strings). In a second workflow, I started with exactly the same data (the same PDF files as before) and loaded my terms from the Excel sheet into the Dictionary Tagger node. After running the analysis and pivoting the data, some terms no longer show any frequency (a ? in the pivot table), although I extracted them from the same data as in my first workflow.
In addition, my pivot table does not show all the terms from the Excel list. Some terms are left out for no apparent reason.
Long story short: Does anyone have a hint as to why some frequently mentioned terms have "vanished" even though I used the same data, and why some terms tagged by the Dictionary Tagger do not show up in the frequencies?
Thank you all very much in advance!
Edit: The documents and terms are in German.