Stanford Tagger is taking very long to execute

David1202 · December 20, 2022, 3:47pm

Hello everyone,

I am currently trying to find some trends using journal-analysis. To do that, I’d like to use the Stanford Tagger and Lemmatizer but the Stanford Tagger needs about an hour to run. Is this normal or can I do anything to speed this up, because I want to analyse a lot of different journals a day over a year.

I am using a JSON Reader to read my JSON Data from the different websites, after that im converting the JSON File into a String and then into a document, which is then given to the Stanford Tagger.
For better understanding I took some screenshots of my settings, the input and the workflow itself.

Thank you already for everyone’s help,
David Werner

Daniel_Weikert · December 20, 2022, 4:50pm

How much processing power (e.g. ram) did you assign to KNIME? You can adjust that in the .ini file
Also it make sense to first clean your data (e.g. first run the filter nodes) before you tag. That reduces the data set size upfront.
beste Grüße

David1202 · December 21, 2022, 3:19pm

Thanks for your fast response!
I did the changes you recommended and it is working much faster now.

Daniel_Weikert · December 21, 2022, 5:44pm

Great to hear that David. Congrats.
Br

system · December 28, 2022, 5:44pm

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.