WARN Tag Filter 5:1323 Selected tag type “SPACY_POS_en_core_web_md-3.3.0” could not be found.
This might happend because it is either a dynamic TagSet that is undefined for the selected column ‘Processed document’
or it is a TagSet defined by a missing language extension!
Install additional language extensions at File->Install KNIME Extensions.
In which log can I find more detailed information?
See here how to find the log files. @Artem from @Redfield might have more clues once you have provided the log files. Furthermore, it looks like the failing node is inside the component Processing in your workflow. Maybe you can see some more details inside the component? Could you post an image of the contents of the component?
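Once the log file is located, a short script can pull out only the warnings that belong to the failing node. This is a minimal sketch and not part of KNIME: the function name `warnings_for_node` and the file name `knime.log` in the current directory are assumptions for illustration, and it assumes each entry carries the level and the node name on the same line, as in the console output quoted in this thread.

```python
from pathlib import Path


def warnings_for_node(lines, node_name, level="WARN"):
    """Return the log lines that mention both the level and the node name."""
    return [
        line.rstrip("\n")
        for line in lines
        if level in line and node_name in line
    ]


if __name__ == "__main__":
    # Hypothetical location; replace with the path of your own log file.
    log_path = Path("knime.log")
    if log_path.exists():
        with log_path.open(encoding="utf-8", errors="replace") as handle:
            for entry in warnings_for_node(handle, "Tag Filter"):
                print(entry)
```

Multi-line messages (such as the Tag Filter warning, which continues on the following lines) are only matched on their first line, so the surrounding lines still need to be read in the log itself.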
I don’t understand the structure of your posts. Is the part about the spaCy library a separate issue you are experiencing, together with a proposed solution that the maintainers might include in their extension?
The third part seems to be about the Tag Filter node. See my link above for finding the log files. But I do not understand why you want the logs there. The message clearly states that you could try to
Install additional language extensions at File->Install KNIME Extensions
Have you tried that? See the following screenshot for the possible language extensions there.
Thanks for your feedback.
Update:
Very strange: after “playing” around with the Spacy Model Selector, I am able to run the workflow, including spaCy model 3.6.
But the last node in the workflow with this config:
First of all, if you are interested in the workflow that was presented at the webinar, I would recommend using this one:
The workflow you mentioned is a part of the NLP course, and it is an exercise that needs to be solved.
One more thing: which Python environment are you using, the bundled one or your own custom environment? If you are not using the bundled environment, the problem could be that some dependencies are missing.
I can see you tried to download the model and use it. However, version 3.6.0 is not supported; currently only versions 3.2.0 and 3.3.0 are supported (support for 3.5.0 is coming soon).
So please check your settings and provide an update, since it is a bit hard to tell what is wrong.
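To make that check easier, something like the following could be run in the Python environment that KNIME is configured to use. It is only a sketch under assumptions: the helper names are made up for illustration, the supported versions are simply the ones named in this thread, and the model lookup only works if the model was installed as a regular Python package.

```python
import sys
from importlib import metadata

# Versions named as supported in this thread; an assumption that may
# change with newer releases of the extension.
SUPPORTED_MODEL_VERSIONS = ("3.2.0", "3.3.0")


def installed_version(package):
    """Return the installed version of a package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def is_supported(model_version, supported=SUPPORTED_MODEL_VERSIONS):
    """Check a model version string against the supported list."""
    return model_version in supported


def same_minor(version_a, version_b):
    """Compare the major.minor part of two version strings."""
    return version_a.split(".")[:2] == version_b.split(".")[:2]


if __name__ == "__main__":
    # Shows which interpreter is actually running (bundled or custom).
    print("Python executable:", sys.executable)
    print("spaCy:", installed_version("spacy"))
    print("en_core_web_md:", installed_version("en_core_web_md"))
    print("3.6.0 supported:", is_supported("3.6.0"))
```

The printed interpreter path shows whether the bundled or a custom environment is in use, and a `None` for spaCy or the model points to a missing dependency.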
WARN LoadWorkflowRunnable Warnings during load: Status: Warning: 12_07_2023_KNIME_Refield_Sate_of_the_Art 3 loaded with warnings
WARN LoadWorkflowRunnable Status: Warning: 12_07_2023_KNIME_Refield_Sate_of_the_Art 3
WARN LoadWorkflowRunnable Status: Warning: BERT Model Selector 3:4598
WARN LoadWorkflowRunnable Status: Warning: State has changed from IDLE to CONFIGURED
WARN Spacy Tokenizer 3:4613:0:1321 C:\KNIME\plugins\se.redfield.textprocessing.channel.bin.win32.x86_64_1.1.2.202212230250\env\lib\site-packages\spacy\util.py:865: UserWarning: [W095] Model ‘en_core_web_md’ (3.3.0) was trained with spaCy v3.3 and may not be 100% compatible with the current version (3.4.1). If you see errors or degraded performance, download a newer compatible model or retrain your custom model with the current spaCy version. For more details and available updates, run: python -m spacy validate
WARN Tag Filter 3:4613:0:1325:0:1323 Selected tag type “SPACY_POS_en_core_web_md-3.3.0” could not be found.
This might happend because it is either a dynamic TagSet that is undefined for the selected column ‘Processed document’
or it is a TagSet defined by a missing language extension!
Install additional language extensions at File->Install KNIME Extensions.
WARN Spacy POS Tagger 3:4613:0:1325:0:149 C:\KNIME\plugins\se.redfield.textprocessing.channel.bin.win32.x86_64_1.1.2.202212230250\env\lib\site-packages\spacy\util.py:865: UserWarning: [W095] Model ‘en_core_web_md’ (3.3.0) was trained with spaCy v3.3 and may not be 100% compatible with the current version (3.4.1). If you see errors or degraded performance, download a newer compatible model or retrain your custom model with the current spaCy version. For more details and available updates, run: python -m spacy validate
WARN Spacy Lemmatizer 3:4613:0:1325:0:150 C:\KNIME\plugins\se.redfield.textprocessing.channel.bin.win32.x86_64_1.1.2.202212230250\env\lib\site-packages\spacy\util.py:865: UserWarning: [W095] Model ‘en_core_web_md’ (3.3.0) was trained with spaCy v3.3 and may not be 100% compatible with the current version (3.4.1). If you see errors or degraded performance, download a newer compatible model or retrain your custom model with the current spaCy version. For more details and available updates, run: python -m spacy validate
WARN Spacy Vectorizer 3:4613:0:1325:0:1334 C:\KNIME\plugins\se.redfield.textprocessing.channel.bin.win32.x86_64_1.1.2.202212230250\env\lib\site-packages\spacy\util.py:865: UserWarning: [W095] Model ‘en_core_web_md’ (3.3.0) was trained with spaCy v3.3 and may not be 100% compatible with the current version (3.4.1). If you see errors or degraded performance, download a newer compatible model or retrain your custom model with the current spaCy version. For more details and available updates, run: python -m spacy validate
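The repeated [W095] warnings all carry the same two facts: the version the model was trained with and the spaCy version currently running. A small sketch for extracting them from such a line follows; the pattern and the function name `parse_w095` are assumptions based only on the message text quoted above, not on any spaCy or KNIME API.

```python
import re

# Pattern derived from the W095 text in the log above; the quote characters
# around the model name are matched loosely because they vary by console.
W095_PATTERN = re.compile(
    r"\[W095\] Model \W(?P<model>\w+)\W \((?P<model_version>[\d.]+)\) "
    r"was trained with spaCy v(?P<trained_with>[\d.]+) .*?"
    r"current version \((?P<current>[\d.]+)\)"
)


def parse_w095(line):
    """Return the model and version details of a W095 warning, or None."""
    match = W095_PATTERN.search(line)
    return match.groupdict() if match else None


if __name__ == "__main__":
    sample = (
        "UserWarning: [W095] Model 'en_core_web_md' (3.3.0) was trained "
        "with spaCy v3.3 and may not be 100% compatible with the current "
        "version (3.4.1)."
    )
    print(parse_w095(sample))
```

For a more complete check inside the environment itself, the log already names the command to run: `python -m spacy validate`.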
I now have the workflow running with the “Bundled” Python environment configuration and KNIME 4.7.2…
As you may have seen in my comment from a month ago below the “News relevance webinar”, I was trying to understand and learn how KNIME and spaCy work together.
Maybe you can suggest a starting point for more background information. With only the workflows to go on, it is very time-consuming to understand how things work, what is working, and what the output should be.