Hi,
I downloaded the ClassificationClusterLDA sample workflow.
Example Workflow – Text Processing Tutorial – KNIME Community Hub
I went through the workflow step by step.
While checking the results, I noticed that in the sample workflow the “Snowball Stemmer” transforms some words strangely.
English is not my native language, but surely “energy” should not be turned into “energi”, or “literally” into “liter”.
Other words are converted correctly, for example “lots” to “lot” or “reviews” to “review”.
Can you please explain to me how the word stems are formed?
Is it intended that they come out “wrong”?
Also, is it possible to use languages other than those listed in the “Snowball Stemmer” menu?
Hello @MarekV
You can try installing other language packs for the Text Processing extension for KNIME.
Otherwise you can take a look at the Redfield NLP nodes, since they work on top of spaCy, an NLP framework that supports more than 20 languages. In this extension you can find the Lemmatizer node, which brings the tokens to their root form, which is actually better than stemming.
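As for why the stems look “wrong”: a stemmer does not look words up in a dictionary. It applies suffix-stripping rules, so the result is a shared key for related word forms, not necessarily a real word. The sketch below is a deliberately simplified toy written for this explanation, not the actual Snowball algorithm; the function name `toy_stem` and its four rules are assumptions chosen only to reproduce the examples from the question.

```python
VOWELS = set("aeiou")


def toy_stem(word: str) -> str:
    """Toy suffix stripper (hypothetical, NOT the real Snowball rules)."""
    w = word.lower()
    # Rule 1: drop a plural "s" (lots -> lot, reviews -> review),
    # but leave endings such as "ss" and "us" alone.
    if len(w) > 3 and w.endswith("s") and not w.endswith(("ss", "us")):
        w = w[:-1]
    # Rule 2: a final "y" after a consonant becomes "i" (energy -> energi),
    # so that "energy" and "energies" can end up with the same stem.
    if len(w) > 2 and w.endswith("y") and w[-2] not in VOWELS:
        w = w[:-1] + "i"
    # Rule 3: the adverb ending "alli" is reduced to "al" (literalli -> literal).
    if w.endswith("alli"):
        w = w[:-2]
    # Rule 4: a trailing "al" is removed when enough of the word remains
    # (literal -> liter).
    if len(w) > 5 and w.endswith("al"):
        w = w[:-2]
    return w


for word in ["energy", "literally", "lots", "reviews"]:
    print(word, "->", toy_stem(word))
```

The real stemmer has many more rules and conditions, but the principle is the same: rules are applied one after another, and nothing checks whether the final string is an English word. A lemmatizer, by contrast, uses vocabulary and grammatical information to return a dictionary form.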
Thank you for the information about the Redfield NLP nodes.
Unfortunately I have version 4.3 installed.
Next to it I set up version 4.7.7.
I could only deploy version 4.7.7 rather than install it, because I don’t have admin rights.
Then I downloaded the sample workflow External_spacy_model – KNIME Community Hub from the KNIME site,
but when I open it, it reports that the missing nodes cannot be downloaded.
See the images.
I am not sure where the problem is.
I think you need to check the boxes on the “Available Software Sites” screen for everything that is disabled, and try again.
Hi,
I am not allowed to connect directly to the KNIME update site.
I am using the zipped version instead:
The zipped Update Site for 4.7.7 can be downloaded here.
Please do not click it.
It is a 7.8 GB file; I mention it just for illustration.
For me there is no other way.
Thanks
Ah, got you. Then did you follow the instructions already about adding a local extensions archive to your update sites? Sorry if I’m telling you something you already know, it’s a bit hard for me to tell from your screenshots above.
Hello,
I tried to make the pictures self-explanatory, but sometimes it doesn’t work.
Yes, I installed the archive with your extensions, but the nodes mentioned above are not found there. In the picture they are marked as ADD ON.
They are probably third-party nodes, which presumably have to be installed separately.
And I don’t know how.
Hi @MarekV -
I think I understand now. Yes, the Redfield SpaCy nodes are in an extension that’s part of the Trusted Community Contributions update site. You can download the 4.7 version here: https://update.knime.com/community-contributions/trusted/4.7/TrustedCommunityContributions_4.7_202309141845.zip
OK,
I downloaded https://update.knime.com/community-contributions/trusted/4.7/TrustedCommunityContributions_4.7_202309141845.zip
and then added it to KNIME.
Among the extensions I found spaCy.
But at 73 % of the installation I got an error message.
The KNIME version is 4.7.0.v202212010827.
It seems to be a security problem:
Redfield Updates goes wrong - Partner Extensions - KNIME Community Forum