How to do NLP in non-English?

Hello everyone,

I am currently doing natural language processing for a non-English language.

I think that KNIME enrichment node supports only English, is that correct?
E.g., can I add custom dictionary or tokenizer?

I couldn’t find out the way in KNIME, so I tried in Python.
I could implement tagging, cleaning, stemming, lemmatizing in Python, but now, I couldn’t find out how to return/ what to return to KNIME.
E.g., how can I return to KNIME as the document type and what structure must it have?

Someone please advise me!
Thank you in advance!!

Hi @poc,

These are the language packages for textprocessing in KNIME:
Bildschirmfoto von 2022-03-28 14-34-17

Your language is not there, or did you not install the appropriate package?

Best Regards
Andrew

4 Likes

@Andrew_Steel
Thanks for reply!

I see these are the language packages.
But unfortunately what I’m doing for is Japanese and Vietnam…

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.