Detecting Chinese characters from English characters

Hi,

I would like to read a column in KNIME, and append a new column that would say whether the data is English, Chinese, Chinese&English, and Blank. I read in another topic that this can be done by Tika Language Detector. Is this still the right solution? How do I install this in KNIME? I’m using KNIME analytics platform v3.4.2

Thank you!

Hi @airahdelafuente,

The Tika Language Detector should be your first choice here. It was introduced in KNIME Analytics Platform 3.3, so you can install it by going to File -> Install KNIME Extensions -> KNIME Labs -> KNIME Textprocessing.

If you’re looking for nodes but are not sure to which extension they belong, you should check out node pit, it’s a really cool tool: https://nodepit.com/

Cheers,
Roland

3 Likes