Replacing umlauts (ä,ü,ö and ß) in german data

Hi,

I’ve loaded german data into Knime which causes some problems with the umlauts and other german characters in the text.

For example “Großhandel” looks like this:
image

Is there a way to replace ä,ü,ö and ß globally in the entire table?

Thanks for any help!

Hi @FerryCarondelet

Do you actually want to maintain the characters to ensure proper German spelling?

In that case, ensure that you load the data into KNIME using UTF-8 encoding. Using the CSV Reader as example, go to the encoding tab to change the character set.

Default:

UTF-8
image

If you also want to convert them eventually, use the replaceUmlauts() function in a String Manipulation node.

Hope this helps!

6 Likes

Wow awesome! Thank you @ArjenEX for your fast and very helpful response!

But it seems like in the string manipulation node, i can only replace the umlauts for specific columns. Is this also possible for the entire dataset?

Yes, change to the Multi Column version of the same node.

image

Select all columns you want to have the change applied to. Use the $$CURRENTCOLUMN$$ variable in the function. This will ensure all selected columns are updated accordingly.

3 Likes

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.