special characters read by CSV Reader not depicted correctly

Description:

Reading strings from a csv file with CSV Reader containing german characters such as ß and ä are not correctly depicted. This already affects version 5.5, maybe also earlier. I saw this under Windows, but not in KNIME 5.4 running under Linux. So might be more an operating system encoding issue…

Changing the encoding from OS default (windows-1252) to UTF-8 shows the correct symbol.

Reading an older file works fine and shows the correct symbol even with the same Reader Node.

Steps to reproduce:

  1. create a text file
  2. enter text with ß
  3. read with CSV Reader or File Reader
  4. look at text in the KNIME table

Actual results:

~A ..Y (signs always above the letters)

Expected results:

ß

Attachments:

OS:

Win10

Created ticket ID: QA-1487

Hi @daniela_digles,

Thanks for bringing this to our attention. This is indeed a problem of the CSV Reader’s default encoding to be set to the OS-specific default, which is different on Windows than on Linux. We are currently working on a new CSV Reader with a modern dialog. This new CSV Reader will always have UTF-8 as its default encoding, independent of the OS. This should lead to less inconsistent behavior in the future.

Marc

2 Likes

Internal ticket ID: UIEXT-3072
Summary: special characters read by CSV Reader not depicted correctly
Fix version(s): 5.10.0