Data cleaning

I’m probably missing something cos this seems like it should be simple

I have some data with 3 categories. I’ve identified that some of the data has a misspelling: “VAC B” and “VAC B ” (see the space at the end of the second?). This erroneously makes 4 categories.

How can I get rid of the space on the second? So far the only thing I can do is remove the space in the String Manipulation node, but that affects other categories.

I’ve tried the Rule Engine node: $VAC Area$ LIKE “VAC B” => “VAC B”
but that removes the other 2 category strings entirely.
All I can think is that I need to tell KNIME to leave anything that’s not like VAC B alone, but I can’t find how to do that.

Can anyone help?

Hi @Andy_D

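Go for the String Manipulation node and use the strip function, e.g. strip($VAC Area$). It only removes whitespace at the start and end of each value, so the other categories are left as they are.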
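To show why trimming only the ends is safe for the other categories, here is a small sketch in plain Python rather than KNIME. The sample values are made up for illustration (they are not the real "VAC Area" data), and Python's `str.strip()` is used as a stand-in for the node's strip function.

```python
# Hypothetical sample values standing in for the "VAC Area" column.
values = ["VAC A", "VAC B", "VAC B ", "VAC C"]

# str.strip() removes leading and trailing whitespace only,
# so values that are already clean pass through unchanged.
cleaned = [v.strip() for v in values]

# Compare the distinct categories before and after cleaning.
print(sorted(set(values)))
print(sorted(set(cleaned)))
```

The space inside "VAC B" is untouched because it is not at either end of the string, so only the stray trailing space disappears.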

gr. Hans


@Andy_D you can use a regex to reduce any run of multiple blanks to just one
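As a rough illustration of the regex idea, here is a sketch in plain Python rather than KNIME. The helper name `collapse_blanks` is hypothetical, and the final trim is an added assumption: collapsing runs of blanks on its own would still leave a single trailing space behind.

```python
import re

def collapse_blanks(text):
    # Replace every run of whitespace with a single space,
    # then trim whatever is left at the start and end.
    return re.sub(r"\s+", " ", text).strip()

# Hypothetical sample values with extra and trailing blanks.
for raw in ["VAC B", "VAC B ", "VAC   B", "  VAC  A "]:
    print(repr(raw), "->", repr(collapse_blanks(raw)))
```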

Also for multiple columns:
