I am new to KNIME and have only a little experience with it.
I want to create a new column called “main target”.
The column's content should depend on whether the string in another column called “contract” matches a keyword or not.
I would like to use a dictionary that lists all potential matches together with the value to assign when a match occurs.
I have many different keywords which I want to check for matching, and the number of keywords will be growing.
I saw the Rule-based nodes where PMML can be used. I have zero experience with PMML.
Thank you all for the quick help and suggestions.
I went through all possibilities.
It turned out that the "String Replace (Dictionary)" node suits me best, but it works only for exact matches. How can I use the dictionary with regex instead of exact matching? That would be the solution.
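To make clear what I mean by a dictionary with regex, here is a minimal sketch in plain Python. It is not a KNIME node configuration, and the patterns, category values, and the function name `main_target` are made up for illustration. Each dictionary entry is a regex pattern plus the value to write into “main target”, and the first pattern found in “contract” wins.

```python
import re

# Hypothetical dictionary: (regex pattern, value for the "main target" column).
# New keywords can be appended here without touching the matching logic.
RULES = [
    (r"tire|tyre", "tires"),
    (r"body\s*(shop|repair)", "body repair"),
    (r"glass|windshield", "glass"),
]

# Compile each pattern once, case-insensitively.
COMPILED = [(re.compile(pattern, re.IGNORECASE), value) for pattern, value in RULES]


def main_target(contract, default="unspecified"):
    """Return the value of the first rule whose pattern occurs in `contract`."""
    for pattern, value in COMPILED:
        if pattern.search(contract):
            return value
    return default
```

Because the rules are checked in order, more specific patterns should come before more general ones.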
The Wildcard Tagger sounds promising, but I have no clue how to use it.
I found no documentation on how to connect it. I tried connecting CSV Readers and tables, but I cannot open the configuration to start understanding how it is supposed to work.
The two-step approach you mentioned is a good idea, but my case is the following:
It is about categorizing companies. I extract thousands of companies without knowing exactly what they do. They should all be engaged in car repairs in some way, but some of them specialize in tires, others in body repairs, etc.
Therefore, I am trying to categorize these companies to get a better picture of their business type. I have a list of URLs for which I know the business type. Whenever an “unspecified” company matches one of the “specified” URLs, it should be assigned that URL's category name.
Indeed, I could normalize the URLs to the bare domain name and compare those. But regex would still be helpful for the cases where part of the domain is enough to categorize a company.
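The combination I have in mind, normalizing first and falling back to regex on parts of the domain, could look roughly like this in plain Python. This is only a sketch under my own assumptions: the domains, patterns, and the names `bare_domain` and `categorize` are invented for illustration and are not part of any KNIME node.

```python
import re
from urllib.parse import urlparse

# Hypothetical list of "specified" domains with a known business type.
KNOWN = {
    "reifen-mueller.example": "tires",
    "karosserie-schmidt.example": "body repair",
}

# Hypothetical fallback patterns for cases where part of the domain is enough.
PARTIAL = [
    (re.compile(r"reifen|tyre|tire"), "tires"),
    (re.compile(r"karosserie|bodyshop"), "body repair"),
]


def bare_domain(url):
    """Reduce a URL to its lowercase host name without scheme, 'www.', port, or path."""
    # urlparse only recognizes the host when a scheme or leading '//' is present.
    if not re.match(r"^([a-z][a-z0-9+.-]*:)?//", url, re.IGNORECASE):
        url = "//" + url
    host = urlparse(url).hostname or ""
    return re.sub(r"^www\.", "", host)


def categorize(url, default="unspecified"):
    """Try an exact match on the bare domain first, then the regex fallbacks."""
    domain = bare_domain(url)
    if domain in KNOWN:
        return KNOWN[domain]
    for pattern, category in PARTIAL:
        if pattern.search(domain):
            return category
    return default
```

The exact lookup handles the URLs I already know, and the regex list only needs to cover the remaining partial matches.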
I still wonder why the dictionary-based string replacement cannot be used in combination with regex. That would be very powerful.