Get more than one match at string matching

swbine · September 9, 2019, 8:56am

Hi guys,

I have a short question: I use the String Similarity Node to match company names from two different sources together. For some it already gives me good results, for some others it’s quite difficult as e.g. AB technologies and AC technologies are quite close matches but it’s not the same company. I would like to increase the probability of getting the right match by giving the 5 most likely matches (hoping that the right one is part of it). I could not figure it out in KNIME so far, I’m using difflib’s get_close_matches in Python right now.

Is there a way to achieve this in KNIME? We don’t want to use the Python Node as we don’t want the people that will work with it to install and handle python code (plus my code is super slow as I’m not really familiar with it).

Thanks in advance!

Sabine

izaychik63 · September 9, 2019, 11:47am

You can find some ideas here
https://www.knime.com/blog/address-deduplication

system · March 9, 2020, 11:47pm

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.