I have a large table with >10000 names/spellings in one column containing names which differ slightly for different persons for example:
Mickey Mouse
M. Mouse
Mouse Mickey
M. Disney Mouse
M. Dinsey Muose
I would like to find somehow groups of similar names in order to construct with minimal manual effort a dictionary replacer replacing finally all of the above names by Mickey Disney Mouse, for instance.
that was the right way to go. It took me a while to figure the workflow out. The key was how to construct the query (I am not so into Java) and choose the right sensitivity for the fuzzy search.
I wanted to check the links you posted but unfortunately I am not able to access it seems the page is private? Could you share the solution please I have tried many different things in last weeks but haven’t had a good result yet
Jerry, (I am also not into Java and new to Knime) Do you have the workflow that you used for your ‘name consolidation’ case? I am struggling to produce something quick :S