Hello Knime Experts! I am fairly new to KNIME and I have been working with this tool since past 2 months now. I have a doubt on fuzzy matching (text processing). I have set of 600 names as a column (first name, middle name- optional and last name). I want to get all the set of similar strings from this column. I did find few workflows which does this but it does by comparing one name of one column with another name in different columns. I am not sure of any discussion relating to getting set of similar words from one column. The algorithm of fuzzy search to be used should preferably be an algorithm which gives percentage of similarity like cosine similarity. I do not have any words to compare the name field with. I have a single column with full name of people and I need to get set of similar words from these itself, so each name should be compared to all other 599 words and the set of words which are similar are to be marked as similar and those words are then checked with percentage of similarity it holds. Kindly help me resolve this or direct me to any document/discussion which helps in the same. Thanks in advance! Mahima

You can try HansS solution here [image] Finding duplicates compagnies KNIME Analytics Platform Hi everyone, I am working on a list of compagnies and i need to check if there is duplicates compagnies in a table. But it’s not easy to find thoose duplicated beacause there are compagnies written in different ways. And the list is very long, so I need to optimize the process to find duplicates. I have tried to use the worflows of duplicated adresses but it’s not working very well. So, if someone can help me to built the worflow to find the duplicates. Just for example, i give you names o…

Fuzzy search on set of names to find similar names by comparing each name with all others

KNIME Extensions

izaychik63 May 19, 2021, 5:44pm 2

You can try HansS solution here

3 Likes