Checking similarities of the specified columns using distances

I have a 2 excel files which contains string data in excel1 file there is software name and excel2 file there is a product now i have to compare these 2 columns and have to find distance between them like how much percentage those 2 strings are similar and i need to create a new column with name matched and need to write that percentage into newly created column.Can some one help me out how to do this with distances.
EXCEL - 1:


EXCEL - 2:

Hello @Gella_Gayathri,

Welcome to the KNIME forum!

You can achieve this by using the Similarity Search node. Here is an example workflow showing how to do this based on Levenshtein distance.

Best,
Keerthan

2 Likes

Hii @k10shetty1

i am unable to find Levenshtein distance in similarity search is i’m missing something could you help me out.

Thanks in advance,
Gayathri

Hello @Gella_Gayathri ,
If you scroll below “Dice,” you will see two more options: Levenshtein (absolute) and (normalized).

Thanks,
Sanket

3 Likes

Hii @k10shetty1 @sanket_2012 ,
i want to remove text or versions beside product name like (64-bit , 14.0.23026) how can i do that in knime.

You can use string manipulation node(s) with regex expressions.
br

Hii

Can anyone help what exact regular expression need to use to get desired results.

thank you
Gayathri