Hi everyone,
I have a question regarding fuzzy matching. I have a column about countries. These can contain typos. I also have a file with a single column that contains all country names, but all of them are spelled correctly. I used an example from the server (09_Fuzzy_String_Matching), but here I don’t have the ability to assign the right name to the misspelled names, but how can I replace it with the right name if a country was misspelled?
What do you mean by “a column about countries”?
Is it some text in which the country names are mentioned?
If there is a single country name, the String Matcher node alone will do the matching.
But if you have a string containing the country names, I have to ask if the text structure for all records are the same or not?
For example, if all the strings start with:
Country name: This country is located in…
It would be easy to handle.
Would you please provide a sample data?
Hi Armin, here you can find my workflow.
I have added the list with all existing country names in the yellow area. In the blue area you can see the file for which I should do the correct name assignment. Let’s assume that the correct list contains “Germany” (correctly written) in the yellow area. But if “Gernamy” was written in the blue area now, it should be matched with the names in the correct list and then replaced with the correct spelling. gernamy becomes Germany.
this Excel sheet includes some mistakes in the column “port of Destination”.
When for example Thailand is written “Tailand” in the document “TM full file Oct 18 1 (1).xlsx” then we should get “Thailand” from the other uploaded document as an Output or better said it should replace Tailand with Thailand.