Here I am with our solution to Just KNIME It! Challenge 10 β which was also made by yours truly!
This solution is simple and not so different from others I saw here. Thanks to everybody who participated, and for the nice contributions using other String distance and even more efficient ways of tackling this little puzzle.
See you tomorrow for a challenge on the Summer Olympics! Are you folks as excited about this event as⦠I am??
My workflow uses the Similarity Search node, and has a threshold of 0.7. Based on the split of typos/ no typos, it finds the closest match in dataset with no typos (if any) and updates the typo parameter. It later removes duplicate rows and outputs a downloadable csv.
The dashboard allows for a custom file upload, and provides a summary of the uploaded file (total rows and rows under each category. It also provides a link to download file as well.