Address Deduplication

The workflow looks for duplicate records of restaurants by searching for similar names and addresses in a reference table. The similarity is based on the mean of the 2-gram dice distances of the restaurant name and address.


This is a companion discussion topic for the original entry at https://kni.me/w/M0O8jTCkxd_3rjS-