This sounds like a job for address deduplication or ‘fingerprinting’.
Concerning address deduplication, @wiswedel provided some very useful workflows:
Fingerprinting using addresses (ignore the title)
Compare string similarities (you may have to set a threshold)
You would need Palladian for that:
Repository to install Palladian
http://download.nodepit.com/palladian/4.0
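
In case it helps to see the two ideas outside of KNIME, here is a minimal Python sketch using only the standard library. It is not what the linked workflows or the Palladian nodes do internally; the function names (`fingerprint`, `similarity`, `is_duplicate`) and the threshold of 0.85 are my own assumptions for illustration, and you would have to tune the threshold on your own data.

```python
import re
import unicodedata
from difflib import SequenceMatcher


def fingerprint(address: str) -> str:
    """Build a normalized key: ASCII-fold, lowercase, drop punctuation,
    then sort the unique tokens so word order no longer matters."""
    text = unicodedata.normalize("NFKD", address)
    text = text.encode("ascii", "ignore").decode("ascii").lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    tokens = sorted(set(text.split()))
    return " ".join(tokens)


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between the fingerprints of two addresses."""
    return SequenceMatcher(None, fingerprint(a), fingerprint(b)).ratio()


def is_duplicate(a: str, b: str, threshold: float = 0.85) -> bool:
    """Treat two addresses as duplicates if they are similar enough.
    The default threshold is an arbitrary starting point, not a recommendation."""
    return similarity(a, b) >= threshold


if __name__ == "__main__":
    # Same tokens in a different order and with different punctuation:
    # the fingerprints are identical.
    print(fingerprint("12 Main St., Springfield"))
    print(fingerprint("springfield 12 main st"))

    # A typo changes the fingerprint, so a similarity threshold is needed.
    print(is_duplicate("12 Main Street, Springfield", "12 Main Stret Springfield"))
    print(is_duplicate("12 Main Street, Springfield", "99 Ocean Avenue, Miami"))
```

Exact fingerprint matches catch differences in case, punctuation and word order; the similarity check on top is what catches typos, which is why a threshold comes into play.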