Typo Correction for Large Datasets

Recursively applies hierarchical clustering to a column of string data to correct typos. Recursion is used to avoid the long compute time of clustering a very large table in one pass: the data is clustered in small chunks, each name is corrected to the most common spelling in its chunk, and the rows are shuffled before the next pass.
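
For readers who want to see the idea outside of KNIME, here is a minimal Python sketch of the chunk, cluster, correct, shuffle, repeat cycle. It is not the workflow itself, and everything in it is an assumption made for illustration: the function names, the use of Levenshtein distance, single-linkage clustering cut at a fixed `max_dist`, the default chunk size, and a fixed number of passes in a plain loop standing in for the recursion.

```python
import random
from collections import Counter


def levenshtein(a, b):
    """Edit distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def cluster_chunk(values, max_dist):
    """Single-linkage clustering of the distinct spellings in one chunk,
    cut at max_dist: spellings within max_dist of each other are merged."""
    distinct = sorted(set(values))
    parent = {v: v for v in distinct}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for i, a in enumerate(distinct):
        for b in distinct[i + 1:]:
            if levenshtein(a, b) <= max_dist:
                parent[find(a)] = find(b)

    clusters = {}
    for v in distinct:
        clusters.setdefault(find(v), []).append(v)
    return list(clusters.values())


def correct_chunk(values, max_dist):
    """Replace every value with the most common spelling in its cluster."""
    counts = Counter(values)
    mapping = {}
    for members in cluster_chunk(values, max_dist):
        # Ties are broken alphabetically so the result is deterministic.
        best = max(members, key=lambda m: (counts[m], m))
        for m in members:
            mapping[m] = best
    return [mapping[v] for v in values]


def correct_typos(values, chunk_size=200, rounds=5, max_dist=2, seed=0):
    """Shuffle, split into chunks, correct each chunk, and repeat."""
    rng = random.Random(seed)
    values = list(values)
    order = list(range(len(values)))
    for _ in range(rounds):
        rng.shuffle(order)
        for start in range(0, len(order), chunk_size):
            idx = order[start:start + chunk_size]
            fixed = correct_chunk([values[i] for i in idx], max_dist)
            for i, v in zip(idx, fixed):
                values[i] = v  # write back to the original row position
    return values


# Small example: one chunk is enough to hold all rows here.
names = ["Berlin"] * 5 + ["Berlni"] + ["Munich"] * 4 + ["Munihc"]
cleaned = correct_typos(names, chunk_size=100, rounds=1, max_dist=2)
```

The pairwise distance step is quadratic in the number of distinct spellings, which is why keeping chunks small pays off. The trade-off is that a typo can land in a chunk without its correct spelling, or tie with it, so several shuffled passes are needed before the corrections settle.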


This is a companion discussion topic for the original entry at https://kni.me/w/GQIeg88Ktm3aAmpV