Is there a workflow to use AI to check and validate column values of a dataset? For example, let’s say a dataset with a “Country” column, but some country names misspelled, or they appear in different or the entries are not country names but rather names of territories or states. I want to use AI to check this for me. HOW?
1 Like
Hi mmrostami,
The following is not exactly what you are looking for but maybe it could help you…
Cheers,
Carlos
2 Likes
@mmrostami you could indeed try a fuzzy match if you have a list of all correct spellings.
You can also try and use a (local) LLM to let it clean and detect the correct names. You might have to see how to instruct the model to just give back the results and no big speech.
4 Likes
Here’s a variety of approaches you can try.
3 Likes