Hi guys, apparently im trying to replace digits into values using cell replacer node. I succeeded but somehow there are many missing values…
Like this :
Reason for missing values:
EXAMPLE -
I declared “Internet” = 10.
Supposedly System will replace “Internet” with the number “10”.
Then there are some data spelt : Intranet , ethernet, internett etc
So the system cannot detect and replace 10 to those values. Thus, missing data.
*Not only “Internet”, but also like “stove and stoves” , “TV and television” and many more. “Internet” is just an example.
There are like 10k columns and rows in total and I would like to know, is there any possible way to clean the data?
I will be waiting here, appreciate any suggestion!! Thankyou so much!!
Hi, if understand well the original data had some misspellings like:
Internett which should be: Internet
stoves which should be: stove
TV which should be: television
Then you replaced, Internet by 10, stove by 5 and television by 4, and of course, the misspellings will be empty values.
My recommendation is to clean the values before the doing the replacement you can use the Groupby node to find the misspellings and then use the Rule Engine node to replace them.