I was wondering: is there a smarter way to handle missing values than just calculating the mean for the entire column?
I want to be able to calculate the mean based on other columns.
For example, I have two columns, Sex and Score, and there are missing values in the Score column. Can I fill in the missing values by calculating the mean for Female and Male separately, rather than the mean across both sexes? Or do I need to ungroup, calculate the mean for each group, and then group them back together?
Here is the workflow to get what you asked for:
I calculated the mean for each sex group, created rules from those means for the “Rule Engine (Dictionary)” node, and used them to replace the missing values.
Check the workflow and feel free to ask any questions.
missing.knwf (32.9 KB)
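For readers who cannot open the workflow, the same idea (one mean per group, used only where the score is missing) can be sketched outside KNIME. This is a minimal plain-Python illustration, not a translation of the attached workflow: the function name `fill_missing_with_group_mean`, the use of `None` to mark a missing value, and the sample rows are all made up for the example.

```python
from collections import defaultdict
from statistics import mean


def fill_missing_with_group_mean(rows, group_key="Sex", value_key="Score"):
    """Return copies of rows with missing values replaced by their group's mean.

    Hypothetical helper for illustration; a missing value is assumed to be None.
    """
    # Collect the known (non-missing) values for each group.
    known = defaultdict(list)
    for row in rows:
        if row[value_key] is not None:
            known[row[group_key]].append(row[value_key])

    # One mean per group. A group with no known values gets no entry.
    group_means = {group: mean(values) for group, values in known.items()}

    filled = []
    for row in rows:
        new_row = dict(row)  # copy, so the input rows are left untouched
        if new_row[value_key] is None:
            # Remains None if every value in this group is missing.
            new_row[value_key] = group_means.get(new_row[group_key])
        filled.append(new_row)
    return filled


# Made-up sample data: one missing Score per sex.
rows = [
    {"Sex": "Female", "Score": 80},
    {"Sex": "Female", "Score": 90},
    {"Sex": "Female", "Score": None},
    {"Sex": "Male", "Score": 60},
    {"Sex": "Male", "Score": None},
    {"Sex": "Male", "Score": 70},
]

filled = fill_missing_with_group_mean(rows)
for row in filled:
    print(row["Sex"], row["Score"])
```

The group means play the same role as the rules in the dictionary: each maps a value of Sex to the replacement Score. Rows that already have a score are left unchanged.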
Thanks for the effort in your reply! That’s awesome, I can see what you’re doing there. That’s a great help!
Why not use a few fewer nodes?
2019_01_16_Group_Mean_For_Missing_Value.knwf (16.0 KB)
Glad I could help. There is a Solution button on each reply, so feel free to click it so that others can find solutions more easily.