Smarter Missing Value approach

Hi all,

I was wondering, is there a smarter way to approach Missing Value as opposed to just calculating the mean for the entire column.

I want to be able to calculate the mean based on other columns, for example;

For example, I have two columns, Sex and Score. I have missing values in my score column. Can I fill the missing values in by calculate the mean for Female and Male respectively as opposed the mean for both sexes or do I need to Ungroup and calculate for each column and then Group them back together?

Thanks

Hi,

Here is the workflow to get what you asked for:

I calculated the mean for each sex group and created rules for “Rule Engine (Dictionanry)” and replaced missing values.

Check the workflow and feel free to ask your questions.
missing.knwf (32.9 KB)

Best,
Armin

Hi Armin,

Thanks for the effort in your reply! That’s awesome, I can see what you’re doing there. That’s a great help!

1 Like

Hi there!

Why not using a bit less nodes :slight_smile:
GroupMeanForMissingValue

2019_01_16_Group_Mean_For_Missing_Value.knwf (16.0 KB)

Br,
Ivan

4 Likes

Love it! thanks Ivan.

1 Like

Hi Craig!

Glad I helped. There is a Solution button on each replay so feel free to hit in order for others to find solutions easier :wink:

SolutionButton

Br,
Ivan

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.