Workflow Demonstrating GroupBy Examples

Using the adult.csv data set, this workflow demonstrates three GroupBy tasks. First, for each of the 4 groups defined by the sex and income values, calculate the total number of rows and the average age, and write the results to a CSV file. Second, for each of the same 4 groups, calculate the average of all numerical columns. Third, on the full input table, count: a) rows with missing values in column occupation; b) all rows in column occupation; c) rows with no missing value in column occupation; d) all rows in another column (e.g. marital-status). Note that the count in d) should be the same as the count in b).
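For readers who want to check the expected numbers outside KNIME, the three tasks can be sketched in plain Python. This is only an illustration, not how the workflow itself is built (the workflow uses GroupBy nodes). The sample rows, the hours-per-week column, the helper names, and the use of an empty cell to mark a missing value are all assumptions made for the sketch; the real adult.csv may encode missing values differently.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical miniature stand-in for adult.csv.
# Assumption: an empty cell marks a missing value.
SAMPLE_CSV = """age,sex,income,occupation,marital-status,hours-per-week
39,Male,<=50K,Adm-clerical,Never-married,40
50,Male,>50K,Exec-managerial,Married-civ-spouse,60
28,Female,<=50K,,Married-civ-spouse,40
37,Female,>50K,Prof-specialty,Divorced,50
45,Male,<=50K,,Never-married,30
31,Female,>50K,Sales,Never-married,50
"""

# Assumed numerical columns of the sample table.
NUMERIC_COLUMNS = ("age", "hours-per-week")


def load_rows(text):
    """Parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def group_stats(rows, keys=("sex", "income"), numeric=NUMERIC_COLUMNS):
    """Row count and mean of every numerical column per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[tuple(row[k] for k in keys)].append(row)
    stats = {}
    for key, members in sorted(groups.items()):
        entry = {"rows": len(members)}
        for col in numeric:
            entry["mean_" + col] = mean(float(m[col]) for m in members)
        stats[key] = entry
    return stats


def write_group_csv(stats, out):
    """Write the row count and average age of each group as CSV."""
    writer = csv.writer(out)
    writer.writerow(["sex", "income", "rows", "mean_age"])
    for (sex, income), entry in stats.items():
        writer.writerow([sex, income, entry["rows"], entry["mean_age"]])


def occupation_counts(rows, column="occupation", other="marital-status"):
    """Counts a) to d) on the full table."""
    missing = sum(1 for r in rows if r[column] == "")
    total = len(rows)
    return {
        "a_missing": missing,
        "b_all_rows": total,
        "c_non_missing": total - missing,
        "d_all_rows_other": sum(1 for r in rows if other in r),
    }


if __name__ == "__main__":
    rows = load_rows(SAMPLE_CSV)
    stats = group_stats(rows)
    buffer = io.StringIO()  # swap for open("groups.csv", "w", newline="")
    write_group_csv(stats, buffer)
    print(buffer.getvalue())
    print(stats)
    print(occupation_counts(rows))
```

Counts b) and d) both reduce to the number of rows in the table, which is why they must agree, and a) plus c) must add up to b).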


This is a companion discussion topic for the original entry at https://kni.me/w/lg_OGOIenjQ4pYT5