I need help with a KNIME workflow query. In this case, the issue is related to the “GroupBy” node where the desired outcome is not achieved even after setting the correct configuration. I am sharing a document that will provide more clarity on this subject. The workflow and dataset are also attached.
It would be great if we could get some help resolving this issue.
McCurr_Consultancy_Pracice_Exercise6 (1).knwf (97.0 KB)
HR_Employee_Attrition-1.xlsx|attachment (307.3 KB)
Based on the second table in your screenshot, which seems like a KNIME output, it appears that you already have your desired output there right? Or am I mistaken?
The reason for getting only Yes and No values for Attrition is because you left the all Aggregation tabsheets empty in the GroupBy node. It then just takes the unique values of Attrition.
If I groupby on
Attrition and aggregate
Age as min,max,mean and variance, the desired output is there.
Thank you for responding so quickly. I also noticed that my Aggregation tab sheet is empty in a group by node, but I was not able to figure out how to fill it in. If you could guide me on that, that would be great.
Shouldn’t be too difficult.
Go to Settings → Manaul Aggregation → select the Age column → click Add (4x) → select the aggregation method (min,max,mean,variance) → set any desired advanced settings → apply/ok → run.workflow.
More detailed info here:
I am only getting mean value when I try to do it manually.
Apologies for troubling you, but I am still not able to figure out how to get all 4 aggregation methods together (min, max, mean, variance).
Please read carefully what I wrote.
You add Age 4 times as column. And then for the first one select min, for the second one select max, etc.
Please accept my apologies for bothering you with a small and trivial problem.
Thank you so much.