clustering

eagle12 · November 19, 2017, 9:10pm

Hi,

I have a table of two columns; one column contains data and second column contains cluster numbers, and need to pick one data from each clusters (it could be random or first in each clusters). Please refer to the attached. Could anyone help me what nodes I need to use?

Thanks.

clustering.png

johannes_clarifydata · November 20, 2017, 12:53pm

Hi,

the GroupBy node is what you are looking for. The group column should be "cluster" and then you can select the first observation in each group as a manual aggregation of "data".

eagle12 · November 20, 2017, 7:21pm

Thanks so much !!