I have a table of two columns; one column contains data and second column contains cluster numbers, and need to pick one data from each clusters (it could be random or first in each clusters). Please refer to the attached. Could anyone help me what nodes I need to use?



the GroupBy node is what you are looking for. The group column should be "cluster" and then you can select the first observation in each group as a manual aggregation of "data".

Thanks so much !!