I have a problem. I try to use the RDKit Diversity Picker node to pick up the most active compounds. There are 100 molecules in the input (table 1). The number to pick is 20 and the Random seed is -1.
I repeated the picking process several times. And the results were not the same. The output files included different compounds each time.
Could you please help me? The KNIME version is 4.2.3
That’s like complaining that you get a different sample every time you do random sampling.
Using a specific random seed allows you to reproduce a random sampling event from run to run. If you change the random seed, of course you’ll get a different sampling result.
There is no such thing as “the best” random seed.
If you’re concerned about variance, then pick 3-5 seeds, do your analysis for each, and evaluate the variance in the results to see if it really matters.