This component identifies the five best-fitting continuous probability distributions for numeric columns using Python’s Fitter library. It evaluates 80 continuous probability distributions, ranks them by the sum of squared errors, and plots the five with the lowest error together with a probability plot. The component's configuration window lets you select the input column(s) to fit. The Python library behind the component handles datasets of at most 10K rows; larger datasets are downsampled to 10K rows. The output comprises a table of the five best-fitting distributions, probability plots, and a table of descriptive statistics for the selected column(s).
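
To make the ranking criterion more concrete, here is a minimal sketch in plain Python. It is not the component's code and does not call Fitter. It assumes that the sum of squared errors is measured between a normalized histogram of the data and each fitted density at the bin centers, and that downsampling is a simple random sample. The three candidate distributions, the bin count, and all function names (`downsample`, `histogram_density`, `fit_candidates`, `rank_by_sse`) are hypothetical and chosen only for illustration.

```python
import math
import random
import statistics

MAX_ROWS = 10_000  # row limit stated in the component description


def downsample(values, max_rows=MAX_ROWS, seed=0):
    """Return at most max_rows values, sampled without replacement (assumed strategy)."""
    if len(values) <= max_rows:
        return list(values)
    return random.Random(seed).sample(list(values), max_rows)


def histogram_density(values, bins=50):
    """Return bin centers and densities normalized so the histogram area is 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("column is constant; nothing to fit")
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value falls into the last bin.
        counts[min(int((v - lo) / width), bins - 1)] += 1
    centers = [lo + (i + 0.5) * width for i in range(bins)]
    densities = [c / (len(values) * width) for c in counts]
    return centers, densities


def fit_candidates(values):
    """Fit three illustrative candidates using simple closed-form estimates."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    lo, hi = min(values), max(values)
    scale = mean - lo  # scale of an exponential shifted to start at the minimum
    return {
        "norm": statistics.NormalDist(mean, std).pdf,
        "uniform": lambda x: 1.0 / (hi - lo) if lo <= x <= hi else 0.0,
        "expon": lambda x: math.exp(-(x - lo) / scale) / scale if x >= lo else 0.0,
    }


def rank_by_sse(values, bins=50, top=5):
    """Rank candidates by sum of squared errors, lowest (best) first."""
    sample = downsample(values)
    centers, densities = histogram_density(sample, bins)
    scores = {
        name: sum((pdf(x) - y) ** 2 for x, y in zip(centers, densities))
        for name, pdf in fit_candidates(sample).items()
    }
    return sorted(scores.items(), key=lambda item: item[1])[:top]


# Synthetic example column: 25,000 normally distributed values.
rng = random.Random(42)
data = [rng.gauss(10.0, 2.0) for _ in range(25_000)]
ranking = rank_by_sse(data)
for name, sse in ranking:
    print(f"{name}: {sse:.6f}")
```

With only three candidates the ranking returns three rows; the component itself reports the best five out of its 80 supported distributions.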
This is a companion discussion topic for the original entry at https://hub.knime.com/-/spaces/-/latest/~qlKBBB2EHrnh6zac/