Synthetic Data Augmentation with Copulas

Hi all! :wave:

I created a KNIME workflow that uses the Synthetic Data (Copulas) component to perform data augmentation on tabular datasets.

It starts with the Iris dataset , generates 500 synthetic rows using a Gaussian copula, and compares the real and synthetic data using Statistics , Linear Correlation , and a 3D scatter plot for visual inspection.

Great for testing, privacy, or boosting small datasets.

Check it out!

4 Likes

Cool idea - thanks for posting it!

1 Like

I can’t find the conda packages anywhere. Could you suggest how to install them? My Python skills are weak at best.


If you like the Iris Data, I can also recommend the New Iris Data :slight_smile:

2 Likes