I created a KNIME workflow that uses the Synthetic Data (Copulas) component to perform data augmentation on tabular datasets.
It starts with the Iris dataset, generates 500 synthetic rows using a Gaussian copula, and compares the real and synthetic data using Statistics, Linear Correlation, and a 3D scatter plot for visual inspection.
It is useful for generating test data, for privacy-sensitive data sharing, or for augmenting small datasets.
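For anyone curious what the Gaussian copula step does conceptually, here is a minimal Python sketch. It is not the component's implementation: it uses only NumPy and the standard library, relies on empirical (rank-based) marginals rather than fitted distributions, and runs on a made-up three-column toy table instead of Iris. The function name `sample_gaussian_copula` and the toy data are assumptions made for illustration.

```python
import numpy as np
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1
_phi = np.vectorize(_STD_NORMAL.cdf)          # normal CDF, element-wise
_phi_inv = np.vectorize(_STD_NORMAL.inv_cdf)  # normal quantile function, element-wise


def sample_gaussian_copula(real, n_samples, seed=0):
    """Draw synthetic rows from a Gaussian copula fitted to `real` (rows x columns).

    Hypothetical helper for illustration only.
    """
    real = np.asarray(real, dtype=float)
    n_rows, n_cols = real.shape
    rng = np.random.default_rng(seed)

    # 1. Map each column into (0, 1) through its ranks (empirical CDF).
    ranks = real.argsort(axis=0).argsort(axis=0) + 1
    uniforms = ranks / (n_rows + 1)

    # 2. Turn the uniforms into standard normal scores.
    normal_scores = _phi_inv(uniforms)

    # 3. The copula's parameter is the correlation matrix of those scores.
    corr = np.corrcoef(normal_scores, rowvar=False)

    # 4. Sample correlated normals and push them back into (0, 1).
    z = rng.multivariate_normal(np.zeros(n_cols), corr, size=n_samples)
    u = _phi(z)

    # 5. Return to the original scale via each column's empirical quantiles.
    return np.column_stack(
        [np.quantile(real[:, j], u[:, j]) for j in range(n_cols)]
    )


# Toy stand-in for a real table: two correlated columns and one skewed column.
toy_rng = np.random.default_rng(42)
x = toy_rng.normal(5.0, 1.0, size=150)
real = np.column_stack([
    x,
    2.0 * x + toy_rng.normal(0.0, 0.5, size=150),
    toy_rng.exponential(1.0, size=150),
])

synthetic = sample_gaussian_copula(real, n_samples=500)
print(synthetic.shape)  # (500, 3)

# Compare correlation matrices, similar in spirit to the Linear Correlation node.
print(np.corrcoef(real, rowvar=False).round(2))
print(np.corrcoef(synthetic, rowvar=False).round(2))
```

The two printed correlation matrices should look broadly similar, which is the same check the workflow performs when comparing real and synthetic data. Because this sketch uses empirical quantiles, synthetic values never leave the observed range of each column; a version with fitted parametric marginals would not have that restriction.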
Thanks for the reply. I had Anaconda installed, but for some unknown reason its prompt disappeared and won't reinstall. I finally got the copulas package installed from conda-forge through a cmd prompt with "conda install conda-forge::copulas", into an existing environment. Your workflow is now working for me. Thanks again, nice work.
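If someone else hits the same install problem, a quick way to check whether the package is visible from a given environment is to run a small script with that environment's Python. This is only a sketch using the standard library; the helper name `package_status` is made up for illustration.

```python
import importlib.util
from importlib import metadata


def package_status(name):
    """Return (importable, version) for a top-level package name.

    Hypothetical helper: `version` is None when no installed
    distribution with that name is found.
    """
    importable = importlib.util.find_spec(name) is not None
    try:
        version = metadata.version(name)
    except metadata.PackageNotFoundError:
        version = None
    return importable, version


importable, version = package_status("copulas")
print(f"copulas importable: {importable}, version: {version}")
```

If this reports that the package is not importable, the install most likely went into a different environment than the one the script ran in.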