Writing file to S3 bucket

tjmbakwe · January 11, 2021, 2:02am

Currently when I write a file into S3 bucket, it gives output files in partitions. How do I write a file to S3 bucket in a single CSV file, rather than the files being divided into multiple parts? A single file which contains the contents from all the partitions.

ipazin · January 13, 2021, 10:52am

Hello @tjmbakwe,

and welcome to KNIME Community!

What nodes do you use and how does you workflow design looks like?

Br,
Ivan

tobias.koetter · January 15, 2021, 12:27pm

Hi,
if you are using the Spark to CSV node you can override the Spark DataFrame partition using the Overwrite partitions count option which results in a single csv file. By default Spark is simply writing out as many files as the DataFrame has partitions.
Bye
Tobias

system · July 17, 2021, 12:27am

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.