the KNIME Spark nodes creates usual Spark data frames. Data frames are not persisted by default, they are computed on the fly and only available in memory. This means they are lost if you stop or restart the cluster. You can use nodes like Spark to Parquet to persist your data frame to e.g. DBFS or S3.