About DB to Spark node.


I was wondering if I can use a DB to Spark node to convert from SAP HANA to Spark RDD.

I’d appreciate it if someone could let me know.



After some investigation I do not see any reason why you wouldn’t be able to use the ‘DB to Spark’ node to pull data from SAP Hana into a Spark RDD/Dataframe.

This should be a relatively straightforward process. To start you would need to setup your DB connection for Hana and select a table you want to feed into Spark. Then setup your remote files system connection (HDFS, S3, etc…) and create your Spark context. These two connections will provide your inputs for the DB to Spark node. This should be enough to get you started.