Hi Tobias,
I belong to the same team as Shalini. I manage the knime and hadoop servers.
Thank you for your response.
Looking at the logs, i could figure out blockers when running the workflow in local and server mode.
When we run the workflow in local, Hive loader get stuck at 59% and never insert values from temp to original table. I am pasting the error below.
2019-07-02 09:14:12,463 : DEBUG : KNIME-Worker-38 : HiveLoader : Hive Loader : 6:373 : Temporary table sucessful created
2019-07-02 09:14:12,463 : DEBUG : KNIME-Worker-38 : HiveLoader : Hive Loader : 6:373 : Load data from hdfs
2019-07-02 09:14:14,045 : DEBUG : KNIME-Worker-38 : HiveLoader : Hive Loader : 6:373 : Executing ‘LOAD DATA INPATH ‘/user/harsingh/hive-import3211c370_a9ad_4ca0_a45c_3c5ddbbf62846119353587619459221.csv’ INTO TABLE account_mapping.lead_test_knime_287102c9_58b1_4333_bc83_a43fbdcb27e1’
2019-07-02 09:14:15,911 : DEBUG : KNIME-Worker-38 : HiveLoader : Hive Loader : 6:373 : Copying data to existing table
2019-07-02 09:14:15,911 : DEBUG : KNIME-Worker-38 : HiveLoader : Hive Loader : 6:373 : Executing 'INSERT INTO TABLE account_mapping.lead_test_knime
SELECT parent_account_id,segment
FROM account_mapping.lead_test_knime_287102c9_58b1_4333_bc83_a43fbdcb27e1’
When running the workflow in server mode, Hive loader fails with “timed out” error. Httpfs node is reachable from Knime and test connection gives “succeeded” result.
Httpfs is an ELB which redirects the request to an internal server and it’s timeout value is set to 900. I don’t see any issue with Hive here. I am pasting the error below.
2019-07-02 03:46:26,656 : DEBUG : KNIME-Worker-24 : UserGroupUtil : Hive Loader : 2:373 : Returning simple authentication user: harsingh (auth:SIMPLE)
2019-07-02 03:47:26,735 : DEBUG : KNIME-Worker-24 : Node : Hive Loader : 2:373 : reset
2019-07-02 03:47:26,735 : ERROR : KNIME-Worker-24 : Node : Hive Loader : 2:373 : Execute failed: hdfs-lb-1976646374.us-west-2.elb.amazonaws.com:14000: connect timed ou
t
2019-07-02 03:47:26,735 : DEBUG : KNIME-Worker-24 : Node : Hive Loader : 2:373 : Execute failed: hdfs-lb-1976646374.us-west-2.elb.amazonaws.com:14000: connect timed ou
t
java.net.SocketTimeoutException: hdfs-lb-1976646374.us-west-2.elb.amazonaws.com:14000: connect timed out
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350)
at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
at java.net.Socket.connect(Socket.java:589)
at sun.net.NetworkClient.doConnect(NetworkClient.java:175)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:463)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:558)
at sun.net.www.http.HttpClient.(HttpClient.java:242)
at sun.net.www.http.HttpClient.New(HttpClient.java:339)
at sun.net.www.http.HttpClient.New(HttpClient.java:357)
at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1220)
My question is if the local mode is creating a temp table with the same Httpfs connection then why it’s getting “timed out” in server mode and what should we do to insert complete records from temp to original table.
Thanks,
Harpreet