Collecting KNIME forum data

qqilihq · December 17, 2018, 6:38pm

(sorry, hadn’t seen this post before, probably b/c it was split from the other thread)

Hi Armin,

the reason is that the content on the follow-up pages is loaded dynamically via XHR/AJAX. The HttpRetriever only sees the static page, not the content that JavaScript adds afterwards. That’s the main reason the Selenium Nodes exist.

In a nutshell, consider it like this:

HttpRetriever from Palladian: pure downloading of pages via HTTP/HTTPS, without running any JavaScript.
Selenium: complete browser logic, including the JavaScript that loads content later.
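
To make the difference concrete, here is a rough Python sketch (not KNIME, and not how either node is implemented). The two HTML snippets, the `/posts.json` URL and the `article.post` markup are made up for illustration: one stands for what a plain HTTP download would return, the other for what the page might look like after a browser has run the script.

```python
from html.parser import HTMLParser

# Hypothetical response as a plain HTTP client would receive it:
# the container is empty, the posts are only fetched by the script.
STATIC_HTML = """
<html><body>
  <div id="posts"></div>
  <script>
    // A real browser would run this and fill #posts via XHR.
    fetch('/posts.json?page=2').then(r => r.json()).then(render);
  </script>
</body></html>
"""

# Hypothetical DOM after a browser has executed the script above.
RENDERED_HTML = """
<html><body>
  <div id="posts">
    <article class="post">First reply</article>
    <article class="post">Second reply</article>
  </div>
</body></html>
"""


class PostCounter(HTMLParser):
    """Counts <article class="post"> elements in an HTML string."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def handle_starttag(self, tag, attrs):
        if tag == "article" and ("class", "post") in attrs:
            self.count += 1


def count_posts(html):
    parser = PostCounter()
    parser.feed(html)
    parser.close()
    return parser.count


static_count = count_posts(STATIC_HTML)      # what a pure download can see
rendered_count = count_posts(RENDERED_HTML)  # what a full browser can see
print(static_count, rendered_count)
```

The parser is the same in both cases; only the input differs. A pure download never gets past the first snippet, which is why the posts on the follow-up pages don't show up.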

– Philipp