Collecting KNIME forum data

(sorry, hadn’t seen this post before, probably b/c it was split from the other thread)

Hi Armin,

the reason is, that the content on the follow-up pages is loaded dynamically via XHR / AJAX. The HttpRetriever only sees the static page, and not the content which is added later through additional JavaScript. That’s the main motivation for the Selenium Nodes to exist.

In a nutshell, consider it like this:

  • HttpRetriever from Palladian: Pure downloading of pages via HTTP/HTTPS
  • Selenium: Complete browser logic.

– Philipp

1 Like