(sorry, hadn’t seen this post before, probably b/c it was split from the other thread)
Hi Armin,
the reason is, that the content on the follow-up pages is loaded dynamically via XHR / AJAX. The HttpRetriever only sees the static page, and not the content which is added later through additional JavaScript. That’s the main motivation for the Selenium Nodes to exist.
In a nutshell, consider it like this:
- HttpRetriever from Palladian: Pure downloading of pages via HTTP/HTTPS
- Selenium: Complete browser logic.
– Philipp