Read a web page and save it as PDF

Hi, can anyone guide me on how to read a web page (login with credentials required) and save it in PDF format?

I tried using the HTTP Retriever node together with a count loop, but it does not work as expected.

Hey,

have you tried using the KNIME Webinteraction (Labs) Extension?

There is no direct way to download a webpage as a .pdf; however, you can use the Content Retriever (Labs) node to get the content of the webpage and create a PDF afterwards.

Do you want to screenshot or “print” the webpage to a PDF? Have a look at this thread:

Thanks for the reply. I have tried, but I was not able to sequence the nodes.

Thanks for the reply. I tried using the workflow model, but I am getting the error below on the nodes.

Did you follow the instructions here? Selenium Nodes — Download

-P.

I requested a trial license and entered it; the node now executes fine. Thank you.

Can you suggest how we can provide credentials or a token in the Webdriver or Start Webdriver factory node for URLs that require authentication?

Great to hear!

Authentication can take many forms. Can you clarify what kind of authentication you are facing? An example URL would be even more helpful.

Thank you for your help on this.

Example URL: I am trying to access our application's internal wiki sites, which require authentication at the initial login (e.g. an internal URL or the Azure Portal). That is, users need to provide credentials such as a username and password (Basic authentication) to access the content or resources behind that URL.

Also, how can we create PDFs for multiple pages of the website? I see that the Navigate node only has a forward option, but it results in the same PDF being created.

Please suggest. Thanks.

Can you clarify what kind of login this is - if the URL is internal, a screenshot would help.

Are we talking about (literally!) HTTP Basic? I.e. is there a (browser-level) dialog box for username and password? Or is this a web form rendered in the browser window? How you build your workflow depends on this: for the latter, use the Send Keys and Submit nodes to fill out the login details.

Plain “Basic” authentication has a dedicated node as well:

Note that this really only works for Basic.
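To make the distinction concrete, here is a minimal Python sketch of what HTTP Basic means on the wire. It is not part of any KNIME or Selenium node; the function names `basic_auth_header` and `looks_like_http_basic` are hypothetical helpers chosen for illustration. With Basic, the server answers with status 401 and a `WWW-Authenticate: Basic ...` header, and the client replies with a base64-encoded `username:password` pair. A login form rendered inside the page does none of this, which is why it needs the Send Keys and Submit approach instead.

```python
import base64


def basic_auth_header(username, password):
    """Build the Authorization header used by HTTP Basic authentication."""
    # The credentials are joined with a colon and base64-encoded (not encrypted).
    raw = f"{username}:{password}".encode("utf-8")
    token = base64.b64encode(raw).decode("ascii")
    return {"Authorization": f"Basic {token}"}


def looks_like_http_basic(status_code, headers):
    """Return True if a response is a 401 challenge naming the Basic scheme.

    `headers` is assumed to be a plain dict of response headers.
    """
    challenge = ""
    for name, value in headers.items():
        # Header names are case-insensitive.
        if name.lower() == "www-authenticate":
            challenge = value
    return status_code == 401 and challenge.strip().lower().startswith("basic")


if __name__ == "__main__":
    print(basic_auth_header("Aladdin", "open sesame"))
    print(looks_like_http_basic(401, {"WWW-Authenticate": 'Basic realm="wiki"'}))
```

If the second check is false for your URL (for example, the server returns a normal page containing a login form, or redirects to a sign-in page), the site is most likely not using HTTP Basic.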

Also, how can we create PDFs for multiple pages of the website? I see that the Navigate node only has a forward option, but it results in the same PDF being created.

Typically you’ll need to navigate to the URLs one-by-one and execute the node. So you’ll likely need to use a loop node.
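The loop idea can be sketched in plain Python, outside of KNIME. This is only an illustration of the logic, assuming the cause of "the same PDF" is that every iteration writes to the same output file: each URL gets its own iteration and its own output name. The helper names `pdf_name_for` and `plan_exports` and the example URLs are hypothetical; in a workflow, the URL list would be a table fed into a loop, and the file name would be passed to the PDF-writing step as a flow variable.

```python
import re
from urllib.parse import urlparse


def pdf_name_for(url, index):
    """Derive a distinct, filesystem-friendly PDF name for one page."""
    parsed = urlparse(url)
    # Collapse every run of non-alphanumeric characters into a single dash.
    slug = re.sub(r"[^A-Za-z0-9]+", "-", parsed.netloc + parsed.path).strip("-")
    # The zero-padded index keeps names unique even if two slugs collide.
    return f"{index:03d}-{slug or 'page'}.pdf"


def plan_exports(urls):
    """Pair every URL with its own output file, one entry per loop iteration."""
    return [(url, pdf_name_for(url, i)) for i, url in enumerate(urls, start=1)]


if __name__ == "__main__":
    pages = [
        "https://wiki.example.com/Home",
        "https://wiki.example.com/Setup/Guide",
    ]
    for url, filename in plan_exports(pages):
        # In a real run: navigate to `url`, then print the page to `filename`.
        print(url, "->", filename)
```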

Does this help?

Thank you, qqilihq, for your support and for sharing these details. Your help is very much appreciated.
