I am a new joiner to the Knime. I hope my question here is not too stupid
There are 2 workflows in the file,
-the 1st flow I try is with the node “get request”, I supposed I can use “json path” node afterwards to extract data, however, I dont know why is not working
-then i try to find a solution in the forum, I saw someone share another workflow with node “HHTP retriever”, I followed, but it fails again
May anyone tell me what’s wrong there?
get request problem.knwf (17.7 KB)
https://www.hotelsmag.com/Industry/News is not JSON but HTML format. You can e.g. use the HTML Parser node connected to the HTTP Retriever to parse it, if this is what you’re looking for:
Afterwards, you can treat it e.g. with the XPath node to extract specific parts of the parsed HTML DOM:
Thank you so much, Philipp! Your explanation is very clear
But I just have another problem here, see if you or other fellows can help
I use some online XPath tester, the path worked
But I don’t know why I input the same path into the node, the result shows nothing
hotelsmag_xpath problem.knwf (17.7 KB)
You need to prefix element names with
dns: so that they work properly (see the “Namespace” tab in the XPath node configuration). Thus, the XPath query should be:
I have updated the workflow which extracts all the
h1 headlines and put it for you to my NodePit Space for you. Hope this helps!
Thank you again!! I am self-learning this language, you helped a lot
This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.