How to move String to XML ?

veniapputrii · July 28, 2023, 1:41pm

Hallo everyone!
today, I’ve been follow this workflow
And I found the result of Xpath node as picture below :

And then, I follow this step. I use Xpath Node for scrapping a websites that I take.
but the results I get are not the same as the workflow above, where the results from the Xpath node for the Item column are of type String data, not XML.

Even though all my configurations are the same as the workflow example above.

Anyone can help me?

Thankyou,
Best regards
Veni

armingrudd · July 28, 2023, 2:14pm

Hi @veniapputrii,

In the Xpath query settings, you can change the return type to “Node cell” and then you will have your desired output.

veniapputrii · July 28, 2023, 2:35pm

Thankyou, @armingrudd

I have missed the configuration for this one

veniapputrii · July 28, 2023, 3:14pm

Hi, @armingrudd
I still have one question about Xpath…
I wanna fetch the topic by path : /item/description/
But, I want to delete a picture in the description. Do you know of a path expression to handle it?

armingrudd · July 28, 2023, 3:22pm

Is it possible for you to share the XML? Where is the topic?
Maybe XML and the value that you want.

If not possible, I think, after parsing the XML with Xpath, you can clean it using a String Manipulation node.

veniapputrii · July 29, 2023, 2:38pm

Hallo, @armingrudd

below is a link of the XML file that I used :
https://www.suara.com/rss/news

the part of tag in < item > / < description > / …
there is a tag for “img”

I only want to take the part of the article that is written in the < description > tag. However, some articles in the < description > have tags to load images. While I don’t need it.

Can I remove the existing part of “img” ?

I’ve tried using the String manipulation node. By using the “replace” function. But this doesn’t work.

armingrudd · July 30, 2023, 5:00am

Try this:
strip(regexReplace($description$, "<img\\s.*?/>", ""))

Where $description$ is the output column of the xpath …/item/description

veniapputrii · July 30, 2023, 8:37am

solved!!
Thankyou so much, @armingrudd
where can I learn more about its functions and uses?

armingrudd · July 31, 2023, 6:46am

Here is a short blog post which you may find helpful.

system · August 7, 2023, 6:46am

This topic was automatically closed 7 days after the last reply. New replies are no longer allowed.