Although I don’t understand exactly what you want; I still wanted to give an idea by guessing. There are many ways. I think the following will give you an idea of ​​what I share.
Create a simple workflow like the picture below
Paste the xpath path in the image into xpath and always have “dns:” at the beginning. you can find the path to the file in xpath.
the trick is to get the XPath expression right (this can be a bit fiddly with the XPath node, as it involves some trial-and-error). Anyways, here’s one possible solution:
I’m using the following XPath expression to grab the links to MS Excel files:
Get the href attribute of all <a> tags which contain an <img> tag which has a class attribute which contains xls_icon_mini:
(the dns: prefixes are necessary because XPath is very strict in regards to name spaces and dns: represents the http://www.w3.org/1999/xhtml namespace; see tab “Namespace” in the XPath dialog)
On this page, the structure is slightly different (someone should tell those IRI guys to get a proper CMS). The <img> tag is not within the <a>, so you’ll need to modify your XPath.
For the second case, the XPath would look as follows:
In case you want to combine both queries to make them more generic (so that they work on both pages), you should be able to simply combine them with a |: