Hyperlink to URL extraction in KNIME

Greetings,

I came across a problem that was quite easily solved by using macros in Excel but I’d like to automate this part of the process as well.

In my excel files that are being read with Excel Reader Node I have two columns that are hyperlinks.

For my further analysis I need to get the URLs extracted and i can’t find a solution to the problem in KNIME.

Is there a way to use String Manipulation Regex or something like that?

In Excel I’m using such code (VBA):

Sub Extracthyperlinks()
'Updateby Extendoffice
Dim Rng As Range
Dim WorkRng As Range
On Error Resume Next
xTitleId = “KutoolsforExcel”
Set WorkRng = Application.Selection
Set WorkRng = Application.InputBox(“Range”, xTitleId, WorkRng.Address, Type:=8)
For Each Rng In WorkRng
If Rng.Hyperlinks.Count > 0 Then
Rng.Value = Rng.Hyperlinks.Item(1).Address
End If
Next
End Sub

Thanks for all the help in advance!

Hi there,

one solution - the Regex Extractor from Palladian has a built in templates for URL extraction (beside others):

Hope that helps!

-Philipp

5 Likes

@helpmeplease you can maybe use “The ultimate split path, with a single regex” like in this example:

image

But maybe the Palladian node is the better option :slight_smile:

3 Likes

Thank you for the help!

1 Like

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.