Ununiform string extraction

santoss · July 3, 2023, 2:47pm

Hi,

I am trying to extract the number of some documents that resides in a ununiform cell. I would need to extract from AB till the end of the five digit number. I wrote three examples of the cells in the table below. I have tried with the cell splitter but I can’t seem to work out how to do it. Do you have any suggestions of the nodes I need to use.

Workflow started - AB55555-request
Workflow started change to AB51613 - request
Workflow started -AB65243 request

Thank you for your help!

takbb · July 3, 2023, 2:57pm

Hi @santoss, the Regex Split node should help you here. Is it always “AB”?

if so you could use this regex pattern:
.*(AB[0-9]{5}).*

Alternatively, if it’s any two-letter capitals you could use:

.*([A-Z]{2}[0-9]{5}).*

or if it could be capitals, or lowercase
.*([A-Za-z]{2}[0-9]{5}).*

gonhaddock · July 3, 2023, 3:51pm

Hello @santoss and welcome to the KNIME forum

‘String Manipulation’ node :

substr($text$
	, indexOfChars($text$, regexReplace($text$, "\\D", "")) - 2
	, 7)

BR

takbb · July 3, 2023, 4:03pm

@gonhaddock, ok… lol… another challenge I see…

String Manipulation node

regexReplace($text$, ".*([A-Z]{2}[0-9]{5}).*", "$1")

and yes, where are my manners? Welcome @santoss !

system · October 1, 2023, 4:04pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.