Non-uniform string extraction

Hi,

I am trying to extract a document number from cells whose content is not uniform. I need to extract everything from "AB" through the end of the five-digit number. Three example cells are listed below. I have tried the Cell Splitter node, but I can’t work out how to do it. Do you have any suggestions for which nodes I should use?

Workflow started - AB55555-request
Workflow started change to AB51613 - request
Workflow started -AB65243 request

Thank you for your help!

Hi @santoss, the Regex Split node should help you here. Is it always “AB”?

If so, you could use this regex pattern:
.*(AB[0-9]{5}).*

Alternatively, if it could be any two capital letters, you could use:

.*([A-Z]{2}[0-9]{5}).*

Or, if the letters could be uppercase or lowercase:
.*([A-Za-z]{2}[0-9]{5}).*
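
For anyone who wants to check these patterns outside KNIME, here is a minimal Python sketch that runs the three patterns against the sample cells from the question. It assumes the Regex Split node needs the pattern to match the whole cell (which is why each pattern is wrapped in `.*`), so it uses `re.fullmatch`. The names `cells`, `PATTERNS` and `extract` are made up for this illustration and are not part of KNIME.

```python
import re

# Sample cells copied from the question.
cells = [
    "Workflow started - AB55555-request",
    "Workflow started change to AB51613 - request",
    "Workflow started -AB65243 request",
]

# The three patterns suggested above, from strictest to loosest.
PATTERNS = {
    "fixed_prefix": r".*(AB[0-9]{5}).*",
    "two_capitals": r".*([A-Z]{2}[0-9]{5}).*",
    "any_case": r".*([A-Za-z]{2}[0-9]{5}).*",
}


def extract(pattern, text):
    """Return capture group 1 if the pattern matches the whole text, else None."""
    match = re.fullmatch(pattern, text)
    return match.group(1) if match else None


# One list of extracted values per pattern, in the same order as `cells`.
results = {name: [extract(p, c) for c in cells] for name, p in PATTERNS.items()}

for name, values in results.items():
    print(name, values)
```

On these three cells all three patterns should give the same answer. They would only differ on data where the prefix is not literally "AB".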

Hello @santoss and welcome to the KNIME forum

‘String Manipulation’ node :tophat: :

substr($text$
	, indexOfChars($text$, regexReplace($text$, "\\D", "")) - 2
	, 7)

BR
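
The expression above finds the position of the first digit, steps back two characters, and takes seven characters from there. Below is a rough Python sketch of the same idea, not a translation of the KNIME functions. The helper name `extract_by_first_digit` and its parameters are hypothetical. It also shows the main caveat: if a digit appears earlier in the cell, the slice starts in the wrong place.

```python
import re


def extract_by_first_digit(text, letters=2, length=7):
    """Slice `length` characters starting `letters` characters before the first digit.

    Returns None if the text has no digit or the digit is too close to the start.
    """
    first_digit = re.search(r"\d", text)
    if first_digit is None:
        return None
    start = first_digit.start() - letters
    if start < 0:
        return None
    return text[start:start + length]


samples = [
    "Workflow started - AB55555-request",
    "Workflow started change to AB51613 - request",
    "Workflow started -AB65243 request",
]
for sample in samples:
    print(extract_by_first_digit(sample))

# Caveat: an earlier digit shifts the slice, so the result is not the document number.
print(repr(extract_by_first_digit("Step 2 - AB55555")))
```

This works for the three sample cells because the document number contains the first digits in each cell. If other digits can appear earlier in the text, a regex with a capture group is the safer choice.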

@gonhaddock, ok… lol… another challenge I see… :wink:

String Manipulation node

regexReplace($text$, ".*([A-Z]{2}[0-9]{5}).*", "$1")

and yes, where are my manners? Welcome @santoss !
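
One thing to watch with the replace-with-capture-group approach: in Python, at least, a cell that does not match the pattern comes back unchanged rather than empty. I would assume the same holds for the String Manipulation expression above, but I have not verified that. A small sketch, where `replace_with_group` is a hypothetical helper name:

```python
import re

PATTERN = r".*([A-Z]{2}[0-9]{5}).*"


def replace_with_group(text):
    """Replace the whole matched text with capture group 1; non-matching text is returned as-is."""
    return re.sub(PATTERN, r"\1", text)


print(replace_with_group("Workflow started change to AB51613 - request"))
print(replace_with_group("no document number"))
```

If some cells may not contain a document number at all, it may be worth flagging those rows separately before relying on the extracted column.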
