Hi,
I’ve been following the RegEx discusions on the forum - I am a bit challenged by the repeating group approach. I have a basic problem where I have a list of items that I need to split up.
Example of code list: SVV11283687 , ICC472987789, EVT3276428373
A simple Cell Splitter could do the job and use a “,” to split out a list - problem is that the above example is the best case scenario. The majority of cases I deal with has “junk text” thrown in around the codes - but the code patterns (e.g. SVV00000000) remain constant. I have tried focusing on the separators - and cleaning the text around the codes, but the variation is huge and creates problems where the sequence of cleaning needs to also be considered. My “gut feel” tells me it would be better to just extract the codes.
RegEx has the notion of a repeating groups - and Iimagine I need to identify groups to create columns that I can later concatenate to form a list that I can split out. Problem is that RegEx Splitter doesn’t seem to provide a nice way to get the following result:
Example
<Group 1> SVV11283687 <Group 2> ICC472987789 <Group 3> EVT3276428373
Does anyone have a view on how to get this broken up into groups so I can make a list? (Note: Some entries have more than 3 codes - could go up to 9 codes with varying order).