How can I extract this data and turn it into a column?

I tried to use the String Splitter (Regex) node to extract the data, since it is a scalable solution that works for any number of drivers (imagine there is a huge number of drivers).
This is my pattern: (?[A-Z][a-z]+(?:\s+[A-Z][a-z]+))\s+(?||(?:Active|Inactive)||(?:\s+||(?:Active|Inactive)||))\s+(?\d{10}).*?Transport Type\s+(?\w+)
But I am having issues with it.
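For anyone who wants to try the idea outside KNIME, here is a minimal Python sketch of the same kind of pattern. The group names (`name`, `status`, `number`, `transport`) and the sample text are hypothetical, since the names in the pattern above did not survive posting and no data file is attached. Note that Python writes named groups as `(?P<name>...)`, while the Java-style regex used by KNIME writes them as `(?<name>...)`. The sketch also allows names with more than two words, which is an assumption on my side.

```python
import re

# Hypothetical group names: the ones in the original pattern were lost.
PATTERN = re.compile(
    r"(?P<name>[A-Z][a-z]+(?:\s+[A-Z][a-z]+)+)\s+"   # two or more capitalized words
    r"(?P<status>Active|Inactive)\s+"                # driver status
    r"(?P<number>\d{10})"                            # a 10-digit number
    r".*?Transport Type\s+(?P<transport>\w+)",       # first word after the label
    re.DOTALL,                                       # let .*? cross line breaks
)

def extract_drivers(text):
    """Return one dict per driver record found in the text."""
    return [m.groupdict() for m in PATTERN.finditer(text)]

# Made-up sample text, only to show the shape of the result.
sample = (
    "John Smith Active 1234567890 Expiry 2025 Transport Type Truck "
    "Mary Ann Lee Inactive 0987654321 Transport Type Van"
)

for row in extract_drivers(sample):
    print(row)
```

Because `finditer` keeps going until the text is exhausted, the same pattern handles any number of drivers, and each named group becomes one output column.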

Expected outcome (it should be like this):

Hi @Jia_Qi
Would it be possible for you to upload the data file you’re referring to? That way, we can experiment with it directly and help structure the data exactly the way you’re aiming for.
In the meantime, based on the structure visible on the screen, I’ve built a preliminary KNIME flow that formats the data as I believe you intended.
It includes:
Realignment of the driver name and status columns
Proper mapping of license expiry dates
Clean-up of any structural inconsistencies
Please feel free to test it once you upload the source file, and I’ll gladly help adjust it further if needed.
Looking forward to it!
Best,
Alpay
Before:
After:

Data:
driver_license_status.xlsx (13.3 KB)
Flow:
Jia.knwf (34.9 KB)
Hi,
As Alpay said, it’s hard to develop a solution without data.
But I gave it a try:
Hi, I am using the Image Reader node to read this image and then passing it to the Tess4J node.
Here is the image:
Here is the output of Tess4J:
After that, I use the String Manipulation node to correct the word “I nactive” to “Inactive”, and now I am stuck on organizing the data cells.
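To illustrate the clean-up and organizing step, here is a rough Python sketch. It assumes the OCR output has one driver per line, with the name followed by the status, which may not match the real Tess4J output. The function name `organize` and the sample lines are made up for illustration.

```python
import re

# Rejoins the status word that OCR split into "I nactive".
STATUS_FIX = re.compile(r"\bI\s+nactive\b")

# Assumed line layout: "<driver name> <Active|Inactive>".
ROW = re.compile(r"^(?P<name>.+?)\s+(?P<status>Active|Inactive)\s*$")

def organize(ocr_text):
    """Return (name, status) pairs, skipping lines that do not fit the layout."""
    rows = []
    for line in ocr_text.splitlines():
        line = STATUS_FIX.sub("Inactive", line.strip())
        match = ROW.match(line)
        if match:
            rows.append((match.group("name"), match.group("status")))
    return rows

# Made-up OCR text, including the "I nactive" artifact and a noise line.
ocr_text = "John Smith Active\nMary Lee I nactive\n\nnoise"
print(organize(ocr_text))
```

In KNIME terms, the first regex corresponds to the String Manipulation fix, and the second corresponds to splitting each cell into a name column and a status column.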