Group similar strings

Is it possible to cluster or group similar strings of data?
For example, I have a list of various SKU numbers for different products. Each SKU specifies a color and a size. However, I would like to group the SKUs by just their style number, which is typically the first portion of the string.

ODY80JOB-SK18
ODY80-WH19
ODY80JOB-SK18
ODY80JOB-SK18
ODY80-TS-HP19
ODY80JOB-SK18
ODY80-TS-HP19
ODY80L-JR-TQ19
ODY80JOB-SK18
ODY80L-JR-TQ19
ODY80PL-SL-PK19
ODY80-KR-LM19
ODY80L-JR-TQ19
ODY80L-HB-MN19
ODY80-KR-LM19
ODY80-KR-LM19
ODY80L-JR-TQ19
ODY80-KR-LM19
ODY80PL-SL-PK19
ODY80L-HB-MRN

1 Like

Hi @carleeardis

Welcome to KNIME. Of course it’s possible, almost everything is possible in KNIME. I created a workflow with 3 nodes: String Manipulation, Rule Engine and Cell Splitter that can be usefull to extract/split string values. group_similair_strings.knwf (32.9 KB). As an example of some options. More information, answers and inspiration you find on the KNIME Hub.
Hope this helps. If not feel free to ask. Happy KNIMEing.
gr. Hans

4 Likes

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.