Sequential Data Transformation with resetting counter.

Hi, please help me out.

Current data:

Product

A

B

C

EOS

A

C

D

D

EOS

B

B

D

D

F

EOS

(Continues)

 

I want to have data like this

Product, Group

A, 1

B, 1

C, 1

EOS, 1

A, 2

C, 2

D, 2

D, 2

EOS, 2

B, 3

B, 3

D, 3

D, 3

F, 3

EOS, 3

(Continues)

 How can I make it without writing Java? It is great to have a sample work flow.

I am stacked when doing text mining.

Hi,

it would be great if you could supply a bit more info as to what is the use case (EOS = end of service?) so that others can understand better what it is you are actually seeking to do beyond the example given.

I have attached a workflow that looks for "EOS" in a lagged product column, sets a marker and builds groups via computing the cumulative sum of the "EOS" marker. It seems very cumbersome and probably someone else can suggest a more elegant approach, but at least it does what you are asking for and you won't have to use a Java snippet. ;-)

Best, Johannes

Hi, Johannes, thanks for replying.

EOS means End of Sentence. For example,

original data is

Product_sentence

A B C EOS

A C D D EOS

B B D D F EOS

 

I cannot have the original data because of security policy. All I got is sequential data like below.

Product

A

B

C

EOS

A

C

D

D

EOS

B

B

D

D

F

EOS

And, I want to have the data like:

Product, Group

A, 1

B, 1

C, 1

EOS, 1

A, 2

C, 2

D, 2

D, 2

EOS, 2

B, 3

B, 3

D, 3

D, 3

F, 3

EOS, 3

 

Update:

Hello Johannes, your work flow has worked !!! Thanks!!!!

And, I will study your knwf.

Thanks again!