Concise means of removing redundant data within unique identifier

After many steps I have a table with unique categories by column. Each row has a unique identifier tying these column categories and their range of cell values (string and integer) together (see “temp1” graphic here “ListingId” column header).

Using text processing the string values are standardized text descriptions and the integers are additional identifiers for each of the standardized text. (see temp2 file here)

The text analysis is completed on both a series of matched drop-down values at the point of input (I call these Pre-Set values) and also on Free-Form text which I join after processing. Sometimes I discover and label the same values (text and identifier) for each unique property. I need to remove this redundancy when the strings and identifiers match. (see temp3 here) And note how the first set of data and the second set of data both contain “Quartz Counters 2023” for the Property ID 6642651
temp3

I am working on a many step, convoluted means to remove redundant Values and Identifiers, but am hoping for a far more concise method from you. The table is fairly large with 15609 rows and 88 columns, so my convoluted method is really beginning to suck. Each column is a different header title, so anything related to “Duplicate Row Filter” won’t work.

thanks

To be honest I do not fully grasp what you want to do. Maybe an example workflow that demonstrates your full challenge without giving away any secrets might alosnhelp.

Concerning duplicates, question is if the use of a rank Id might help like in this example:

Also the corresponding article and sample workflow could give you some inspiration, maybe.

2 Likes

Thanks for the link; I will check it out. I am thinking “first/last” may work. Since each cell of each column currently contains multiple strings and multiple integers I believe I would need to use Cell Splitter to make this work? I need to play with it a bit. Thanks