Hi there,
I want to compare two tables in these situations:
a different number of rows
a different number of columns
only a few columns in common (for example, 3 columns)
I’m familiar with “join”, “table difference finder” and “compare two tables”.
But I want more statistical measures, for example a similarity percentage.
Maybe it would be better to change my approach and search for ‘dataset comparison’ instead.
Thanks for your replies.
So you want to create a measure of similarity between two tables, correct?
In that case you can use the Extract Table Dimension node (available on the KNIME Hub). It outputs the number of rows and columns of each table.
You could then concatenate both tables, check how the number of columns in the result differs from the column counts of the original tables, and use this as a baseline for your similarity metric. The workflow could look something like this:
The important part here is that the Concatenate node uses the option “Use intersection of columns”.
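If you would rather prototype the metric in a script first, here is a rough Python sketch of the same idea. It is only an illustration under a few assumptions of mine: each table is a list of dicts (one dict per row), the function name `table_similarity` is made up, and I chose Jaccard similarity (size of the intersection divided by size of the union) as the percentage. None of that comes from a KNIME node, so adjust it to whatever definition of similarity fits your data.

```python
from typing import Any

Row = dict[str, Any]


def table_similarity(left: list[Row], right: list[Row]) -> dict[str, Any]:
    """Hypothetical helper: column and row overlap between two tables."""
    # Collect the column names that appear in each table.
    left_cols = set().union(*(row.keys() for row in left))
    right_cols = set().union(*(row.keys() for row in right))

    # Same idea as "Use intersection of columns": keep only shared columns.
    shared = sorted(left_cols & right_cols)
    all_cols = left_cols | right_cols
    column_similarity = len(shared) / len(all_cols) if all_cols else 1.0

    if shared:
        # Reduce every row to its values in the shared columns, then compare
        # the two sets of distinct rows (assumes the values are hashable).
        left_rows = {tuple(row.get(c) for c in shared) for row in left}
        right_rows = {tuple(row.get(c) for c in shared) for row in right}
        all_rows = left_rows | right_rows
        row_similarity = len(left_rows & right_rows) / len(all_rows) if all_rows else 1.0
    else:
        # Without shared columns there is nothing to compare row by row.
        row_similarity = 0.0

    return {
        "shared_columns": shared,
        "column_similarity": column_similarity,
        "row_similarity": row_similarity,
    }


# Made-up example data: 3 shared columns, different row and column counts.
table_a = [
    {"id": 1, "name": "a", "city": "x", "age": 10},
    {"id": 2, "name": "b", "city": "y", "age": 20},
    {"id": 3, "name": "c", "city": "z", "age": 30},
    {"id": 4, "name": "d", "city": "w", "age": 40},
]
table_b = [
    {"id": 1, "name": "a", "city": "x", "score": 0.1},
    {"id": 2, "name": "b", "city": "y", "score": 0.2},
    {"id": 5, "name": "e", "city": "v", "score": 0.3},
]

result = table_similarity(table_a, table_b)
print(f"columns: {result['column_similarity']:.0%}")  # columns: 60%
print(f"rows:    {result['row_similarity']:.0%}")     # rows:    40%
```

In this example the tables share 3 of 5 distinct column names, and 2 of the 5 distinct rows (restricted to the shared columns) appear in both tables. Note that duplicate rows are collapsed before comparing, which may or may not be what you want.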