Hi Team,
Would like to discuss on a best possible solution whilst comparing 2 csv (A.csv and B.csv) which in an ideal scenario will be identical unless upstream or data source may disrupt the structure. This is where we are trying to explore Knime capabilities to identify the differences and write some logs around it.
Problem statements:
- How to identify if someone added or removed a column in B.csv
- How to identify if someone changed the datatype for a particular column in B.csv
- How to identify if someone changed the order of columns B.csv
Trying to achieve with following approach:
- Read from 2 CSV reader flows (A.csv and B.csv) followed by âExtract column headerâ node followed by Transpose and Joiner nodes
- Table difference finder followed by Group By and Extract table dimension
Unfortunately, i can not upload the workflow, but i tried to explain with the node sequence. Hope it helps to visualize.
Please do suggest if there is a better way to approach this, as the results arenât promising.