Is there a way to have a node where the main data set (=dataset A) would be one input and the second (=dataset B) would be only with two columns (C and D).
Each time column X in dataset A contains a value from column C, this value would be replaced by the corresponding value in column D.
Thank you.
Bernard
Ps: Bonus question: is there a way to identify what records contain a many to many relationships within two specified columns?
the node you are looking for is the Cell Replacer and it does exactly what you are looking for.
Can you elaborate on your bonus question? I am not sure that I am getting it right.
For my bonus question: I would like to identify (or rather filter in a different flow) rows causing a many to many relationship between two columns that I specify.
Suppose a dataset with only 2 columns: “books” and “authors”:
Most of the time: a author writes one or many books. This is a one to many relationships.
There can a few books written by many authors.
Note: I’m assuming the case of a single book written by many authors has many records:
“Book A” “Author X”
“Book A” “Author Y”
“Book A” “Author Z”
I’m not assuming the following dataset:
“Book A” “Author X, Author Y, Author Z”.
These 3 records cause a many to many relationship. How can I identify them?
Is there a way to have a node where the main data set (=dataset X) would be two columns (A and B) and the second (=dataset Y) would also be with two columns (C and D).
Each time column A in dataset X contains a value from column C, column B would be replaced by the corresponding value of C in column D.
Use Group by with count by Book and Group by Author then filter in both groups records with counts greater then 1 and finally join filtered sets by Book and Author fields. The result will be your many to many.