Big Data Table Join

hi

I have two tables, one with 250 million records and one with 200 million records.
I want to join them on three columns.
The Joiner node gets stuck at 99%. What is the best approach I can use?
KNIME has 256 GB of RAM available for processing.

Thanks

Hi @qasemiiii1400,

I am pretty sure it is not stuck (forever), but is performing a large operation, such as writing the final output, and the progress reporting is just not great. Can you post a screenshot of what appears when you hover over the progress bar while it shows 99%?

To speed up the join, you can try moving any filters you apply later in the workflow to before the join, so that the input tables are smaller.
You can also try removing unused columns before the join to slim down the tables further.
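
To illustrate the idea outside of KNIME (in a workflow this would typically be a Row Filter and a Column Filter node placed before the Joiner), here is a minimal pandas sketch. It is only an illustration under assumptions: the function name `prune_then_left_join` and the column names `a`, `b`, `c`, `value`, `unused`, and `extra` are made up for the example and do not come from the original tables.

```python
import pandas as pd


def prune_then_left_join(left, right, keys, keep_left=(), keep_right=()):
    """Drop unneeded columns from both inputs, then left-join on the key columns.

    Hypothetical helper for illustration only; `keys` are the join columns,
    `keep_left` / `keep_right` are the extra columns still needed afterwards.
    """
    keys = list(keys)
    # Keep only the join keys plus the columns that are actually needed later.
    left_small = left[keys + list(keep_left)]
    right_small = right[keys + list(keep_right)]
    # Left join: every row of the left table is kept, unmatched rows get NaN.
    return left_small.merge(right_small, on=keys, how="left")


# Tiny made-up tables standing in for the two large inputs.
left = pd.DataFrame({
    "a": [1, 2, 3],
    "b": ["x", "y", "z"],
    "c": [10, 20, 30],
    "unused": ["p", "q", "r"],
})
right = pd.DataFrame({
    "a": [1, 3],
    "b": ["x", "z"],
    "c": [10, 30],
    "value": [100, 300],
    "extra": [0, 0],
})

joined = prune_then_left_join(left, right, ["a", "b", "c"], keep_right=["value"])
print(joined)
```

The columns `unused` and `extra` never enter the join, so less data has to be carried through it. Any row filter that does not depend on the join result can be applied to `left` or `right` in the same way before calling the helper.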

What does your join condition look like?

Best,
Manuel

I closed KNIME while it was at 99%, so I can’t take a screenshot now.

I don’t have any filter after the join output.
In each table there are only 3 columns that I want to compare, using a left join.
